Mindset is a research program that trains software to recognize commercial pages. One application of this tool is to exclude commercial pages from the result set.

Of course, if this is as good as spam filtering, people will only be partly happy with the results. And yes, there are many commercial Web sites trying to pass themselves off as non-commercial. Everyone is out to sell something, after all.

Interesting question: would you ever want to do the reverse, that is, exclude non-commercial content?

(Source: Turney.)
