Is this web page trying to sell me something?

Mindset is a research program to train software to recognize commercial pages. One application of this tool is that you can try to exclude commercial pages out of the result set.

Of course, if this is as good as spam filtering, people will only be partly happy with the results. And yes, there are many commercial Web sites trying to pass out as non-commercial. Everyone is out to sell something, afterall.

Interesting question: would you ever want to do the reverse, that is, exclude non-commercial content?

(Source: Turney.)

Published by

Daniel Lemire

A computer science professor at the University of Quebec (TELUQ).

Leave a Reply

Your email address will not be published. Required fields are marked *

To create code blocks or other preformatted text, indent by four spaces:

    This will be displayed in a monospaced font. The first four 
    spaces will be stripped off, but all other whitespace
    will be preserved.
    Markdown is turned off in code blocks:
     [This is not a link](

To create not a block, but an inline code span, use backticks:

Here is some inline `code`.

For more help see