Why can we predict tastes? There are several possible explanations:

  • Intrinsically, individuals have predictable tastes. To test this theory, we would need to isolate each individual. Collect their opinions. Then attempt to make predictions. (You also need to prevent the recommender system from giving feedback to the user!)
  • People are influenced by the perceived popularity of items. People tend to like a movie more when they think it is a blockbuster. A  more general statement is: Preferences are constructed in the process of elicitation.

Jon Dron linked to a 2007 New York Times article by Duncan J. Watts which claims that the second factor can easily dominate. The article is based on experiments. However, I have several arguments to support this claim.  Highly cited research papers are often no more interesting than regular papers. Except that you must also cite them, otherwise people will complain that you are missing a reference. The same is true with music or books. I often read novels based because many people read them: that is how, for example, I found la compagnie des glaces or Dune.

Hence,  collaborative filtering is a circular process:

  • People like certain items (mostly) because they perceive that similar people like them.
  • Collaborative filtering matches people with such items.

Of course, that is a bit pessimistic: people are also influenced by the intrinsic values. However, how can you differentiate the social perception from the intrinsic value of an item? Is this novel entertaining, or is it popular? Outside a laboratory, we cannot tell these factors apart!

How well would collaborative filtering work if individuals were isolated? It would work poorly. There are millions of books and research articles. Many are quite good. Without a social component to push us in some directions, we would have diverse tastes.

2 Comments »

  1. Nice, I didn’t realize you were a fan of the heuristics and biases literature. I recommend you read George Loewenstein and Dan Ariely if you haven’t already.

    http://sds.hss.cmu.edu/src/faculty/loewenstein.php

    http://web.mit.edu/ariely/www/MIT/papers.shtml

    Comment by Daniel Tunkelang — 31/12/2008 @ 10:05

  2. I disagree in the last paragraph. Diversity is essential to predict tastes. Having a “delta function” in the data (ratings -> correlation etc.) means no information, hence no prediction power. A broad distribution of tastes always leads to better results. To predict blockbusters is an easy task, but not very useful. I claim: collaborative filtering works because of divergent tastes. The more divergent the better.

    Comment by marcel — 7/1/2009 @ 4:30

Leave a comment

Warning: When entering a long comment, please ensure that you make copy of your text prior to submitting it. If the server should fail or if you hit a bug, you might lose your work. I am not responsible for your lost effort.

To spammers: I carefully review every single post and make sure that spam gets deleted. You are wasting your time if you are manually entering spam using this form. Read my terms of use to see what I consider to be abusive.

Example: duo plus septem is '9'. The numbers are expressed in latin numerals but you should give your answers using ordinary digits.

 

« Blog's main page

Powered by WordPress