I’m very proud of this little paper called Slope One Predictors for Online Rating-Based Collaborative Filtering. The paper reports on some of the core collaborative filtering research leading to the inDiscover web site. I’ll be presenting it at SIAM Data Mining 2005 in April (Newport Beach, California).

This is a case where Anna Maclachlan and I did something that few researchers do these days: we looked for something simpler. The main result of the paper is that you can use extremely simple, easy-to-implement algorithms and get very competitive results.
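To give a sense of how little code is involved, here is a minimal Python sketch of the weighted Slope One scheme as it is commonly described: for each pair of items, average the rating difference over the users who rated both, then predict a missing rating by adding those average differences to the ratings the user did give, weighting each by the number of supporting users. The function names (`train`, `predict`), the nested-dict data layout, and the toy ratings are assumptions made for illustration; this is not the code from the paper or from inDiscover.

```python
from collections import defaultdict


def train(ratings):
    """Compute average rating differences between item pairs.

    ratings: {user: {item: rating}} (hypothetical layout).
    Returns (dev, count), where dev[(j, i)] is the mean of r_j - r_i over
    the users who rated both j and i, and count[(j, i)] is how many did.
    """
    diff_sum = defaultdict(float)
    count = defaultdict(int)
    for user_ratings in ratings.values():
        for j, r_j in user_ratings.items():
            for i, r_i in user_ratings.items():
                if i != j:
                    diff_sum[(j, i)] += r_j - r_i
                    count[(j, i)] += 1
    dev = {pair: diff_sum[pair] / count[pair] for pair in diff_sum}
    return dev, dict(count)


def predict(user_ratings, target, dev, count):
    """Predict one user's rating of `target` (weighted variant).

    Returns None when no item rated by the user co-occurs with `target`.
    """
    num = 0.0
    den = 0
    for i, r_i in user_ratings.items():
        c = count.get((target, i), 0)
        if i != target and c > 0:
            num += (dev[(target, i)] + r_i) * c
            den += c
    return num / den if den else None


# Toy data, invented for this example.
ratings = {
    "john": {"A": 5, "B": 3, "C": 2},
    "mark": {"A": 3, "B": 4},
    "lucy": {"B": 2, "C": 5},
}
dev, count = train(ratings)
print(predict(ratings["lucy"], "A", dev, count))
```

On this toy data, two users rated both A and B (average difference 0.5) and one rated both A and C (difference 3), so the prediction for lucy on A is (2 × 2.5 + 1 × 8) / 3, roughly 4.33.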

The current trend in academia is to develop crazy algorithms that require not 10 lines of code, not 100 lines of code, but several thousand. I think the same is true in some industries: think of Web Services or Java (with the infinite number of new acronyms).

Well, I like complex algorithms and, as a math guy, I like a challenge, but once in a while I think it pays to go “wait! what if the average Joe wants to implement this?”

So, if you write real code and are interested in collaborative filtering, go check out this paper.

8 Comments »

  1. Simple collaborative filtering
    Daniel Lemire has a paper out on a new, simple algorithm for collaborative filtering, which is the technology underlying the inDiscover.net free music site.

    Trackback by Seb's Open Research — 11/1/2005 @ 14:32

  2. You wouldn’t have open code for this lying anywhere, would you?

    Comment by Seb — 11/1/2005 @ 14:32

  3. Valid request. I’ll post something in the near future. I do have lots of code, but it is Java code, I think I should post SQL code.

    Comment by Daniel Lemire — 11/1/2005 @ 19:45

  4. Yes, some SQL code would be fantastic! Maybe with some pseudo-code for some of the logic… Something us non-math geeks can understand :)

    Comment by Peter — 12/1/2005 @ 9:26

  5. oh oh, looks like I’m not getting any sleep soon!

    Comment by Robin — 13/1/2005 @ 18:28

  6. Its beauty is its simplicity.

    Great paper!

    Comment by Dr. E. Garcia — 2/7/2005 @ 14:20

  7. For code samples, go there

    http://www.daniel-lemire.com/fr/abstracts/TRD01.html

    Comment by Daniel Lemire — 2/7/2005 @ 14:23

  8. [...] Slope One Predictors for Online Rating-Based Collaborative Filtering (SDM’05 / April 20-23th 2005) [...]

    Pingback by Daniel Lemire’s blog » My most commented posts so far — 2/1/2006 @ 23:45
