Blogosphere and Time Series

Though blogpulse seems to be going nowhere, as far as I can see, it is still one of the most fascinating tool out there. What it does is plot word occurrences versus time on the blogosphere. The recall is rather poor compared to Technorati but the time series plot are very nice.

Here’s one comparative plot that a student in my Information Retrieval course (Mahmoud El-Bachir) has submitted:

You can see clearly when Christmas is (Noël in French) and when the new year is… I think you also have the Chinese New Year too! (Seek the smaller bump).

My only beef is that I do not have access to the raw data: it would be really cool to build applications on top of blogpulse, but I guess it goes against their business model.

Published by

Daniel Lemire

A computer science professor at the University of Quebec (TELUQ).

Leave a Reply

Your email address will not be published. Required fields are marked *

To create code blocks or other preformatted text, indent by four spaces:

    This will be displayed in a monospaced font. The first four 
    spaces will be stripped off, but all other whitespace
    will be preserved.
    Markdown is turned off in code blocks:
     [This is not a link](

To create not a block, but an inline code span, use backticks:

Here is some inline `code`.

For more help see