Kunal Anand: Some XML exam questions

Kunal has almost picked up my challenge on his blog: come up with deep homework questions having to do with XML.

  • Given at least 10 blog/link feeds, determine the top ten outbound URLs?
  • Parse an iTunes library file and capture all the unique artist/albums.
  • Given a user’s XML file from del.icio.us, determine the top 10 intersecting tags.
  • Scrape a dynamic list from a web site (i.e. the Google Zeitgeist) and serialize a well-formed Atom feed.

The last one seems like mostly hard labour probably requiring quite a bit of fiddling.

The other ones are all interesting because they are examples of aggregation and that’s not trivial to do in XSLT/XPath. Naturally, Kunal suggests to solve these problems using a nice script language like Python, but solving them in XSLT is much more fun because it is harder.

Published by

Daniel Lemire

A computer science professor at the Université du Québec (TELUQ).

Leave a Reply

Your email address will not be published. Required fields are marked *

To create code blocks or other preformatted text, indent by four spaces:

    This will be displayed in a monospaced font. The first four 
    spaces will be stripped off, but all other whitespace
    will be preserved.
    
    Markdown is turned off in code blocks:
     [This is not a link](http://example.com)

To create not a block, but an inline code span, use backticks:

Here is some inline `code`.

For more help see http://daringfireball.net/projects/markdown/syntax