Good real-world examples of map/reduce/filter

I'm trying to find some good real-world uses of map-reduce for a demo.

The target audience would be cluster people, so they already know parallel algorithms well.

Any good ideas?

See "MapReduce: Simplified Data Processing on Large Clusters"

Have you looked at the paper on Google's implementation of MapReduce (Dean & Ghemawat, OSDI'04)?
They list several examples:

  • Distributed Grep
  • Count of URL Access Frequency
  • Reverse Web-Link Graph
  • Term-Vector per Host
  • Inverted Index
  • Distributed Sort
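
For a demo, a few of these can be sketched on a single machine before showing the cluster version. The snippet below is only a rough illustration, not the paper's implementation: the `map_reduce` helper, the log-line format, and the sample documents are all made up for the example, and the "shuffle" step is just an in-memory dictionary.

```python
from collections import defaultdict
from itertools import chain


def map_reduce(records, mapper, reducer):
    """Hypothetical single-process helper: map, group by key, then reduce."""
    groups = defaultdict(list)
    # Map phase: each record may emit any number of (key, value) pairs.
    for key, value in chain.from_iterable(map(mapper, records)):
        groups[key].append(value)  # Stand-in for the shuffle: group by key.
    # Reduce phase: collapse each key's list of values into one result.
    return {key: reducer(key, values) for key, values in groups.items()}


# Count of URL access frequency (assumed log format: "METHOD URL").
log_lines = ["GET /index.html", "GET /about.html", "GET /index.html"]


def url_mapper(line):
    yield line.split()[1], 1


url_counts = map_reduce(log_lines, url_mapper, lambda url, ones: sum(ones))

# Inverted index (assumed input: (doc_id, text) pairs).
documents = [("d1", "map reduce"), ("d2", "reduce filter")]


def index_mapper(doc):
    doc_id, text = doc
    for word in text.split():
        yield word, doc_id


inverted_index = map_reduce(
    documents, index_mapper, lambda word, ids: sorted(set(ids))
)

# Grep is the filter case: keep only the lines that match a pattern.
grep_matches = list(filter(lambda line: "index" in line, log_lines))

print(url_counts)
print(inverted_index)
print(grep_matches)
```

Because each mapper call only looks at one record and each reducer call only looks at one key, the audience can see where the work would be split across nodes.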