Thursday, June 18, 2009

Hadoop and Distributed Computing at Yahoo!

Apache Hadoop* is an open source Java software framework for running data-intensive applications on large clusters of commodity hardware. Hadoop, which was invented by Doug Cutting (now a Yahoo! employee), is a top level Apache project. It relies on an active community of contributors from all over the world for its success.

Hadoop implements two important elements. The first is a computational paradigm called Map/Reduce, which takes an application and divides it into multiple fragments of work, each of which can be executed on any node in the cluster. The second is a distributed file system called HDFS. HDFS stores data on nodes in the cluster with the goal of providing greater bandwidth across the cluster.

1 comment:

  1. I stumbled upon your site at a free advertising campaign. I would like to inform you that my company, Geo Mixers, is now offering a free traffic system where the hits ratio is 1 to 1. If you feel like you have been wasting your time on other advertisements, or even have seen how some of them work, I would like to invite you to come check the new and revolutionary system for free. Please go and register at http://www.geomixers.com to start boosting your site!

    Or you can learn more about us here:
    http://www.youtube.com/watch?v=0YSoi13tll0

    Thank You for your time,
    The Geo Mixers Team.

    ReplyDelete