Thursday, June 18, 2009

Hadoop and Distributed Computing at Yahoo!

Apache Hadoop* is an open source Java software framework for running data-intensive applications on large clusters of commodity hardware. Hadoop, which was invented by Doug Cutting (now a Yahoo! employee), is a top level Apache project. It relies on an active community of contributors from all over the world for its success.

Hadoop implements two important elements. The first is a computational paradigm called Map/Reduce, which takes an application and divides it into multiple fragments of work, each of which can be executed on any node in the cluster. The second is a distributed file system called HDFS. HDFS stores data on nodes in the cluster with the goal of providing greater bandwidth across the cluster.

1 comment:

  1. I stumbled upon your site at a free advertising campaign. I would like to inform you that my company, Geo Mixers, is now offering a free traffic system where the hits ratio is 1 to 1. If you feel like you have been wasting your time on other advertisements, or even have seen how some of them work, I would like to invite you to come check the new and revolutionary system for free. Please go and register at to start boosting your site!

    Or you can learn more about us here:

    Thank You for your time,
    The Geo Mixers Team.