Yahoo to distribute its version of Hadoop
Yahoo announced plans Wednesday to release an open-source version of its take on Hadoop, a grid-computing framework used to run many parts of its business.
Yahoo is a major force in the development of Hadoop, which is principally overseen by the Apache Software Foundation. Hadoop is essentially an open-source version of the software Google uses to run its Web indexing servers, and Yahoo uses it for much the same purpose internally.
Hadoop runs on tens of thousands of servers inside Yahoo, said Nigel Daley, quality and release engineering manager for Yahoo Grid Technologies, in a blog post Wednesday. That's a much larger implementation than other companies and organizations might wish to deploy, but at the same time they would like to benefit from the reliability tweaks that Yahoo has made to Hadoop in order to support its enormous Web properties.
"This distribution is largely a response to the numerous requests that we have received to share Yahoo!'s internally tested and scale-proven releases," Daley wrote. The code is available for download immediately here on Yahoo's site.
Tom Krazit writes about the ever-expanding world of Internet search, including Google, Yahoo, online advertising, and portals, as well as the evolution of mobile computing. He has written about traditional PC companies, chip manufacturers, and mobile computers, spending the last three years covering Apple. E-mail Tom. 



Could this also be a sign that yahoo doesn't want to make search the backbone of its services?
- by lonestarState June 11, 2009 7:41 AM PDT
- Yahoo has released Hadoop enhancements to the Apache foundation. Life is a fork so why not release their own version. Case in point: LINUX distros
- Reply to this comment
-
(4 Comments)