• Home
  • Blog
  • About
  • Contact
CloudAve
Software in Business. The Business of Software.
  • Business
    • Analysis
    • Entrepreneurship
    • Marketing
    • Strategy
    • Small business
  • Technology
    • Application Software
    • Infrastructure
    • Open Source
    • Mobile
    • Platforms
    • Product reviews
    • Security
  • Misc
    • Design
    • Just for fun
    • Trends & Concepts
  • Your POV
  • Sponsors
Browse: Home / Yahoo! Releases Its Own Hadoop Distribution

Yahoo! Releases Its Own Hadoop Distribution

By Guest Authors on June 10, 2009

hadoopYahoo! is releasing its own distribution of Hadoop:

Hadoop is a distributed file system and parallel execution environment that enables its users to process massive amounts of data.
In response to frequent requests from the Hadoop community, Yahoo! is opening up its investment in Hadoop quality engineering to benefit the larger ecosystem and to increase the pace of innovation around open and collaborative research and development.
The Yahoo! Distribution of Hadoop has been tested and deployed at Yahoo! on the largest Hadoop clusters in the world.

Hadoop is free Java software framework born out of an open-source implementation of Google’s published computing infrastructure and fostered within the Apache Software Foundation.
Yahoo! has been the primary developer and contributor to Apache’s Hadoop.
In 2006, Hadoop founder Doug Cutting joined Yahoo, which provided a dedicated team and resources, to lead the project of developing the open-source software and turn Hadoop into a system that ran at web scale. Today, Yahoo! is running the largest Hadoop cluster in the world, which includes more than 25,000 servers and provides the framework for many Yahoo properties including Yahoo Search, Yahoo Mail, and several content and ad services.

Yahoo says its opening up the source code to Hadoop to “benefit the larger ecosystem increase the pace of innovation around open and collaborative research and development.”.
As Nigel Daley, Quality and Release Engineering Manager at Yahoo! Grid Technologies, summarizes:

Hadoop is helping us solve key science and research problems in hours or days instead of months. It provides us a platform to solve extreme problems requiring massive amounts of data processing. It underpins major revenue-generating systems. Opening our distribution enables a faster pace of innovation for the entire Hadoop ecosystem and broadens the use — and ultimately the quality — of this key platform across the industry.

Related articles by
Zemanta
  • Yahoo!
    exposes very own stuffed elephant code
  • Digging
    Deeper Into Data With Hadoop
  • Cloudera
    CEO: Hadoop Will Go Beyond Web Apps
  • More Than Enterprise Scale, It Is Science Scale – Cloudera And Hadoop Are Shaking The Marketplace
  • Cloudera – Redhat of Cloud Computing?
Post Views: 0

Share:

  • Twitter
  • Facebook
  • LinkedIn

Posted in General | Tagged Apache, hadoop, yahoo, YHOO

Guest Authors

« Previous Next »
feed mail facebook twitter linkedin

Popular Posts

  • Cutting Through the Fog of Cloud Computing Definitions
  • Home
  • Rate your company's innovation culture
  • Global Intranet Trends for 2011 (part 1)
  • Amazon, Google, Facebook, Microsoft, Apple, and Oracle Org Chart
  • The Documentation Maturity Model (infographic)
  • Prank OS. Chrome Prank. Prank Chrome. Whatever. It Worked.
  • Hype Cycle for Emerging Technologies 2011: Idea Management Enlightenment

Archives

Authors

  • Adron Hall
  • Chirag Mehta
  • Christian Reilly
  • Dan Morrill
  • Derek Pilling
  • Hutch Carpenter
  • Jarret Pazahanick
  • Jason M. Lemkin
  • Joel York
  • John Taschek
  • Krishnan Subramanian
  • Mark Suster
  • Michael Krigsman
  • Ofir Nachmani
  • Paul Miller
  • Quinton Wall
  • Randy Bias
  • Robert Duffner
  • Sadagopan
  • wprss
  • Zoli Erdos
Sponsored by: