Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
Hadoop: The Definitive Guide

Buy
Hadoop: The Definitive Guide, 9781449311520 (1449311520), O'Reilly, 2012
Hadoop got its start in Nutch. A few of us were attempting to build an open source web search engine and having trouble managing computations running on even a handful of computers. Once Google published its GFS and MapReduce papers, the route became clear. They’d devised systems to solve precisely the problems we were having with Nutch. So we started, two of us, half-time, to try to re-create these systems as a part of Nutch.

We managed to get Nutch limping along on 20 machines, but it soon became clear that to handle the Web’s massive scale, we’d need to run it on thousands of machines and, moreover, that the job was bigger than two half-time developers could handle.

Around that time, Yahoo! got interested, and quickly put together a team that I joined. We split off the distributed computing part of Nutch, naming it Hadoop. With the help of Yahoo!, Hadoop soon grew into a technology that could truly scale to the Web.

In 2006, Tom White started contributing to Hadoop. I already knew Tom through an excellent article he’d written about Nutch, so I knew he could present complex ideas in clear prose. I soon learned that he could also develop software that was as pleasant to read as his prose.

From the beginning, Tom’s contributions to Hadoop showed his concern for users and for the project. Unlike most open source contributors, Tom is not primarily interested in tweaking the system to better meet his own needs, but rather in making it easier for anyone to use.

Initially, Tom specialized in making Hadoop run well on Amazon’s EC2 and S3 services. Then he moved on to tackle a wide variety of problems, including improving the MapReduce APIs, enhancing the website, and devising an object serialization framework. In all cases, Tom presented his ideas precisely. In short order, Tom earned the role of Hadoop committer and soon thereafter became a member of the Hadoop Project Management Committee.

Tom is now a respected senior member of the Hadoop developer community. Though he’s an expert in many technical corners of the project, his specialty is making Hadoop easier to use and understand.
(HTML tags aren't allowed.)

Euclidean & Non-Euclidean Geometries: Development and History
Euclidean & Non-Euclidean Geometries: Development and History

This is the definitive presentation of the history, development and philosophical significance of non-Euclidean geometry as well as of the rigorous foundations for it and for elementary Euclidean geometry, essentially according to Hilbert. Appropriate for liberal arts students, prospective high school teachers, math. majors, and even bright high...

Wireshark Essentials
Wireshark Essentials

Get up and running with Wireshark to analyze network packets and protocols effectively

About This Book

  • Troubleshoot problems, identify security risks, and measure key application performance metrics with Wireshark
  • Gain valuable insights into the network and application protocols, and the...
Network Performance and Security: Testing and Analyzing Using Open Source and Low-Cost Tools
Network Performance and Security: Testing and Analyzing Using Open Source and Low-Cost Tools

Network Performance Security: Testing and Analyzing Using Open Source and Low-Cost Tools gives mid-level IT engineers the practical tips and tricks they need to use the best open source or low cost tools available to harden their IT infrastructure. The book details how to use the tools and how to interpret them. Network Performance...


Guide to Elliptic Curve Cryptography (Springer Professional Computing)
Guide to Elliptic Curve Cryptography (Springer Professional Computing)
The study of elliptic curves by algebraists, algebraic geometers and number theorists
dates back to the middle of the nineteenth century. There now exists an extensive literature
that describes the beautiful and elegant properties of these marvelous objects. In
1984, Hendrik Lenstra described an ingenious algorithm for factoring
...
Zope Web Application Construction Kit
Zope Web Application Construction Kit

Zope is one of the leading open-source Web application servers and content management systems. Designed specifically to publish dynamic content, it is a Web-based publishing system built around a 100% object framework for rapidly deploying enterprise-class content management solutions.

...
The New How [Paperback]: Creating Business Solutions Through Collaborative Strategy
The New How [Paperback]: Creating Business Solutions Through Collaborative Strategy

What people are saying about The New How
 

"How are you going to get rid of your Air Sandwich if you don't even know what it is? Provocative and practical at the same time."
--Seth Godin, author of Linchpin

"The New How is informative and...

©2021 LearnIT (support@pdfchm.net) - Privacy Policy