Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
Hadoop: The Definitive Guide

Buy
Hadoop: The Definitive Guide, 9780596521974 (0596521979), O'Reilly, 2009
Hadoop: The Definitive Guide helps you harness the power of your data. Ideal for processing large datasets, the Apache Hadoop framework is an open source implementation of the MapReduce algorithm on which Google built its empire. This comprehensive resource demonstrates how to use Hadoop to build reliable, scalable, distributed systems: programmers will find details for analyzing large datasets, and administrators will learn how to set up and run Hadoop clusters. Complete with case studies that illustrate how Hadoop solves specific problems, this book helps you:

Use the Hadoop Distributed File System (HDFS) for storing large datasets, and run distributed computations over those datasets using MapReduce Become familiar with Hadoop's data and I/O building blocks for compression, data integrity, serialization, and persistence Discover common pitfalls and advanced features for writing real-world MapReduce programs Design, build, and administer a dedicated Hadoop cluster, or run Hadoop in the cloud Use Pig, a high-level query language for large-scale data processing Take advantage of HBase, Hadoop's database for structured and semi-structured data Learn ZooKeeper, a toolkit of coordination primitives for building distributed systems

If you have lots of data -- whether it's gigabytes or petabytes -- Hadoop is the perfect solution. Hadoop: The Definitive Guide is the most thorough book available on the subject. "Now you have the opportunity to learn about Hadoop from a master-not only of the technology, but also of common sense and plain talk." -- Doug Cutting, Hadoop Founder, Yahoo!

About the Author

Tom White has been an Apache Hadoop committer since February 2007, and is a member of the Apache Software Foundation. He works for Cloudera, a company set up to offer Hadoop support and training. Previously he was as an independent Hadoop consultant, working with companies to set up, use, and extend Hadoop. He has written numerous articles for O'Reilly, java.net and IBM's developerWorks, and has spoken at several conferences, including at ApacheCon 2008 on Hadoop. Tom has a Bachelor's degree in Mathematics from the University of Cambridge and a Master's in Philosophy of Science from the University of Leeds, UK.

(HTML tags aren't allowed.)

Tech Job Hunt Handbook: Career Management for Technical Professionals
Tech Job Hunt Handbook: Career Management for Technical Professionals

Tech Job Hunt Handbook is a career management book—just for technology professionals—that reflects today’s new economic realities. The world of work is constantly changing, and staying professionally relevant while competing for more specialized tech jobs in areas like cloud computing, mobile and social...

Graphics of Large Datasets: Visualizing a Million (Statistics and Computing)
Graphics of Large Datasets: Visualizing a Million (Statistics and Computing)
Graphics are great for exploring data, but how can they be used for looking at the large datasets that are commonplace to-day? This book shows how to look at ways of visualizing large datasets, whether large in numbers of cases or large in numbers of variables or large in both. Data visualization is useful for data cleaning, exploring data,...
Testable JavaScript
Testable JavaScript
You have to test your code, so why not make the process as easy and painless as possible? Client-side JavaScript is especially difficult to test properly, as we have very little control over the environment within which our code runs. Multiple operating systems, multiple versions of operating systems, multiple...

The Java Tutorial: A Short Course on the Basics, 4th Edition (The Java Series)
The Java Tutorial: A Short Course on the Basics, 4th Edition (The Java Series)

A hands-on guide to the Java programming language, The Java™ Tutorial, Fourth Edition is perfect for any developer looking for a proven path to proficiency with Java SE. This popular tutorial "from the Source" has been completely revised and...

Hack the Stack: Using Snort and Ethereal to Master the 8 Layers of an Insecure Network
Hack the Stack: Using Snort and Ethereal to Master the 8 Layers of an Insecure Network

This book looks at network security in a new and refreshing way. It guides readers step-by-step through the "stack" -- the seven layers of a network. Each chapter focuses on one layer of the stack along with the attacks, vulnerabilities, and exploits that can be found at that layer. The book even includes a chapter on the mythical...

Real-Life Math: Everyday Use of Mathematical Concepts
Real-Life Math: Everyday Use of Mathematical Concepts
"What does this have to do with real life?" is a question that plagues mathematics teachers across America, as students are confronted with abstract topics in their high school mathematics courses. The National Council of Teachers of Mathematics emphasizes the importance of making real world connections in teaching mathematics so that...
©2021 LearnIT (support@pdfchm.net) - Privacy Policy