Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
Hadoop in Practice

Buy
Hadoop in Practice, 9781617290237 (1617290238), Manning Publications, 2012

Summary

Hadoop in Practice collects 85 Hadoop examples and presents them in a problem/solution format. Each technique addresses a specific task you'll face, like querying big data using Pig or writing a log file loader. You'll explore each problem step by step, learning both how to build and deploy that specific solution along with the thinking that went into its design. As you work through the tasks, you'll find yourself growing more comfortable with Hadoop and at home in the world of big data.

About the Technology

Hadoop is an open source MapReduce platform designed to query and analyze data distributed across large clusters. Especially effective for big data systems, Hadoop powers mission-critical software at Apple, eBay, LinkedIn, Yahoo, and Facebook. It offers developers handy ways to store, manage, and analyze data.

About the Book

Hadoop in Practice collects 85 battle-tested examples and presents them in a problem/solution format. It balances conceptual foundations with practical recipes for key problem areas like data ingress and egress, serialization, and LZO compression. You'll explore each technique step by step, learning how to build a specific solution along with the thinking that went into it. As a bonus, the book's examples create a well-structured and understandable codebase you can tweak to meet your own needs.

This book assumes the reader knows the basics of Hadoop.

Purchase of the print book comes with an offer of a free PDF, ePub, and Kindle eBook from Manning. Also available is all code from the book.

What's Inside
  • Conceptual overview of Hadoop and MapReduce
  • 85 practical, tested techniques
  • Real problems, real solutions
  • How to integrate MapReduce and R
Table of Contents
PART 1 BACKGROUND AND FUNDAMENTALS
PART 2 DATA LOGISTICS
PART 3 BIG DATA PATTERNS
PART 4 DATA SCIENCE
PART 5 TAMING THE ELEPHANT
  1. Hadoop in a heartbeat
  2. Moving data in and out of Hadoop
  3. Data serialization?working with text and beyond

  4. Applying MapReduce patterns to big data
  5. Streamlining HDFS for big data

  6. Diagnosing and tuning performance problems
  7. Utilizing data structures and algorithms
  8. Integrating R and Hadoop for statistics and more
  9. Predictive analytics with Mahout
  10. Hacking with Hive
  11. Programming pipelines with Pig

  12. Crunch and other technologies
  13. Testing and debugging
(HTML tags aren't allowed.)

PC Interfacing and Data Acquisition: Techniques for Measurement, Instrumentation and Control.
PC Interfacing and Data Acquisition: Techniques for Measurement, Instrumentation and Control.
Until fairly recently most scientific data-gathering systems and industrial control procedures were based on electromechanical devices such as chart recorders and analogue gauges. The capability to process and analyse data was rather limited (and in some cases error prone) unless one had access to a minicomputer or mainframe. Today, that situation...
Open Source Geospatial Tools: Applications in Earth Observation (Earth Systems Data and Models)
Open Source Geospatial Tools: Applications in Earth Observation (Earth Systems Data and Models)

This book focuses on the use of open source software for geospatial analysis. It demonstrates the effectiveness of the command line interface for handling both vector, raster and 3D geospatial data. Appropriate open-source tools for data processing are clearly explained and discusses how they can be used to solve everyday tasks.

A...

Fitness For Dummies
Fitness For Dummies

The latest and greatest in getting fit and staying that way!

Fitness For Dummies, 4th Edition, provides the latest information and advice for properly shaping, conditioning, and strengthening your body to enhance overall fitness and health. With the help of fitness professionals Suzanne Schlosberg and Liz Neporent,...


Easy Office 2003
Easy Office 2003

Easy Microsoft Office 2003 takes the work out of learning this new software application by using short, easy-to-follow lessons that show you how to accomplish basic tasks quickly and efficiently! It is the perfect book for beginners who want to learn the Office 2003 applications through a visual, full-color approach. More than 100 hands-on...

HTML5 Media
HTML5 Media

Flash is dead.

At least, that’s what we’re told: thanks to the introduction of the HTML5 video and audio elements, Flash is now dead.

Of course, we know this statement isn’t true: Flash will have its place in web pages for many years to come. However, thanks to the new HTML5 media...

Learning PHP, MySQL & JavaScript: With jQuery, CSS & HTML5 (Learning Php, Mysql, Javascript, Css & Html5)
Learning PHP, MySQL & JavaScript: With jQuery, CSS & HTML5 (Learning Php, Mysql, Javascript, Css & Html5)
The fully revised, updated and extended 4th edition of the hugely popular web development book - includes CSS, HTML5, jQuery and the mysqli extension.

Build interactive, data-driven websites with the potent combination of open-source technologies and web standards, even if you only have basic HTML knowledge. With
...
©2021 LearnIT (support@pdfchm.net) - Privacy Policy