Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
Hadoop in Practice

Buy
Hadoop in Practice, 9781617292224 (1617292222), Manning Publications, 2014

Summary

Hadoop in Practice, Second Edition provides over 100 tested, instantly useful techniques that will help you conquer big data, using Hadoop. This revised new edition covers changes and new features in the Hadoop core architecture, including MapReduce 2. Brand new chapters cover YARN and integrating Kafka, Impala, and Spark SQL with Hadoop. You'll also get new and updated techniques for Flume, Sqoop, and Mahout, all of which have seen major new versions recently. In short, this is the most practical, up-to-date coverage of Hadoop available anywhere.

Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.

About the Book

It's always a good time to upgrade your Hadoop skills! Hadoop in Practice, Second Edition provides a collection of 104 tested, instantly useful techniques for analyzing real-time streams, moving data securely, machine learning, managing large-scale clusters, and taming big data using Hadoop. This completely revised edition covers changes and new features in Hadoop core, including MapReduce 2 and YARN. You'll pick up hands-on best practices for integrating Spark, Kafka, and Impala with Hadoop, and get new and updated techniques for the latest versions of Flume, Sqoop, and Mahout. In short, this is the most practical, up-to-date coverage of Hadoop available.

Readers need to know a programming language like Java and have basic familiarity with Hadoop.

What's Inside

  • Thoroughly updated for Hadoop 2
  • How to write YARN applications
  • Integrate real-time technologies like Storm, Impala, and Spark
  • Predictive analytics using Mahout and RR
  • Readers need to know a programming language like Java and have basic familiarity with Hadoop.

About the Author

Alex Holmes works on tough big-data problems. He is a software engineer, author, speaker, and blogger specializing in large-scale Hadoop projects.

Table of Contents

PART 1 BACKGROUND AND FUNDAMENTALS
PART 2 DATA LOGISTICS
PART 3 BIG DATA PATTERNS
PART 4 BEYOND MAPREDUCE
  1. Hadoop in a heartbeat
  2. Introduction to YARN
  3. Data serialization—working with text and beyond
  4. Organizing and optimizing data in HDFS
  5. Moving data into and out of Hadoop
  6. Applying MapReduce patterns to big data
  7. Utilizing data structures and algorithms at scale
  8. Tuning, debugging, and testing
  9. SQL on Hadoop
  10. Writing a YARN application
(HTML tags aren't allowed.)

Hadoop Operations
Hadoop Operations
Over the past few years, there has been a fundamental shift in data storage, management, and processing. Companies are storing more data from more sources in more formats than ever before. This isn’t just about being a “data packrat” but rather building products, features, and intelligence predicated on knowing more about...
Learning Spark: Lightning-Fast Big Data Analysis
Learning Spark: Lightning-Fast Big Data Analysis

Data in all domains is getting bigger. How can you work with it efficiently? Recently updated for Spark 1.3, this book introduces Apache Spark, the open source cluster computing system that makes data analytics fast to write and fast to run. With Spark, you can tackle big datasets quickly through simple APIs in Python, Java,...

Apache Hive Essentials
Apache Hive Essentials

Immerse yourself on a fantastic journey to discover the attributes of big data by using Hive

About This Book

  • Discover how Hive can coexist and work with other tools in the Hadoop ecosystem to create big data solutions
  • Grasp the skills needed, learn the best practices, and avoid the...

Splunk Operational Intelligence Cookbook
Splunk Operational Intelligence Cookbook

Over 70 practical recipes to gain operational data intelligence with Splunk Enterprise

About This Book

  • Learn how to use Splunk to effectively gather, analyze, and report on the operational data across your environment
  • Expedite your operational intelligence reporting, be empowered to...
Republic of Lies: American Conspiracy Theorists and Their Surprising Rise to Power
Republic of Lies: American Conspiracy Theorists and Their Surprising Rise to Power

A riveting tour through the landscape and meaning of modern conspiracy theories, exploring the causes and tenacity of this American malady, from Birthers to Pizzagate and beyond.

American society has always been fertile ground for conspiracy theories, but with the election of Donald Trump, previously
...

Implementing Splunk: Big Data Reporting and Development for Operational Intelligence
Implementing Splunk: Big Data Reporting and Development for Operational Intelligence

Splunk is a data collection, indexing, and visualization engine for operational intelligence. It's a powerful and versatile search and analysis engine that lets you investigate, troubleshoot, monitor, alert, and report on everything that's happening in your entire IT infrastructure from one location in real time. Splunk collects,...

©2019 LearnIT (support@pdfchm.net) - Privacy Policy