Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
Field Guide to Hadoop: An Introduction to Hadoop, Its Ecosystem, and Aligned Technologies

Buy

If your organization is about to enter the world of big data, you not only need to decide whether Apache Hadoop is the right platform to use, but also which of its many components are best suited to your task. This field guide makes the exercise manageable by breaking down the Hadoop ecosystem into short, digestible sections. You’ll quickly understand how Hadoop’s projects, subprojects, and related technologies work together.

Each chapter introduces a different topic—such as core technologies or data transfer—and explains why certain components may or may not be useful for particular needs. When it comes to data, Hadoop is a whole new ballgame, but with this handy reference, you’ll have a good grasp of the playing field.

Topics include:

  • Core technologies—Hadoop Distributed File System (HDFS), MapReduce, YARN, and Spark
  • Database and data management—Cassandra, HBase, MongoDB, and Hive
  • Serialization—Avro, JSON, and Parquet
  • Management and monitoring—Puppet, Chef, Zookeeper, and Oozie
  • Analytic helpers—Pig, Mahout, and MLLib
  • Data transfer—Scoop, Flume, distcp, and Storm
  • Security, access control, auditing—Sentry, Kerberos, and Knox
  • Cloud computing and virtualization—Serengeti, Docker, and Whirr
(HTML tags aren't allowed.)

Collaborative Enterprise Architecture: Enriching EA with Lean, Agile, and Enterprise 2.0 practices
Collaborative Enterprise Architecture: Enriching EA with Lean, Agile, and Enterprise 2.0 practices

Ever-changing business needs have prompted large companies to rethink their enterprise IT. Today, businesses must allow interaction with their customers, partners, and employees at more touch points and at a depth never thought previously. At the same time, rapid advances in information technologies, like business digitization, cloud...

IP-Traffic Theory and Performance (Signals and Communication Technology)
IP-Traffic Theory and Performance (Signals and Communication Technology)
This book presents different approaches in IP traffic theory and classifies them, especially towards applications in the Internet. It comprises the state of the art in this area, which is currently presented only by numerous research papers and overview articles.

The book provides an ideal starting point for detailed studies of traffic analysis...

Sams Teach Yourself Networking in 24 Hours (4th Edition)
Sams Teach Yourself Networking in 24 Hours (4th Edition)
In just 24 sessions of one hour or less, learn how to use today’s key networking techniques and technologies to build, secure, and troubleshoot both wired and wireless networks. Using this book’s straightforward, step-by-step approach, you master every skill you need—from working with Ethernet and Bluetooth to spam prevention to...

The Vascular Endothelium I (Handbook of Experimental Pharmacology) (v. 1)
The Vascular Endothelium I (Handbook of Experimental Pharmacology) (v. 1)
It was with great pleasure that I accepted the invitation of Springer to edit this book.My association with the vascular endothelium covers a large part of my scientific career and, as with any good long-standing relationship, it has had moments of great excitement and periods of laborious construction. It has sometimes been...
A Guide to Lean Six Sigma Management Skills
A Guide to Lean Six Sigma Management Skills
Authored by Dr, Howard Gitlow, one of the most respected Six Sigma Master Black Belts, this well-organized volume demonstrates the implementation of quality improvements into the all areas of the workplace from the shop floor through a company™s executive offices. Illustrating his points with a number of case studies, the book provides a...
Apache Cookbook: Solutions and Examples for Apache Administrators
Apache Cookbook: Solutions and Examples for Apache Administrators
There's plenty of documentation on installing and configuring the Apache web server, but where do you find help for the day-to-day stuff, like adding common modules or fine-tuning your activity logging? That's easy. The new edition of the Apache Cookbook offers you updated solutions to the problems you're likely to encounter with the new versions...
©2021 LearnIT (support@pdfchm.net) - Privacy Policy