Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
Apache Flume: Distributed Log Collection for Hadoop - Second Edition

Buy

Design and implement a series of Flume agents to send streamed data into Hadoop

About This Book

  • Construct a series of Flume agents using the Apache Flume service to efficiently collect, aggregate, and move large amounts of event data
  • Configure failover paths and load balancing to remove single points of failure
  • Use this step-by-step guide to stream logs from application servers to Hadoop's HDFS

Who This Book Is For

If you are a Hadoop programmer who wants to learn about Flume to be able to move datasets into Hadoop in a timely and replicable manner, then this book is ideal for you. No prior knowledge about Apache Flume is necessary, but a basic knowledge of Hadoop and the Hadoop File System (HDFS) is assumed.

What You Will Learn

  • Understand the Flume architecture, and also how to download and install open source Flume from Apache
  • Follow along a detailed example of transporting weblogs in Near Real Time (NRT) to Kibana/Elasticsearch and archival in HDFS
  • Learn tips and tricks for transporting logs and data in your production environment
  • Understand and configure the Hadoop File System (HDFS) Sink
  • Use a morphline-backed Sink to feed data into Solr
  • Create redundant data flows using sink groups
  • Configure and use various sources to ingest data
  • Inspect data records and move them between multiple destinations based on payload content
  • Transform data en-route to Hadoop and monitor your data flows

In Detail

Apache Flume is a distributed, reliable, and available service used to efficiently collect, aggregate, and move large amounts of log data. It is used to stream logs from application servers to HDFS for ad hoc analysis.

This book starts with an architectural overview of Flume and its logical components. It explores channels, sinks, and sink processors, followed by sources and channels. By the end of this book, you will be fully equipped to construct a series of Flume agents to dynamically transport your stream data and logs from your systems into Hadoop.

A step-by-step book that guides you through the architecture and components of Flume covering different approaches, which are then pulled together as a real-world, end-to-end use case, gradually going from the simplest to the most advanced features.

(HTML tags aren't allowed.)

CCIE Professional Development Routing TCP/IP, Volume I, Second Edition
CCIE Professional Development Routing TCP/IP, Volume I, Second Edition

A detailed examination of interior routing protocols -- completely updated in a new edition

  • A complete revision of the best-selling first edition--widely considered a premier text on TCP/IP routing protocols

  • A core textbook for CCIE...

Production Systems Engineering: Cost and Performance Optimization
Production Systems Engineering: Cost and Performance Optimization

Optimize Economic and Technological Requirements in Production System Designs

This pioneering work offers proven techniques, partially created and developed at The Charles Stark Draper Laboratory, for determining optimal resource allocation and cost-effective production system designs for today’s any-volume...

RADIUS
RADIUS
RADIUS, or Remote Authentication Dial-In User Service, is a widely deployed protocol that enables companies to authenticate, authorize and account for remote users who want access to a system or service from a central network server. RADIUS provides a complete, detailed guide to the underpinnings of the RADIUS...

Building Web Services with Java: Making Sense of XML, SOAP, WSDL and UDDI
Building Web Services with Java: Making Sense of XML, SOAP, WSDL and UDDI
The Web services approach is the next step in the evolution of distributed computing. Based on open industry standards, Web services enable your software to integrate with partners and clients in a fashion that is loosely coupled, simple, and platform-independent. Building Web Services with Java: Making Sense of XML, SOAP,...
Spring Live
Spring Live
This book is written for Java developers familiar with web frameworks. Its main purpose is for Java developers to learn Spring and evaluate it against other frameworks. One of my hopes is to compare Spring to other web frameworks, or at least show how it can be integrated with other frameworks (i.e. Struts, WebWork, maybe even Tapestry down the...
Advances in Magnetism: From Molecules to Materials
Advances in Magnetism: From Molecules to Materials

In the past few years our understanding of magnetic behavior, once thought to be mature, has enjoyed a new impetus from contributions ranging from molecular chemistry, materials chemistry and sciences to solid-state physics. The book spans recent trends in magnetism for molecule - as well as inorganic-based materials, with emphasis on new...

©2019 LearnIT (support@pdfchm.net) - Privacy Policy