Beginning Apache Pig: Big Data Processing Made Easy

Beginning Apache Pig: Big Data Processing Made Easy, 9781484223369 (1484223365), Apress, 2016

Learn to use Apache Pig to develop lightweight big data applications easily and quickly. This book shows you many optimization techniques and covers every context where Pig is used in big data analytics. Beginning Apache Pig shows you how Pig is easy to learn and requires relatively little time to develop big data applications.

The book is divided into four parts: the complete features of Apache Pig; integration with other tools; how to solve complex business problems; and optimization of tools.

You'll discover topics such as MapReduce and why it cannot meet every business need; the features of Pig Latin, such as data types and the load, store, join, group, and order operations; how to create Pig workflows; how to submit Pig jobs using Hue; and how to work with Oozie. You'll also see how to extend the framework by writing UDFs and custom load, store, and filter functions. Finally, you'll cover different optimization techniques such as gathering statistics about a Pig script, joining strategies, parallelism, and the role of data formats in good performance.

What You Will Learn

• Use all the features of Apache Pig
• Integrate Apache Pig with other tools
• Extend Apache Pig
• Optimize Pig Latin code
• Solve different use cases for Pig Latin

Who This Book Is For

All levels of IT professionals: architects, big data enthusiasts, engineers, developers, and big data administrators

Amazing Books

A Variational Approach to Fracture and Other Inelastic Phenomena

Springer, 2013

This book presents a number of mathematical models of fracture, in order of increasing difficulty. All models are treated in a unified way, based on incremental energy minimization. They differ from each other in the assumptions made about the inelastic part of the total energy, here called the "cohesive energy". Each model describes a specific...

Mobile Commerce : Opportunities, Applications, and Technologies of Wireless Business

Cambridge University Press, 2001

This book provides the context, architectures, case studies, and intelligent analysis that will help the reader grasp the rapidly evolving subject of mobile commerce. May explains the technological aspects of mobile commerce to business decision makers and the business models to the technologists who design and build these electronic systems. It...

The Princeton Companion to Mathematics

Princeton University Press, 2008

This is a one-of-a-kind reference for anyone with a serious interest in mathematics. Edited by Timothy Gowers, a recipient of the Fields Medal, it presents nearly two hundred entries, written especially for this book by some of the world's leading mathematicians, that introduce basic mathematical tools and vocabulary; trace the...

Game Developer's Market Guide (Game Development)

Premier Press, 2003

This book is for everyone involved in game development and for those who want to break into the industry. Calling someone a “game developer” covers a lot of territory. A developer might be an artist making 3D models; a producer handling external development; a level designer or composer; a programmer or writer. The...

What Are Syndication Feeds

O'Reilly, 2005

When you enter the world of syndicated content, you're often faced with the question of what is the "proper" way to do syndication. While syndication feeds have become a standard tool on the Web--you've seen their signposts: a little orange button labeled XML in white letters, or maybe buttons that say Atom, RSS 2.0, RSS 1.0, or even...

Learn iOS 7 App Development

Apress, 2013

Learn iOS App Development is both a rapid tutorial and a useful reference. You'll quickly get up to speed with Objective-C, Cocoa Touch, and the iOS 7 SDK. It's an all-in-one getting started guide to building your first iPhone or iPad app. You'll learn best practices that ensure your code will be efficient and...