This guide is an ideal learning tool and reference for Apache Pig, the open source engine for executing parallel data flows on Hadoop. With Pig, you can batch-process data without having to create a full-fledged application—making it easy for you to experiment with new datasets.
Programming Pig introduces new users to Pig and provides experienced users with comprehensive coverage of key features such as the Pig Latin scripting language, the Grunt shell, and User Defined Functions (UDFs) for extending Pig. If you need to analyze terabytes of data, this book shows you how to do it efficiently with Pig.
Delve into Pig’s data model, including scalar and complex data types
Write Pig Latin scripts to sort, group, join, project, and filter your data
Use Grunt to work with the Hadoop Distributed File System (HDFS)
Build complex data processing pipelines with Pig’s macros and modularity features
Embed Pig Latin in Python for iterative processing and other advanced tasks
Create your own load and store functions to handle data formats and storage mechanisms
Get performance tips for running scripts on Hadoop clusters in less time
SQL All-in-One For Dummies
SQL is the internationally recognized standard language for dealing with data in relational databases. Developed by IBM, SQL became an international standard in 1986. The standard was updated in 1989, 1992, 1999, 2003, and 2008. It continues to evolve and gain capability. Database vendors continually update their products to incorporate the...
Microsoft SQL Server 2008 High Availability
Every new version of SQL Server brings new tools and features that help database administrators (DBAs), developers, and architects deliver effective solutions to end users more simply and efficiently. The terms effectiveness and efficiency can be measured from a technical perspective as High Availability...
Frommer's Paris 2011 (Frommer's Colour Complete Guides)
Discovering the City of Light and making it your own has always been the most compelling reason to visit Paris. If you’re a first-timer, everything, of course, will be new to you. If you’ve been away for a while, expect changes: Taxi drivers may no longer correct your fractured French, but address you in English, tantamount...
Data Access Patterns: Database Interactions in Object-Oriented Applications
Efficient, high-quality data access code is crucial to the performance and usability of virtually any enterprise application, and there's no better way to improve an existing system than to optimize its data access code. Regardless of database engine, platform, language, or application, developers repeatedly encounter the same...
PowerPoint 2013 For Dummies
Get up and running with this full-color guide to PowerPoint 2013!
PowerPoint, the number one presentation software, has been revised and improved with the introduction of Microsoft Office 2013. With this all-new, full-color book by your side, you will learn how to take full advantage of all of PowerPoint's powerful and...