Practical Hive: A Guide to Hadoop's Data Warehouse System

Practical Hive: A Guide to Hadoop's Data Warehouse System, 9781484202722 (1484202724), Apress, 2016

Dive into the world of SQL on Hadoop and get the most out of your Hive data warehouses. This book is your go-to resource for using Hive: authors Scott Shaw, Ankur Gupta, David Kjerrumgaard, and Andreas Francois Vermeulen take you through learning HiveQL, the SQL-like language specific to Hive, to analyze, export, and massage the data stored across your Hadoop environment. From deploying Hive on your hardware or virtual machine and setting up its initial configuration to learning how Hive interacts with Hadoop, MapReduce, Tez and other big data technologies, Practical Hive gives you a detailed treatment of the software.

In addition, this book discusses the value of open source software, Hive performance tuning, and how to leverage semi-structured and unstructured data.

What You Will Learn

Install and configure Hive for new and existing datasets
Perform DDL operations
Execute efficient DML operations
Use tables, partitions, buckets, and user-defined functions
Discover performance tuning tips and Hive best practices

Who This Book Is For

Developers, companies, and professionals who deal with large amounts of data and could use software that can efficiently manage large volumes of input. It is assumed that readers have the ability to work with SQL.

Comments

Amazing Books

Strauss (Master Musicians Series)

Oxford University Press, 2019

Richard Strauss is an outlier in the context of twentieth century music. Some consider him a composer of the late romantic period, while others declare him a traitor of modernity for his role in National Socialism. Despite the controversy surrounding him, Strauss's works--even beyond his most well-known operas Elektra and...

Cosmology

Oxford University Press, 2008

This book is unique in the detailed, self-contained, and comprehensive treatment that it gives to the ideas and formulas that are used and tested in modern cosmological research. It divides into two parts, each of which provides enough material for a one-semester graduate course. The first part deals chiefly with the isotropic and homogeneous...

Foundations of Python Network Programming: The comprehensive guide to building network applications with Python

Apress, 2010

This second edition of Foundations of Python Network Programming targets Python 2.5 through Python 2.7, the most popular production versions of the language. Python has made great strides since Apress released the first edition of this book back in the days of Python 2.3. The advances required new chapters to be written from the ground up,...

Play to Win: The Nonprofit Guide to Competitive Strategy

Jossey-Bass, 2004

A Step-by-Step Guide for Helping Your Nonprofit Compete Successfully

In this important resource, acclaimed nonprofit consultant, author, educator, and speaker David La Piana shows nonprofit leaders how they can increase the likelihood of organizational success by tapping into the power of competitive strategy. Play to Win demonstrates how your...

CCNA Cisco Certified Network Associate : Study Guide (with CD-ROM)

Sybex, 2000

Here's the book you need to prepare for Cisco's new CCNA exam, 640-607. Written by a Cisco internetworking expert who knows exactly what it takes to pass the test, this Study Guide provides:

Assessment testing to focus and direct your studies In-depth coverage of official exam objectives Configuration practice with a Router...

java interview questions: Top 20 java interview programs and answers

Independent Publishers Group, 2017

Java Interview Question is here to help you through the INTERVIEW process, teaching you what you need to know and enabling you to perform at your very best. I've coached and interviewed hundreds of software engineers. The result is this book. These interview questions are real; they are not pulled out of computer science textbooks. They...