Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
Apache Hive Essentials: Essential techniques to help you process, and get unique insights from, big data, 2nd Edition

Buy

This book takes you on a fantastic journey to discover the attributes of big data using Apache Hive.

Key Features

  • Grasp the skills needed to write efficient Hive queries to analyze the Big Data
  • Discover how Hive can coexist and work with other tools within the Hadoop ecosystem
  • Uses practical, example-oriented scenarios to cover all the newly released features of Apache Hive 2.3.3

Book Description

In this book, we prepare you for your journey into big data by frstly introducing you to backgrounds in the big data domain, alongwith the process of setting up and getting familiar with your Hive working environment.

Next, the book guides you through discovering and transforming the values of big data with the help of examples. It also hones your skills in using the Hive language in an effcient manner. Toward the end, the book focuses on advanced topics, such as performance, security, and extensions in Hive, which will guide you on exciting adventures on this worthwhile big data journey.

By the end of the book, you will be familiar with Hive and able to work effeciently to find solutions to big data problems

What you will learn

  • Create and set up the Hive environment
  • Discover how to use Hive's definition language to describe data
  • Discover interesting data by joining and filtering datasets in Hive
  • Transform data by using Hive sorting, ordering, and functions
  • Aggregate and sample data in different ways
  • Boost Hive query performance and enhance data security in Hive
  • Customize Hive to your needs by using user-defined functions and integrate it with other tools

Who This Book Is For

If you are a data analyst, developer, or simply someone who wants to quickly get started with Hive to explore and analyze Big Data in Hadoop, this is the book for you. Since Hive is an SQL-like language, some previous experience with SQL will be useful to get the most out of this book.

Table of Contents

  1. OVERVIEW OF BIG DATA AND HIVE
  2. SETTING UP THE HIVE ENVIRONMENT
  3. DATA DEFINITION AND DESCRIPTION
  4. Data Correlation and Scope
  5. DATA MANIPULATION
  6. DATA AGGREGATION AND SAMPLING
  7. Extensibility Considerations
  8. Working with Other Tools
  9. Performance Considerations
  10. Security Considerations
(HTML tags aren't allowed.)

Creating Mac Widgets with Dashcode (Firstpress)
Creating Mac Widgets with Dashcode (Firstpress)
In 100 pages, you will learn to create and use widgets using Dashcode. Widgets are simple, typically small applications for a specific purpose, such as a weather report, a calculator, a stock quote, and the like that reside on the Mac Dashboard. Building these before Dashcode was time consuming though not hard; but now, you can whip up a widget...
Coding Projects in Python
Coding Projects in Python

Using fun graphics and easy-to-follow instructions, this straightforward, this visual guide shows young learners how to build their own computer projects using Python, an easy yet powerful free programming language available for download.

Perfect for kids ages 10 and over who are ready to take a second step after Scratch, Coding...

Advances in Minimum Description Length: Theory and Applications (Neural Information Processing)
Advances in Minimum Description Length: Theory and Applications (Neural Information Processing)
The process of inductive inference—to infer general laws and principles from particular instances—is the basis of statistical modeling, pattern recognition, and machine learning. The Minimum Descriptive Length (MDL) principle, a powerful method of inductive inference, holds that the best explanation, given a limited set of observed...

The DevOps 2.1 Toolkit: Docker Swarm: Building, testing, deploying, and monitoring services inside Docker Swarm clusters (The DevOps Toolkit Series) (Volume 2)
The DevOps 2.1 Toolkit: Docker Swarm: Building, testing, deploying, and monitoring services inside Docker Swarm clusters (The DevOps Toolkit Series) (Volume 2)

The book envelops all aspects of building, testing, deploying, and monitoring services inside Docker Swarm clusters. We'll go through all the tools required for running a cluster. We'll go through the whole process with clusters running locally on a laptop. Once we are confident with the outcome, we'll translate the experience to...

Microsoft Big Data Solutions
Microsoft Big Data Solutions

Tap the power of Big Data with Microsoft technologies

Big Data is here, and Microsoft's new Big Data platform is a valuable tool to help your company get the very most out of it. This timely book shows you how to use HDInsight along with HortonWorks Data Platform for Windows to store, manage, analyze, and share Big Data...

Advanced Memory Optimization Techniques for Low-Power Embedded Processors
Advanced Memory Optimization Techniques for Low-Power Embedded Processors
The design of embedded systems warrants a new perspective because of the following two reasons: Firstly, slow and energy inefficient memory hierarchies have already become the bottleneck of the embedded systems. It is documented in the literature as the memory wall problem. Secondly, the software running on the contemporary embedded devices is...
©2019 LearnIT (support@pdfchm.net) - Privacy Policy