Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
HDInsight Essentials

Buy
HDInsight Essentials, 9781849695367 (1849695369), Packt Publishing, 2013

Tap your unstructured Big Data and empower your business using the Hadoop distribution from Windows

Overview

  • Architect a Hadoop solution with a modular design for data collection, distributed processing, analysis, and reporting
  • Build a multi-node Hadoop cluster on Windows servers
  • Establish a Big Data solution using HDInsight with open source software, and provide useful Excel reports
  • Run Pig scripts and build simple charts using Interactive JS (Azure)

In Detail

We live in an era in which data is generated with every action and a lot of these are unstructured; from Twitter feeds, Facebook updates, photos and digital sensor inputs. Current relational databases cannot handle the volume, velocity and variations of data. HDInsight gives you the ability to gain the full value of Big Data with a modern, cloud-based data platform that manages data of any size and type, whether structured or unstructured.

A hands-on guide that shows you how to seamlessly store and process Big Data of all types through Microsoft’s modern data platform; which provides simplicity, ease of management, and an open enterprise-ready Hadoop service all running in the Cloud. You will then learn how to analyze your Hadoop data with PowerPivot, Power View, Excel, and other Microsoft BI tools; thanks to integration with the Microsoft data platform, this will give you a solid foundation to build your own HDInsight solution, both on premise and on Cloud.

Firstly, we will provide an overview of Hadoop and Microsoft Big Data strategy, where HDinsight plays a key role. We will then show you how to set up your HDInsight cluster and take you through the 4 stages of collecting, processing, analysing and reporting. For each of these stages, you will see a practical example with working code.

You will then learn core Hadoop concepts like HDFS and MapReduce. You will also get a closer look at how Microsoft’s HDInsight leverages Hortonworks Data Platform that uses Apache Hadoop. You will then be guided through Hadoop commands and programming using open source software, such as Hive and Pig with HDInsight. Finally, you will learn to analyze and report using PowerPivot, Power View, Excel, and other Microsoft BI tools.

This guide provides step-by-step instructions on how to build a Big Data solution using HDInsight with open source software, provide useful Excel reports, and open up the full value of HDInsight.

What you will learn from this book

  • Explore the characteristics of a Big Data problem
  • Analyse and report your data using PowerPivot, Power View, Excel, and other Microsoft BI tools
  • Explore the architectural considerations for scalability, maintainability, and security
  • Understand the concept of Data Ingestion to your HDInsight cluster including community tools and scripts
  • Administer and monitor your HDInsight cluster including capacity and process management
  • Get to know the Hadoop ecosystem with various tools and software based on their roles
  • Get to know the HDInsight differentiator and how it is built on top of Apache Hadoop
  • Transform your data using open source software such as MapReduce, Hive, Pig and JavaScript

Approach

This book is a fast-paced guide full of step-by-step instructions on how to build a multi-node Hadoop cluster on Windows servers.

(HTML tags aren't allowed.)

UNIX® Shells by Example Fourth Edition
UNIX® Shells by Example Fourth Edition

The world's #1 shell programming book—now fully updated for Linux and more!

UNIX Shells by Example is the world's #1 shell programming book, from the world's #1 shell programming instructor: Ellie Quigley. In ...

Powerhouse Partners : A Blueprint for Building Organizational Culture for Breakaway Results
Powerhouse Partners : A Blueprint for Building Organizational Culture for Breakaway Results
For any manager or executive committed to achieving business goals through effective strategic partnership, this book gathers successes of some of the world's leading companies to deliver a tool kit for shaping a partnering culture.

From the author who introduced the groundbreaking concept of Partnering Intelligence come the...

Access Denied in the Information Age
Access Denied in the Information Age
It was to be expected that a new millennium should bring with it a flurry of observers claiming that we stand on the threshold of a new society. We have not been disappointed. The media excitement at the fantastic financial speculation on the high-tech and internet stock market is but one instance cited to support this claim. This new society is...

Learning GNU Emacs, Third Edition
Learning GNU Emacs, Third Edition

The third edition of Learning GNU Emacs describes Emacs 21.3 from the ground up, including new user interface features such as an icon-based toolbar and an interactive interface to Emacs customization. A new chapter details how to install and run Emacs on Mac OS X, Windows, and Linux, including tips for using...

The Anthology of Rap
The Anthology of Rap

From the school yards of the South Bronx to the tops of the Billboard charts, rap has emerged as one of the most influential musical and cultural forces of our time. In The Anthology of Rap, editors Adam Bradley and Andrew DuBois explore rap as a literary form, demonstrating that rap is also a wide-reaching and vital...

Windows to Linux Business Desktop Migration
Windows to Linux Business Desktop Migration
Over the last four years, Linux has established itself as the fastest growing server platform for enterprise Information Technology. As the server platform grows the desktop platform is also growing, domestically and abroad. One of the areas most lacking information, however, is in the capability of Linux as a desktop replacement for a Microsoft...
©2021 LearnIT (support@pdfchm.net) - Privacy Policy