Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
Big Data Analytics with Microsoft HDInsight in 24 Hours, Sams Teach Yourself

Buy

With Microsoft HDInsight, business professionals and data analysts can rapidly leverage the power of Hadoop on a flexible, scalable cloud-based platform, using Microsoft's accessible business intelligence, visualization, and productivity tools. Now, in just 24 lessons of one hour or less, you can learn all the skills and techniques you'll need to provision, configure, monitor, troubleshoot, and use HDInsight, even if you're new to big data analytics. Each short, easy lesson builds on all that's come before: you'll learn all of HDInsight's essentials as you solve real data analytics problems. Sams Teach Yourself Big Data Analytics with Microsoft HDInsight in 24 Hours covers all this, and much more:

  • Introduction of Big Data, NoSQL systems, its Business Value Proposition and use cases examples
  • Introduction to Hadoop, Architecture, Ecosystem and Microsoft HDInsight
  • Getting to know Hadoop 2.0 and the innovations it provides like HDFS2 and YARN
  • Quickly installing, configuring, and monitoring Hadoop (HDInsight) clusters in the cloud and automating cluster provisioning
  • Customize the HDInsight cluster and install additional Hadoop ecosystem projects using Script Actions
  • Administering HDInsight from the Hadoop command prompt or Microsoft PowerShell
  • Using the Microsoft Azure HDInsight Emulator for learning or development
  • Understanding HDFS, HDFS vs. Azure Blob Storage, MapReduce Job Framework and Job Execution Pipeline
  • Doing big data analytics with MapReduce, writing your MapReduce programs in your choice of .NET programming language such as C#
  • Using Hive for big data analytics, demonstrate end to end scenario and how Apache Tez improves the performance several folds
  • Consuming HDInsight data from Microsoft BI Tools over Hive ODBC Driver - Using HDInsight with Microsoft BI and Power BI to simplify data integration, analysis, and reporting
  • Using PIG for big data transformation workflows step by step
  • Apache HBase on HDInsight, its architecture, data model, HBase vs. Hive, programmatically managing HBase data with C# and Apache Phoenix
  • Using Sqoop or SSIS (SQL Server Integration Services) to move data to/from HDInsight and build data integration workflows for transferring data
  • Using Oozie for scheduling, co-ordination and managing data processing workflows in HDInsight cluster
  • Using R programming language with HDInsight for performing statistical computing on Big Data sets
  • Using Apache Spark's in-memory computation model to run big data analytics up to 100 times faster than Hadoop MapReduce
  • Perform real-time Stream Analytics on high-velocity big data streams with Storm
  • Integration of Enterprise Data Warehouse with Hadoop and Microsoft Analytics Platform System (APS), formally known as SQL Server Parallel Data Warehouse (PDW)

Step-by-step instructions walk you through common questions, issues, and tasks; Q-and-As, Quizzes, and Exercises build and test your knowledge; "Did You Know?" tips offer insider advice and shortcuts; and "Watch Out!" alerts help you avoid problems. By the time you're finished, you'll be comfortable going beyond the book to create any HDInsight app you can imagine!

(HTML tags aren't allowed.)

Introduction to Machine Learning with R: Rigorous Mathematical Analysis
Introduction to Machine Learning with R: Rigorous Mathematical Analysis

Machine learning is an intimidating subject until you know the fundamentals. If you understand basic coding concepts, this introductory guide will help you gain a solid foundation in machine learning principles. Using the R programming language, you’ll first start to learn with regression modelling and then move into more...

Applied Text Analysis with Python: Enabling Language-Aware Data Products with Machine Learning
Applied Text Analysis with Python: Enabling Language-Aware Data Products with Machine Learning

From news and speeches to informal chatter on social media, natural language is one of the richest and most underutilized sources of data. Not only does it come in a constant stream, always changing and adapting in context; it also contains information that is not conveyed by traditional data sources. The key to unlocking natural...

Practical Neo4j
Practical Neo4j

Why have developers at places like Facebook and Twitter increasingly turned to graph databases to manage their highly connected big data? The short answer is that graphs offer superior speed and flexibility to get the job done.

It’s time you added skills in graph databases to your toolkit....


Building Products for the Enterprise: Product Management in Enterprise Software
Building Products for the Enterprise: Product Management in Enterprise Software

If you’re new to software product management or just want to learn more about it, there’s plenty of advice available—but most of it is geared toward consumer products. Creating high-quality software for the enterprise involves a much different set of challenges. In this practical book, two expert product managers...

Neo4j Graph Data Modeling
Neo4j Graph Data Modeling

Design efficient and flexible databases by optimizing the power of Neo4j

About This Book

  • Model your data as a graph using Neo4j to design databases with minimum hassle
  • Discover new patterns using graphs and solve problems that are difficult to solve using any other database
  • ...
R Machine Learning By Example
R Machine Learning By Example

Understand the fundamentals of machine learning with R and build your own dynamic algorithms to tackle complicated real-world problems successfully

About This Book

  • Get to grips with the concepts of machine learning through exciting real-world examples
  • Visualize and solve complex problems...
©2018 LearnIT (support@pdfchm.net) - Privacy Policy