Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
Spark Cookbook

Buy
Spark Cookbook, 9781783987061 (1783987065), Packt Publishing, 2015

Over 60 recipes on Spark, covering Spark Core, Spark SQL, Spark Streaming, MLlib, and GraphX libraries

About This Book

  • Become an expert at graph processing using GraphX
  • Use Apache Spark as your single big data compute platform and master its libraries
  • Learn with recipes that can be run on a single machine as well as on a production cluster of thousands of machines

Who This Book Is For

If you are a data engineer, an application developer, or a data scientist who would like to leverage the power of Apache Spark to get better insights from big data, then this is the book for you.

What You Will Learn

  • Install and configure Apache Spark with various cluster managers
  • Set up development environments
  • Perform interactive queries using Spark SQL
  • Get to grips with real-time streaming analytics using Spark Streaming
  • Master supervised learning and unsupervised learning using MLlib
  • Build a recommendation engine using MLlib
  • Develop a set of common applications or project types, and solutions that solve complex big data problems
  • Use Apache Spark as your single big data compute platform and master its libraries

In Detail

By introducing in-memory persistent storage, Apache Spark eliminates the need to store intermediate data in filesystems, thereby increasing processing speed by up to 100 times.

This book will focus on how to analyze large and complex sets of data. Starting with installing and configuring Apache Spark with various cluster managers, you will cover setting up development environments. You will then cover various recipes to perform interactive queries using Spark SQL and real-time streaming with various sources such as Twitter Stream and Apache Kafka. You will then focus on machine learning, including supervised learning, unsupervised learning, and recommendation engine algorithms. After mastering graph processing using GraphX, you will cover various recipes for cluster optimization and troubleshooting.

(HTML tags aren't allowed.)

The EU’s Policy on the Integration of Migrants: A Case of Soft-Europeanization? (Palgrave Studies in European Union Politics)
The EU’s Policy on the Integration of Migrants: A Case of Soft-Europeanization? (Palgrave Studies in European Union Politics)
This book addresses a timely, yet largely overlooked, issue in political science: the integration of migrants in a multilevel polity. In a context characterised by the increasing salience of migration-related questions, and despite the gradual construction of a European Union immigration policy over the past two decades, no competence...
Beginning C# Objects: From Concepts to Code
Beginning C# Objects: From Concepts to Code

Beginning C# Objects: From Concepts to Code is a comprehensive yet approachable guide for anyone interested in learning the C# language, beginning with the basics.

To begin, this book addresses the two fundamental concepts that programmers must grasp in order to write a professional object-oriented C# application: the nature and...

MCSE Training Kit : Microsoft SQL Server 2000 Database Design and Implementation (Exam 70-229)
MCSE Training Kit : Microsoft SQL Server 2000 Database Design and Implementation (Exam 70-229)
Welcome to MCSE Training Kit: Microsoft SQL Server 2000 Database Design and Implementation. This training kit introduces you to SQL Server 2000 and provides detailed information about how to design and implement a SQL Server database. The training kit takes you through the steps of how to plan and implement a database, create and maintain...

CCNP BCMSN Exam Certification Guide (CCNP Self-Study, 642-811), Second Edition
CCNP BCMSN Exam Certification Guide (CCNP Self-Study, 642-811), Second Edition
Study guide helps you master all the topics on the new CCNP BCMSN exam, including: switch operation and configuration, VLAN Trunking Protocol (VTP), aggregating switch links, and more.

Prepare for the CCNP Switching exam with the only Cisco Systems authorized self-study preparation book!

Matlab, Second Edition: A Practical Introduction to Programming and Problem Solving
Matlab, Second Edition: A Practical Introduction to Programming and Problem Solving

The purpose of this book is to teach fundamentals of programming concepts and skills needed for basic problem solving, all using MATLABW as the vehicle. MATLAB is a powerful software package that has built-in functions to accomplish a diverse range of tasks, from mathematical operations to three-dimensional imaging. Additionally,...

Encyclopaedia Arcane: Battle Magic - The Eldritch Storm
Encyclopaedia Arcane: Battle Magic - The Eldritch Storm
Encyclopaedia Arcane: Battle Magic Continuing the Encyclopaedia Arcane series. Battle Magic introduces the greatest force of sorcery into the d20 System. Battle Mages of incredible power are able to blast entire hordes of enemies apart with fire and lightning - now, for the very first time, players too can access this awesome destructive force....
©2021 LearnIT (support@pdfchm.net) - Privacy Policy