Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
Fast Data Processing with Spark - Second Edition

Buy

Perform real-time analytics using Spark in a fast, distributed, and scalable way

About This Book

  • Develop a machine learning system with Spark's MLlib and scalable algorithms
  • Deploy Spark jobs to various clusters such as Mesos, EC2, Chef, YARN, EMR, and so on
  • This is a step-by-step tutorial that unleashes the power of Spark and its latest features

Who This Book Is For

Fast Data Processing with Spark - Second Edition is for software developers who want to learn how to write distributed programs with Spark. It will help developers who have had problems that were too big to be dealt with on a single computer. No previous experience with distributed programming is necessary. This book assumes knowledge of either Java, Scala, or Python.

What You Will Learn

  • Install and set up Spark on your cluster
  • Prototype distributed applications with Spark's interactive shell
  • Learn different ways to interact with Spark's distributed representation of data (RDDs)
  • Query Spark with a SQL-like query syntax
  • Effectively test your distributed software
  • Recognize how Spark works with big data
  • Implement machine learning systems with highly scalable algorithms

In Detail

Spark is a framework used for writing fast, distributed programs. Spark solves similar problems as Hadoop MapReduce does, but with a fast in-memory approach and a clean functional style API. With its ability to integrate with Hadoop and built-in tools for interactive query analysis (Spark SQL), large-scale graph processing and analysis (GraphX), and real-time analysis (Spark Streaming), it can be interactively used to quickly process and query big datasets.

Fast Data Processing with Spark - Second Edition covers how to write distributed programs with Spark. The book will guide you through every step required to write effective distributed programs from setting up your cluster and interactively exploring the API to developing analytics applications and tuning them for your purposes.

(HTML tags aren't allowed.)

Building the Global Fiber Optics Superhighway
Building the Global Fiber Optics Superhighway

Many wonderful stories have contributed to the growth and worldwide renown of the fiber optics industry. From its improbable roots in the 1960s and the important early laser work by Stewart Miller and colleagues at Bell Laboratories to seminal discoveries by Coming’s Don Keck, Robert Maurer, and Peter Schultz in 1970 demonstrating that...

Basic Methods in Antibody Production and Characterization
Basic Methods in Antibody Production and Characterization

Written for researchers and professionals in the fields of biomedical research, immunology, biochemistry, molecular biology, pathology, and biotechnology, Basic Methods in Antibody Production and Characterization uses a cookbook approach to presenting the methods for the production, characterization, and use of antibodies.
Antibodies
...

Practical Algorithms for Programmers
Practical Algorithms for Programmers

The purpose of this book is to provide a practical compendium of algorithms for use in applications. Unlike most works on algorithms, this book is not a  textbook: you will not find implementation details left as an exercise for the reader, nor will you find highly theoretical discussions of algorithms with small  snippets of code...


Time Series Analysis of Discourse: Method and Case Studies (Routledge Studies in Linguistics)
Time Series Analysis of Discourse: Method and Case Studies (Routledge Studies in Linguistics)

This volume serves as a comprehensive introduction to Time Series Analysis (TSA), used commonly in financial and engineering sciences, to demonstrate its potential to complement qualitative approaches in discourse analysis research. The book begins by discussing how time has previously been conceptualized in the literature, drawing...

Managing for Knowledge - HR's Strategic Role
Managing for Knowledge - HR's Strategic Role

This practical book draws on the author’s own experience, as well as that of leading-edge Human Resource and Knowledge Management practitioners including Linda Holbeche, Elizabeth Lank and Dave Snowden, each of whom recognizes, that building a knowledge-centric culture cannot be achieved through technology alone.

It covers areas...

Getting Started with CouchDB
Getting Started with CouchDB
When I was about nine years old, I had an Acorn Electron, a home computer developed by Acorn Machines and one of the major precursors to modern home computing. It was tiny by today’s standards, having just 32K of RAM, a 2MHz CPU, and with the staggering ability to store a massive 360 Kb on the 3 inch Amstrad disks I was using...
©2021 LearnIT (support@pdfchm.net) - Privacy Policy