Beginning Apache Spark 2: With Resilient Distributed Datasets, Spark SQL, Structured Streaming and Spark Machine Learning library

Beginning Apache Spark 2: With Resilient Distributed Datasets, Spark SQL, Structured Streaming and Spark Machine Learning library, 9781484235782 (1484235789), Apress, 2018

Develop applications for the big data landscape with Spark and Hadoop. This book also explains the role of Spark in developing scalable machine learning and analytics applications with Cloud technologies. Beginning Apache Spark 2 gives you an introduction to Apache Spark and shows you how to work with it.

Along the way, you’ll discover resilient distributed datasets (RDDs); use Spark SQL for structured data; and learn stream processing and build real-time applications with Spark Structured Streaming. Furthermore, you’ll learn the fundamentals of Spark ML for machine learning and much more.

After you read this book, you will have the fundamentals to become proficient in using Apache Spark and know when and how to apply it to your big data applications.

What You Will Learn

Understand Spark unified data processing platform
How to run Spark in Spark Shell or Databricks
Use and manipulate RDDs
Deal with structured data using Spark SQL through its operations and advanced functions
Build real-time applications using Spark Structured Streaming
Develop intelligent applications with the Spark Machine Learning library

Who This Book Is For

Programmers and developers active in big data, Hadoop, and Java but who are new to the Apache Spark platform.

Comments

Amazing Books

Selling Spirituality: The Silent Takeover of Religion

Routledge, 2004

From feng shui to holistic medicine, from aromatherapy candles to yoga weekends, from Christian mystics to New Age gurus, spirituality is big business. There has been an explosion of interest and popular literature on mind, body and spirit and ‘personal development’. We now see the introduction of modes of ‘spirituality’...

HTML, XHTML & CSS For Dummies (Computer/Tech)

For Dummies, 2008

Packed with useful tips, techniques, and code examples

Build quality Web pages with XHTML and add some pizzazz with CSS

You don't have to be a master programmer to build great Web pages! This book shows you what HTML is about and how to use XHTML to format great-looking pages. Then...

ASP.NET Bible

John Wiley & Sons, 2001

The Internet revolution of the late 1990s represented a dramatic shift in the way
individuals and organizations communicate with each other. Traditional applications,
such as word processors and accounting packages, are modeled as stand-alone
applications: they offer users the capability to perform tasks using data stored on the...

The Foundations of Statistics

Dover Publications, 1972

With the 1954 publication of his Foundations of Statistics, in which he proposed a basis that takes into account not only strictly objective and repetitive events, but also vagueness and interpersonal differences, Leonard J. Savage opened the greatest controversy in modern statistical thought. His theory of the...

Complete Digital Design: A Comprehensive Guide to Digital Electronics and Computer System Architecture

McGraw-Hill, 2003

Digital systems are created to perform data processing and control tasks. What distinguishes one system from another is an architecture tailored to efficiently execute the tasks for which it was designed. A desktop computer and an automobile’s engine controller have markedly different attributes dictated by their unique requirements. Despite...

Physical Principles of Electron Microscopy: An Introduction to TEM, SEM, and AEM

Springer, 2008

From the reviews:

"This book comprises a concise introduction to the fundamental physical concepts of electron microscopy and related analytical techniques … . The concepts are well explained and illustrated, and in addition, the author offers a helpful introduction to microscopy, as a whole … . The text includes interesting...