Work with Apache Spark using Scala to deploy and set up single-node, multi-node, and high-availability clusters. This book discusses various components of Spark such as Spark Core, DataFrames, Datasets and SQL, Spark Streaming, Spark MLib, and R on Spark with the help of practical code snippets for each topic. Practical Apache Spark also covers the integration of Apache Spark with Kafka with examples. You’ll follow a learn-to-do-by-yourself approach to learning – learn the concepts, practice the code snippets in Scala, and complete the assignments given to get an overall exposure.
On completion, you’ll have knowledge of the functional programming aspects of Scala, and hands-on expertise in various Spark components. You’ll also become familiar with machine learning algorithms with real-time usage.
What You Will Learn
Discover the functional programming features of Scala
Understand the complete architecture of Spark and its components
Integrate Apache Spark with Hive and Kafka
Use Spark SQL, DataFrames, and Datasets to process data using traditional SQL queries
Work with different machine learning concepts and libraries using Spark's MLlib packages
Who This Book Is For
Developers and professionals who deal with batch and stream data processing.
Lung Cancer: Principles and Practice
Thoroughly revised and updated, this Third Edition is the most comprehensive, current reference on lung cancer, with contributions from the world's foremost surgeons, radiation oncologists, medical oncologists, pulmonologists, and basic scientists. This edition includes sixteen new chapters and has been reorganized for greater...
XQuery With the XQuery 1.0 standard, you finally have a tool that will make it much easier to search, extract and manipulate information from XML content stored in databases. This in-depth tutorial not only walks you through the XQuery specification, but also teaches you how to program with this widely anticipated query language.
Reinforced Concrete Designer's Handbook Since the last edition appeared under the Viewpoint imprint of the Cement and Concrete Association, this Handbook has been in the ownership of two new publishers. I am delighted that it has now joined the catalogue of engineering books published by Spon, one of the most respected names in technical publishing in the world, and that its success is... Follicular Lymphoma: Current Management and Novel Approaches
This book provides a comprehensive, state-of-the-art overview of follicular lymphoma. The first section of the text explores the current understanding of the biology and pathogenesis of follicular lymphoma, through reviewing recent changes in the WHO classification of low-grade lymphomas, current diagnostic techniques, and emerging...