Apache Mahout Cookbook

Apache Mahout Cookbook, 9781849518024 (1849518025), Packt Publishing, 2013

A fast, fresh, developer-oriented dive into the world of Mahout

Overview

  • Learn how to set up a Mahout development environment
  • Start testing Mahout in a standalone Hadoop cluster
  • Learn to find stock market direction using logistic regression
  • Over 35 recipes with real-world examples to help both skilled and non-skilled developers get the hang of the different features of Mahout

In Detail

The rise of the Internet and social networks has created demand for software that can analyze large datasets, some of which scale up to 10 billion rows. Apache Hadoop was created to handle such heavy computational tasks, and Mahout has gained recognition for providing data mining classification algorithms that can be used with datasets of this kind.

"Apache Mahout Cookbook" provides a fresh, scope-oriented approach to the Mahout world for beginners and advanced users alike. The book gives insight into how to write different data mining algorithms for use in the Hadoop environment and how to choose the best one for the task at hand.

"Apache Mahout Cookbook" looks at the various Mahout algorithms available and gives the reader a fresh, solution-centered approach to solving different data mining tasks. The recipes start easy and become progressively more complex. A step-by-step approach guides the developer through the different tasks involved in mining a huge dataset. You will also learn how to code your own Mahout data mining algorithm and determine the best one for a particular task. In addition, a whole chapter is dedicated to loading data into Mahout from an external RDBMS. Considerable attention is also given to using your data mining algorithm inside your own code so that you can run it in a Hadoop environment. Theoretical aspects of the algorithms are covered for information purposes, but every chapter is written to let the developer get into the code as quickly and smoothly as possible. To that end, every recipe comes with code set up for reuse with Maven, as well as the Mahout source code as a Maven project.

By the end of this book, you will be able to code your own procedures to perform various data mining tasks with different algorithms, and to evaluate and choose the best ones for your tasks.

What you will learn from this book

  • Configure a full Mahout development environment from scratch with NetBeans and Maven
  • Handle SequenceFiles for better performance
  • Query and store results in an RDBMS with Sqoop
  • Use logistic regression to predict the next step
  • Understand text mining of raw data with Naïve Bayes
  • Create and understand clusters
  • Customize Mahout to evaluate different cluster algorithms
  • Use the MapReduce approach to solve real-world data mining problems

Approach

"Apache Mahout Cookbook" uses over 35 recipes packed with illustrations and real-world examples to help beginners as well as advanced programmers get acquainted with the features of Mahout.

Who this book is written for

"Apache Mahout Cookbook" is great for developers who want to have a fresh and fast introduction to Mahout coding. No previous knowledge of Mahout is required, and even skilled developers or system administrators will benefit from the various recipes presented.

