Data Preparation for Data Mining (The Morgan Kaufmann Series in Data Management Systems)

Data Preparation for Data Mining addresses an issue unfortunately ignored by most authorities on data mining: data preparation. Thanks largely to its perceived difficulty, data preparation has traditionally taken a backseat to the more alluring question of how best to extract meaningful knowledge. But without adequate preparation of your data, the return on the resources invested in mining is certain to be disappointing.

Dorian Pyle corrects this imbalance. A twenty-five-year veteran of what has become the data mining industry, Pyle shares his own successful data preparation methodology, offering both a conceptual overview for managers and complete technical details for IT professionals. Apply his techniques and watch your mining efforts pay off in the form of improved performance, reduced distortion, and more valuable results.

Features

  • Offers in-depth coverage of an essential but largely ignored subject.
  • Goes far beyond theory, leading you step by step through the author's own data preparation techniques.
  • Provides practical illustrations of the author's methodology using realistic sample data sets.
  • Includes algorithms you can apply directly to your own project, along with instructions for understanding when automation is possible and when greater intervention is required.
  • Explains how to identify and correct data problems that may be present in your application.
  • Prepares miners, helping them head into preparation with a better understanding of data sets and their limitations.

On the enclosed CD-ROM, you'll find a suite of programs as C source code and compiled into a command-line-driven toolkit. This code illustrates how the author's techniques can be applied to arrive at an automated preparation solution that works for you. Also included are demonstration versions of three commercial products that help with data preparation, along with sample data with which you can practice and experiment.

About the Author

Dorian Pyle is Chief Scientist and Founder of PTI (www.pti.com), which develops and markets Powerhouse™ predictive and explanatory analytics software. Dorian has over 20 years of experience in the artificial intelligence and machine learning techniques used in what is known today as "data mining" or "predictive analytics". He has applied this knowledge as a consultant with Knowledge Stream Partners, Xchange, Naviant, Thinking Machines, and Data Miners, as well as with companies directly involved in credit card marketing for banks and with manufacturing companies using industrial automation. In 1976 he was involved in building artificially intelligent machine learning systems using the pioneering technologies now known as neural computing and associative memories. He is familiar with the most advanced technologies in data mining, including entropic analysis (information theory), chaotic and fractal decomposition, neural technologies, evolution and genetic optimization, algebra evolvers, case-based reasoning, concept induction, and other advanced statistical techniques.


An Introduction to Programming Using Visual Basic 2010, 8th Edition

An Introduction to Programming Using Visual Basic 2010, Eighth Edition, consistently praised by both students and instructors, is designed for students with no prior computer programming experience. Now updated for Visual Basic 2010, Schneider focuses on teaching problem-solving skills and sustainable...

Renewable and Efficient Electric Power Systems
Engineering for sustainability is an emerging theme for the twenty-first century, and the need for more environmentally benign electric power systems is a critical part of this new thrust. Renewable energy systems that take advantage of energy sources that won’t diminish over time and are independent of fluctuations in...
Introduction to Octave: For Engineers and Scientists
Familiarize yourself with Octave using this concise, practical tutorial that is focused on writing code to learn concepts. Starting from the basics, this book covers array-based computing, plotting, and working with files in Octave, which can run MATLAB files without modification. Introduction to Octave is useful...

Art of RAW Conversion: How to Produce Art-Quality Photos with Adobe Photoshop CS2 and Leading RAW Converters
The RAW file format used by digital cameras is essentially the raw data that a camera captures when it takes a photo. RAW files allow the digital photographer to edit and manipulate their photos with less data loss than in other file formats (such as JPEG). There are many RAW conversion tools, and it's often a good idea to use more than one to get...
Machinery's Handbook 28th Larger Print Edition (Machinery's Handbook)

Celebrating nearly 100 years as The Bible of the Mechanical Industries, the 28th edition brings together volumes of knowledge, information and data gathered, revised and improved upon from experts throughout the mechanical industries. Extraordinarily comprehensive yet easy to use since it premiered, Machinery's Handbook provides mechanical...

PHP Game Programming
"PHP Game Programming" offers you the introduction you need to begin creating your own online games. You'll be amazed at the games you can create with this powerful (and completely free) development tool! Dive right in as you begin with coverage of server configuration and the major features of PHP. Then you're off and running as you use...