Data Mining: Practical Machine Learning Tools and Techniques, Second Edition

Data Mining: Practical Machine Learning Tools and Techniques, Second Edition, 9780120884070 (0120884070), Morgan Kaufmann, 2005

This book presents this new discipline in a very accessible form: both as a text to train the next generation of practitioners and researchers, and to inform lifelong learners like myself. Witten and Frank have a passion for simple and elegant solutions. They approach each topic with this mindset, grounding all concepts in concrete examples, and urging the reader to consider the simple techniques first, and then progress to the more sophisticated ones if the simple ones prove inadequate.

If you have data that you want to analyze and understand, this book and the associated Weka toolkit are an excellent way to start.
--From the foreword by Jim Gray, Microsoft Research

As with any burgeoning technology that enjoys commercial attention, the use of data mining is surrounded by a great deal of hype. Exaggerated reports tell of secrets that can be uncovered by setting algorithms loose on oceans of data. But there is no magic in machine learning, no hidden power, no alchemy. Instead there is an identifiable body of practical techniques that can extract useful information from raw data. This book describes these techniques and shows how they work.

The book is a major revision of the first edition that appeared in 1999. While the basic core remains the same, it has been updated to reflect the changes that have taken place over five years, and now has nearly double the references. The highlights for the new edition include thirty new technique sections; an enhanced Weka machine learning workbench, which now features an interactive interface; comprehensive information on neural networks; a new section on Bayesian networks; plus much more.

Offering a thorough grounding in machine learning concepts as well as practical advice on applying the tools and techniques, inside youll find:

+ Algorithmic methods at the heart of successful data miningincluding tried and true techniques as well as leading edge methods;
+ Performance improvement techniques that work by transforming the input or output;
+ Downloadable Weka, a collection of machine learning algorithms for data mining tasks, including tools for data pre-processing, classification, regression, clustering, association rules, and visualizationin a new, interactive interface.

About the Author

Ian H. Witten is a professor of computer science at the University of Waikato in New Zealand. He directs the New Zealand Digital Library research project. His research interests include information retrieval, machine learning, text compression, and programming by demonstration. He received an MA in Mathematics from Cambridge University, England; an MSc in Computer Science from the University of Calgary, Canada; and a PhD in Electrical Engineering from Essex University, England. He is a fellow of the ACM and of the Royal Society of New Zealand. He has published widely on digital libraries, machine learning, text compression, hypertext, speech synthesis and signal processing, and computer typography. He has written several books, the latest being
Managing Gigabytes (1999) and Data Mining (2000), both from Morgan Kaufmann. Eibe Frank is a researcher in the Machine Learning group at the University of Waikato. He holds a degree in computer science from the University of Karlsruhe in Germany and is the author of several papers, both presented at machine learning conferences and published in machine learning journals.

Comments

Amazing Books

Makers at Work: Folks Reinventing the World One Object or Idea at a Time

Apress, 2013

What do you get when you combine an electronics hobbyist, hacker, garage mechanic, kitchen table inventor, tinkerer, and entrepreneur? A “maker,” of course. Playful and creative, makers are—through expertise and experimentation—creating art, products, and processes that change the way we think and interact with the...

Essential CVS

O'Reilly, 2003

CVS (Concurrent Versions System) is a tool that enables you to track changes to a set of files over time. CVS is commonly used in software development to allow multiple developers to coordinate changes, track versions, and permit simultaneous development of different versions of the same code.

This book is not just for software...

Accounting for beginners

Meitav Self learning, 2013

Basic accounting skills are necessary tools when dealing with finances. Understanding the basic concepts and methods used in accounting is critical for developing organizational skills. This e-Book will help guide you into this fascinating...

Solar Cell Technology and Applications

Auerbach Publications, 2009

Energy experts predict that wholesale electricity prices could easily rise 35 to 65 percent by 2015. Add to this the growing need for energy independence and the need to reduce carbon emissions and it is very clear that the development of low-cost renewable energy, such as solar energy, is essential for our economy and our national security....

Archilochus: The Poems: Introduction, Text, Translation, and Commentary

Oxford University Press, 2019

In antiquity Archilochus of Paros was considered a poet rivalled only by Homer and Hesiod, yet he has been relatively neglected by modern scholarship. This is largely due to the fragmentary state of his surviving poetry, though our knowledge has expanded significantly since the middle of the
twentieth century as new papyrological...

How to Do Everything with Your Web 2.0 Blog

McGraw-Hill, 2007

Got a blog? Want one? Feel like you need one but you’re not sure where to start? Whatever the case, I’d suggest flipping through this book and seeing what you might be able to learn on this little journey through the latest and greatest in blogging. Yes, “Web 2.0” is at least 25 percent marketing term and 25 percent Internet...