Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
Data Mining: Concepts, Models, Methods, and Algorithms, Second Edition

Now updated—the systematic introductory guide to modern analysis of large data sets

As data sets continue to grow in size and complexity, there has been an inevitable move towards indirect, automatic, and intelligent data analysis in which the analyst works via more complex and sophisticated software tools. This book reviews state-of-the-art methodologies and techniques for analyzing enormous quantities of raw data in high-dimensional data spaces to extract new information for decision-making.

This Second Edition of Data Mining: Concepts, Models, Methods, and Algorithms discusses data mining principles and then describes representative state-of-the-art methods and algorithms originating from different disciplines such as statistics, machine learning, neural networks, fuzzy logic, and evolutionary computation. Detailed algorithms are provided with necessary explanations and illustrative examples, and questions and exercises for practice at the end of each chapter. This new edition features the following new techniques/methodologies:

  • Support Vector Machines (SVM)—developed based on statistical learning theory, they have a large potential for applications in predictive data mining

  • Kohonen Maps (Self-Organizing Maps - SOM)—one of very applicative neural-networks-based methodologies for descriptive data mining and multi-dimensional data visualizations

  • DBSCAN, BIRCH, and distributed DBSCAN clustering algorithms—representatives of an important class of density-based clustering methodologies

  • Bayesian Networks (BN) methodology often used for causality modeling

  • Algorithms for measuring Betweeness and Centrality parameters in graphs, important for applications in mining large social networks

  • CART algorithm and Gini index in building decision trees

  • Bagging & Boosting approaches to ensemble-learning methodologies, with details of AdaBoost algorithm

  • Relief algorithm, one of the core feature selection algorithms inspired by instance-based learning

  • PageRank algorithm for mining and authority ranking of web pages

  • Latent Semantic Analysis (LSA) for text mining and measuring semantic similarities between text-based documents

  • New sections on temporal, spatial, web, text, parallel, and distributed data mining

  • More emphasis on business, privacy, security, and legal aspects of data mining technology

This text offers guidance on how and when to use a particular software tool (with the companion data sets) from among the hundreds offered when faced with a data set to mine. This allows analysts to create and perform their own data mining experiments using their knowledge of the methodologies and techniques provided. The book emphasizes the selection of appropriate methodologies and data analysis software, as well as parameter tuning. These critically important, qualitative decisions can only be made with the deeper understanding of parameter meaning and its role in the technique that is offered here.

This volume is primarily intended as a data-mining textbook for computer science, computer engineering, and computer information systems majors at the graduate level. Senior students at the undergraduate level and with the appropriate background can also successfully comprehend all topics presented here.

(HTML tags aren't allowed.)

SOA Patterns with BizTalk Server 2009
SOA Patterns with BizTalk Server 2009

SOA is about architecture, not products and SOA enables you to create better business processes faster than ever. While BizTalk Server 2009 is a powerful tool, by itself it cannot deliver long-lasting, agile solutions unless we actively apply tried and tested service-oriented principles.

The current BizTalk Server books are all for the...

Things I Will Never Tell You
Things I Will Never Tell You
This book is my world (cerebral based mindset) so understand it is the opposite of your world (physical based mindset) or you will not last very long, psychologically speaking. If you want to live in your world of fear, write a book and I will use the pages to wrap my fi sh heads in. I am pleased with my thoughts and my words and I will never...
XPath Kick Start : Navigating XML with XPath 1.0 and 2.0
XPath Kick Start : Navigating XML with XPath 1.0 and 2.0
 XPath Kick Start provides the fastest path to productivity with XPath, and is fully up to date with the most recent XPath specification. Award-winning author Steven Holzner speeds through the basics so you'll be an XPath expert in no time. You'll master the XPath syntax and data model and understand the features of XPath 2.0, including new...

Grid Application Systems Design
Grid Application Systems Design
Grid computing is an emerging technology designed for high-powered applications. Grid Application Systems Design shows how to unleash the high performance of Grid technology. It begins by delving into the history and theory of grid computing, providing background on the concepts, terminology, and issues surrounding it. The book then examines...
Mastering Elasticsearch, Second Edition
Mastering Elasticsearch, Second Edition

Further your knowledge of the Elasticsearch server by learning more about its internals, querying, and data handling

About This Book

  • Understand Apache Lucene and Elasticsearch's design and architecture
  • Design your index, configure it, and distribute it, not only with assumptions, but...
Trigger Happy: Videogames and the Entertainment Revolution
Trigger Happy: Videogames and the Entertainment Revolution
The Edge calls Trigger Happy a "seminal piece of work." For the first time ever, an aficionado with a knowledge of art, culture, and a real love of gaming takes a critical look at the future of our videogames, and compares their aesthetic and economic impact on society to that of film. Thirty years after the invention of the simplest of...
©2019 LearnIT (support@pdfchm.net) - Privacy Policy