Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
Data Mining: Concepts, Models, Methods, and Algorithms, Second Edition

Now updated—the systematic introductory guide to modern analysis of large data sets

As data sets continue to grow in size and complexity, there has been an inevitable move towards indirect, automatic, and intelligent data analysis in which the analyst works via more complex and sophisticated software tools. This book reviews state-of-the-art methodologies and techniques for analyzing enormous quantities of raw data in high-dimensional data spaces to extract new information for decision-making.

This Second Edition of Data Mining: Concepts, Models, Methods, and Algorithms discusses data mining principles and then describes representative state-of-the-art methods and algorithms originating from different disciplines such as statistics, machine learning, neural networks, fuzzy logic, and evolutionary computation. Detailed algorithms are provided with necessary explanations and illustrative examples, and questions and exercises for practice at the end of each chapter. This new edition features the following new techniques/methodologies:

  • Support Vector Machines (SVM)—developed based on statistical learning theory, they have a large potential for applications in predictive data mining

  • Kohonen Maps (Self-Organizing Maps - SOM)—one of very applicative neural-networks-based methodologies for descriptive data mining and multi-dimensional data visualizations

  • DBSCAN, BIRCH, and distributed DBSCAN clustering algorithms—representatives of an important class of density-based clustering methodologies

  • Bayesian Networks (BN) methodology often used for causality modeling

  • Algorithms for measuring Betweeness and Centrality parameters in graphs, important for applications in mining large social networks

  • CART algorithm and Gini index in building decision trees

  • Bagging & Boosting approaches to ensemble-learning methodologies, with details of AdaBoost algorithm

  • Relief algorithm, one of the core feature selection algorithms inspired by instance-based learning

  • PageRank algorithm for mining and authority ranking of web pages

  • Latent Semantic Analysis (LSA) for text mining and measuring semantic similarities between text-based documents

  • New sections on temporal, spatial, web, text, parallel, and distributed data mining

  • More emphasis on business, privacy, security, and legal aspects of data mining technology

This text offers guidance on how and when to use a particular software tool (with the companion data sets) from among the hundreds offered when faced with a data set to mine. This allows analysts to create and perform their own data mining experiments using their knowledge of the methodologies and techniques provided. The book emphasizes the selection of appropriate methodologies and data analysis software, as well as parameter tuning. These critically important, qualitative decisions can only be made with the deeper understanding of parameter meaning and its role in the technique that is offered here.

This volume is primarily intended as a data-mining textbook for computer science, computer engineering, and computer information systems majors at the graduate level. Senior students at the undergraduate level and with the appropriate background can also successfully comprehend all topics presented here.

(HTML tags aren't allowed.)

Synthesizable VHDL Design for FPGAs
Synthesizable VHDL Design for FPGAs

The methodology described in this book is the result of many years of research experience in the field of synthesizable VHDL design targeting FPGA based platforms. VHDL was first conceived as a documentation language for ASIC designs. Afterwards, the language was used for the behavioral simulation of ASICs, and also as a design input for...

Wireless Telecommunications Networking with ANSI-41
Wireless Telecommunications Networking with ANSI-41
ALL-IN-ONE GUIDE TO ANSI-41 Revision E Replacing IS-41, ANSI –41 Revision E is the North American standard for wireless telecommunications network signaling. Written by Randall Snyder and Michael Gallagher, two of the new standard's developers, Wireless Tel Network with ANSI-41, Second Edition provides you with the latest need-to-know...
PHP 5 Fast & Easy Web Development
PHP 5 Fast & Easy Web Development
Get up and running with PHP 5, Apache, and MySQL with ease. This guide demonstrates how to display dynamic content, build your own contact management system, create custom reports, work with XML, and much more.

Don’t spend your time wading through manuals to learn PHP 5. Spend it doing what you do best—creating web pages!...

Building Performance Dashboards and Balanced Scorecards with SQL Server Reporting Services
Building Performance Dashboards and Balanced Scorecards with SQL Server Reporting Services

Discover how to maintain and update balanced scorecards and performance dashboards with SQL Server Reporting Services

Complementing the bestselling Balanced Scorecards and Operational Dashboards with Microsoft Excel (9780470386811), this indispensable book shows you how to create maintainable and dynamically updated...

Apoptosis, Senescence and Cancer (Cancer Drug Discovery and Development)
Apoptosis, Senescence and Cancer (Cancer Drug Discovery and Development)
The goals of chemotherapy (and radiotherapy) are to eliminate tumor cell targets by promoting cell death. In recent years, a major focus has been placed on programmed cell death or apoptosis as the primary mechanism of cell killing. However, tumor cells may respond to various forms of treatment in diverse ways, only some of which...
IPython Interactive Computing and Visualization Cookbook
IPython Interactive Computing and Visualization Cookbook

Over 100 hands-on recipes to sharpen your skills in high-performance numerical computing and data science with Python

About This Book

  • Leverage the new features of the IPython notebook for interactive web-based big data analysis and visualization
  • Become an expert in high-performance...
©2019 LearnIT (support@pdfchm.net) - Privacy Policy