Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
Data Mining the Web: Uncovering Patterns in Web Content, Structure, and Usage

Buy
Learn How To Convert Web Data Into Web Knowledge

This text demonstrates how to extract knowledge by finding meaningful connections among data spread throughout the Web. Readers learn methods and algorithms from the fields of information retrieval, machine learning, and data mining which, when combined, provide a solid framework for mining the Web. The authors walk readers through the algorithms with the aid of examples and exercises.

This text is divided into three parts:

  • Part One, Web Structure, presents basic concepts and techniques for extracting information from the Web. Readers learn how to collect and index Web documents as well as search and rank Web pages according to their textual content and hyperlink structure.

  • Part Two, Web Content Management, offers two approaches, clustering and classification, for organizing Web content. For both approaches, the authors set forth specific algorithms that enable readers to convert Web data into knowledge.

  • Part Three, Web Usage Mining, demonstrates the application of data mining methods to uncover meaningful patterns of Internet usage.

Methods and algorithms are illustrated by simple examples. More than 100 exercises help readers assess their grasp of the material. Further, thirty-four hands-on analysis problems ask readers to use their new data mining expertise to solve real problems, working with large data sets. All the data sets needed for the examples, exercises, and analysis problems are available on the companion Web site.

The extensive use of examples, along with the opportunity to test and apply data mining skills, makes this text ideal for graduate and upper-level undergraduates in computer science and engineering. Web designers and researchers will find that this text gives them a new set of tools to further mine the Web for knowledge and move well beyond the capabilities of standard search engines.

About the Author

Zdravko Markov, PhD, is Associate Professor of Computer Science at Central Connecticut State University. The author of three textbooks, Dr. Markov teaches undergraduate and graduate courses in computer science and artificial intelligence. He is currently a Principal Investigator (PI) in a National Science Foundation–funded project designed to introduce machine learning to undergraduates.

Daniel T. Larose, PhD, is Professor of Statistics in the Department of Mathematical Sciences at Central Connecticut State University. He is the author of three data mining books and a forthcoming textbook in undergraduate statistics. He developed and directs CCSU's DataMining@CCSU programs.

(HTML tags aren't allowed.)

Rails Crash Course: A No-Nonsense Guide to Rails Development
Rails Crash Course: A No-Nonsense Guide to Rails Development

Rails is a robust, flexible development platform that lets you build complex websites quickly. Major websites like GitHub, Hulu, and Twitter have run Rails under the hood, and if you know just enough HTML and CSS to be dangerous, Rails Crash Course will teach you to harness Rails for your own projects and create web...

VLSI Physical Design: From Graph Partitioning to Timing Closure
VLSI Physical Design: From Graph Partitioning to Timing Closure

Physical design of integrated circuits remains one of the most interesting and challenging arenas in the field of Electronic Design Automation. The ability to integrate more and more devices on our silicon chips requires the algorithms to continuously scale up. Nowadays we can integrate 2e9 transistors on a single 45nm-technology chip. This...

Cat Owner's Home Veterinary Handbook, Fully Revised and Updated
Cat Owner's Home Veterinary Handbook, Fully Revised and Updated

The classic bestseller--expanded and updated

For years, many veterinary treatments for cats were based on research conducted with dogs because it was wrongly assumed that cats were very similar. Recently, there have been giant strides in feline veterinary research. This classic reference is fully updated and revised to...


Bone Densitometry in Growing Patients (Current Clinical Practice)
Bone Densitometry in Growing Patients (Current Clinical Practice)

Bone Densitometry in Growing Patients: Guidelines for Clinical Practice, edited by Drs. Sawyer, Bachrach, and Fung, is a milestone book for all health prof- sionals concerned with bone health in growing patients. The book introduces and emphasizes the importance of attending to issues of bone health and development in childhood and...

SimCity 4: Deluxe Edition (also Covers Rush Hour Expansion)
SimCity 4: Deluxe Edition (also Covers Rush Hour Expansion)
SimCity just keeps getting bigger and better, doesn’t it? Fortunately, so does this book, with a brand-new section dedicated to the copious new features introduced in Rush Hour.

The first seven parts of this book are for all SimCity players, illustrating the basics and delving in deep to enable anyone to become an expert Mayor.
...
Magic Is Dead: My Journey into the World's Most Secretive Society of Magicians
Magic Is Dead: My Journey into the World's Most Secretive Society of Magicians

In the vein of Neil Strauss’ The Game and Joshua Foer’s Moonwalking with Einstein comes the fascinating story of one man’s colorful, mysterious, and personal journey into the world of magic, and his unlikely invitation into an underground secret society of revolutionary magicians from...

©2021 LearnIT (support@pdfchm.net) - Privacy Policy