Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data (Data-Centric Systems and Applications)

Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data (Data-Centric Systems and Applications), 9783642194597 (3642194591), Springer, 2011

The rapid growth of the Web in the past two decades has made it the largest publicly accessible data source in the world. Web mining aims to discover useful information or knowledge from Web hyperlinks, page contents, and usage logs. Based on the primary kinds of data used in the mining process, Web mining tasks can be categorized into three main types: Web structure mining, Web content mining and Web usage mining. Web structure mining discovers knowledge from hyperlinks, which represent the structure of the Web. Web content mining extracts useful information/ knowledge from Web page contents. Web usage mining mines user activity patterns from usage logs and other forms of logs of user interactions with Web systems. Since the publication of the first edition at the end of 2006, there have been some important advances in several areas. To reflect these advances, new materials have been added to most chapters. The major changes are in Chapter 11 and Chapter 12, which have been rewritten and significantly expanded. When the first edition was written, opinion mining (Chapter 11) was still in its infancy. Since then, the research community has gained a much better understanding of the problem and has proposed many novel techniques to solve various aspects of the problem. To include the latest developments for the Web usage mining chapter (Chapter 12), the topics of recommender systems and collaborative filtering, query log mining, and computational advertising have been added. This new edition is thus considerably longer, from a total of 532 pages in the first edition to a total of 622 pages in this second edition.

The goal of the book is to present the above Web data mining tasks and their core mining algorithms. The book is intended to be a text with a comprehensive coverage, and therefore, for each topic, sufficient details are given so that readers can gain a reasonably complete knowledge of its algorithms or techniques without referring to any external materials. Five of the chapters - partially supervised learning, structured data extraction, information integration, opinion mining and sentiment analysis, and Web usage mining - make this book unique. These topics are not covered by existing books, but yet are essential to Web data mining. Traditional Web mining topics such as search, crawling and resource discovery, and social network analysis are also covered in detail in this book.

Comments

Amazing Books

ASP.NET for Web Designers

New Riders Publishing, 2002

Teaching ASP.NET in a non-linear format that creative thinkers can easily grasp and understand without the typical programming jargon. Provides clear and concise, hands-on, real-world examples right from the beginning of the book. The book contains a natural progression by providing foundational information in the opening chapters. Content will be...

Biomechanical Systems: Techniques and Applications, Volume II: Cardiovascular Techniques

CRC Press, 2000

Because of developments in powerful computer technology, computational techniques, advances in a wide spectrum of diverse technologies, and other advances coupled with cross disciplinary pursuits between technology and its greatly significant applied implications in human body processes, the field of biomechanics is evolving as a broadly...

Elastix Unified Communications Server Cookbook

Packt Publishing, 2015

More than 140 real-life, hands-on recipes and tips to install, deploy, administer, and maintain any VoIP/Unified Communications solution based on Elastix

About This Book

Enable a full cost-effective unified communications server solution

Go from a single server configuration to a...

Practical .NET 2.0 Networking Projects

Apress, 2007

Practical .NET 2.0 Networking Projects demonstrates some of the key networking technologies that are being made easily accessible through .NET Framework 2.0. It discusses communication between wired machines and between networks and mobile devices. The book teaches you about the technologies by walking you through sample projects in a...

Applied Technology and Innovation Management: Insights and Experiences from an Industry-Leading Innovation Centre

Springer, 2010

Rapid application of new technologies and highly leveraged innovation processes are key for the success of companies and organizations in dynamic markets. Based on the experiences of one of the industry’s most modern innovation centers this book provides an insight into the tools and methods used to align customer requirements, competitive...

Professional MFC With Visual C++ 5

Wrox Press, 1999

This book focuses on the use of the Microsoft Foundation Classes to develop software. Of course, 'software' is a very broad term - some readers are doubtless interested in writing low-level technical applications that might not even have a user interface, while others will be interested in coding form-oriented applications that do little more the...