| Existence of huge amounts of data on the Web has developed an undeferring need to locate right information at right time, as well as to integrating information effectively to provide a comprehensive source of relevant information. There is a need to develop efficient tools for analyzing and managing Web data, and efficiently managing Web information from the database perspective. The book proposes a data model called WHOM (Warehouse Object Model) to represent HTML and XML documents in the warehouse. It defines a set of web algebraic operators for building new web tables by extracting relevant data from the Web, as well as generating new tables from existing ones. These algebraic operators are used for change detection.
Text presents a data model called WHOM (WareHouse Object Model) to represent HTML and XML documents in the warehouse. Defines a set of web-algebraic operators for building new web tables from existing ones. For database-management systems developers, enterprise web-site developers, and applied R&D researchers. DLC: Web databases.
This book furnishes a detailed presentation of relevant concepts, models, and methods in a clear, simple style, providing an authoritative and comprehensive survey and resource for web database management systems developers and enterprise Web site developers. The book also outlines many research directions and possible extensions to the ideas presented, which makes this book very helpful for students doing research in this area.
The book has very strong emphasis and theoretical perspective on designing the web data model, schema development, web algebraic operators, web data visualization, change management, and knowledge discovery. The solid theoretical foundation will provide a good platform for building the web query language and other tools for the manipulation of warehoused data. The implementation of the discussed algorithms will be good exercises for undergraduate and graduate students to learn more about these operators from a system perspective. Similarly, the development of tools for application developers will serve as the foundation for building the web warehouse.
This book assumes that readers have some introductory knowledge of relational database systems and HTML or XML as well as some knowledge of HTML or XML syntax and semantics for understanding of the initial chapters. A database course at the undergraduate or graduate level or familiarity with the concepts of relational schema, data model, and algebraic operators is a sufficient prerequisite for digesting the concept described in the later chapters. For professionals, working knowledge of relational database systems and HTML programming is sufficient to grasp the ideas presented throughout this book. Some exposure to the internals of search engines will help in comparing some of the methodology described in the context of the web warehouse. A good knowledge of C++/Java programming language at a beginner’s level is sufficient to code the algorithm described herein. For readers interested in learning the area of Web data management, the book provides many examples throughout the chapters, which highlight and explain the intrinsic details. |