This book is about inductive databases and constraint-based data mining, emerging
research topics lying at the intersection of data mining and database research. The
aim of the book as to provide an overview of the state-of- the art in this novel and exciting
research area. Of special interest are the recent methods for constraint-based
mining of global models for prediction and clustering, the unification of pattern
mining approaches through constraint programming, the clarification of the relationship
between mining local patterns and global models, and the proposed integrative
frameworks and approaches for inducive databases. On the application side,
applications to practically relevant problems from bioinformatics are presented.
Inductive databases (IDBs) represent a database view on data mining and knowledge
discovery. IDBs contain not only data, but also generalizations (patterns and
models) valid in the data. In an IDB, ordinary queries can be used to access and manipulate
data, while inductive queries can be used to generate (mine), manipulate,
and apply patterns and models. In the IDB framework, patterns and models become
”first-class citizens” and KDD becomes an extended querying process in which both
the data and the patterns/models that hold in the data are queried.
This book presents inductive databases and constraint-based data mining, emerging research topics lying at the intersection of data mining and database research. The book provides an overview of the state-of-the art in this novel research area. Of special interest are the recent methods for constraint-based mining of global models for prediction and clustering, the unification of pattern mining approaches through constraint programming, the clarification of the relationship between mining local patterns and global models, and the proposed integrative frameworks and approaches for inductive databases. On the application side, applications to practically relevant problems from bioinformatics are presented to attract additional attention from a wider audience.
The primary audience consists of scientists and graduate students in computer science and bio-informatics. Potential readers are likely to attend conferences on databases, data mining/ machine learning, and bio-informatics.