Modeling KDD processes within the inductive database framework

Jean François Boulicaut, Mika Klemettinen, Heikki Mannila

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

29 Citations (Scopus)

Abstract

One of the most challenging problems in data manipulation in the future is to be able to efficiently handle very large databases but also multiple induced properties or generalizations in that data. Popular examples of useful properties are association rules, and inclusion and functional dependencies. Our view of a possible approach for this task is to specify and query inductive databases, which are databases that in addition to data also contain intensionally defined generalizations about the data. We formalize this concept and show how it can be used throughout the whole process of data mining due to the closure property of the framework. We show that simple query languages can be defined using normal database terminology. We demonstrate the use of this framework to model typical data mining processes. It is then possible to perform various tasks on these descriptions like, e.g., optimizing the selection of interesting properties or comparing two processes.

Original languageEnglish
Title of host publicationData Warehousing and Knowledge Discovery - 1st International Conference, DaWaK 1999, Proceedings
EditorsA. Min Tjoa, Mukesh Mohania
PublisherSpringer
Pages293-302
Number of pages10
ISBN (Print)3540664580, 9783540664581
DOIs
Publication statusPublished - 1999
MoE publication typeA4 Conference publication
EventInternational Conference on Data Warehousing and Knowledge Discovery - Florence, Italy
Duration: 30 Aug 19991 Sept 1999
Conference number: 1

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume1676
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceInternational Conference on Data Warehousing and Knowledge Discovery
Abbreviated titleDaWak
Country/TerritoryItaly
CityFlorence
Period30/08/199901/09/1999

Fingerprint

Dive into the research topics of 'Modeling KDD processes within the inductive database framework'. Together they form a unique fingerprint.

Cite this