Interesting patterns

Jilles Vreeken*, Nikolaj Tatti

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingChapterScientificpeer-review

    34 Citations (Scopus)

    Abstract

    Pattern mining is one of the most important aspects of data mining. By far the most popular and well-known approach is frequent pattern mining. That is, to discover patterns that occur in many transactions. This approach has many virtues including monotonicity, which allows efficient discovery of all frequent patterns. Nevertheless, in practice frequent pattern mining rarely gives good results-the number of discovered patterns is typically gargantuan and they are heavily redundant. Consequently, a lot of research effort has been invested toward improving the quality of the discovered patterns. In this chapter we will give an overview of the interestingness measures and other redundancy reduction techniques that have been proposed to this end. In particular, we first present classic techniques such as closed and non-derivable itemsets that are used to prune unnecessary itemsets. We then discuss techniques for ranking patterns on how expected their score is under a null hypothesis-considering patterns that deviate from this expectation to be interesting. These models can either be static, as well as dynamic; we can iteratively update this model as we discover new patterns. More generally, we also give a brief overview on pattern set mining techniques, where we measure quality over a set of patterns, instead of individually. This setup gives us freedom to explicitly punish redundancy which leads to a more to-the-point results.

    Original languageEnglish
    Title of host publicationFrequent Pattern Mining
    PublisherSpringer
    Pages105-134
    Number of pages30
    Volume9783319078212
    ISBN (Electronic)9783319078212
    ISBN (Print)3319078208, 9783319078205
    DOIs
    Publication statusPublished - 1 Jul 2014
    MoE publication typeA3 Book section, Chapters in research books

    Keywords

    • Interestingness measures
    • Pattern mining
    • Pattern set mining
    • Ranking
    • Statistics

    Fingerprint

    Dive into the research topics of 'Interesting patterns'. Together they form a unique fingerprint.

    Cite this