Concise and interpretable multi-label rule sets

Martino Ciaperoni*, Han Xiao, Aristides Gionis

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

1 Citation (Scopus)

Abstract

Multi-label classification is becoming increasingly ubiquitous, but not much attention has been paid to interpretability. In this paper, we develop a multi-label classifier that can be represented as a concise set of simple 'if-then' rules, and thus, it offers better interpretability compared to black-box models. Notably, our method is able to find a small set of relevant patterns that lead to accurate multi-label classification, while existing rule-based classifiers are myopic and wasteful in searching rules, requiring a large number of rules to achieve high accuracy. In particular, we formulate the problem of choosing multi-label rules to maximize a target function, which considers not only discrimination ability with respect to labels, but also diversity. Accounting for diversity helps to avoid redundancy, and thus, to control the number of rules in the solution set. To tackle the said maximization problem we propose a 2-approximation algorithm, which relies on a novel technique to sample high-quality rules. In addition to our theoretical analysis, we provide a thorough experimental evaluation, which indicates that our approach offers a trade-off between predictive performance and interpretability that is unmatched in previous work.

Original languageEnglish
Title of host publicationProceedings - 22nd IEEE International Conference on Data Mining, ICDM 2022
EditorsXingquan Zhu, Sanjay Ranka, My T. Thai, Takashi Washio, Xindong Wu
PublisherIEEE
Pages71-80
Number of pages10
ISBN (Electronic)978-1-6654-5099-7
DOIs
Publication statusPublished - 2022
MoE publication typeA4 Conference publication
EventIEEE International Conference on Data Mining - Hilton Hotel, Hilton Orlando, 6001 Destination Pkwy, Orlando, United States
Duration: 28 Nov 20221 Dec 2022
Conference number: 22
https://icdm22.cse.usf.edu/registration.html

Publication series

NameIEEE International Conference on Data Mining
Volume2022-November
ISSN (Print)1550-4786

Conference

ConferenceIEEE International Conference on Data Mining
Abbreviated titleICDM
Country/TerritoryUnited States
CityOrlando
Period28/11/202201/12/2022
Internet address

Funding

ACKNOWLEDGMENTS This research is supported by the Academy of Finland projects AIDA (317085) and MLDB (325117), the ERC Advanced Grant REBOUND (834862), the EC H2020 RIA project SoBigData++ (871042), and the Wallenberg AI, Autonomous Systems and Software Program (WASP) funded by the Knut and Alice Wallenberg Foundation.

Fingerprint

Dive into the research topics of 'Concise and interpretable multi-label rule sets'. Together they form a unique fingerprint.
  • -: SoBigData-PlusPlus

    Roy, C. (Project Member), Kaski, K. (Project Member) & Bhattacharya, K. (Project Member)

    01/01/202031/12/2025

    Project: EU H2020 Framework program

  • MLDB: Model Management Systems: Machine learning meets Database Systems

    Gionis, A. (Principal investigator), Ciaperoni, M. (Project Member), Xiao, H. (Project Member), Thejaswi, S. (Project Member), Matakos, A. (Project Member) & Aslay, C. (Project Member)

    01/09/201931/08/2023

    Project: Academy of Finland: Other research funding

  • Adaptive and intelligent data

    Gionis, A. (Principal investigator), Mahadevan, A. (Project Member), Zhang, G. (Project Member), Papatheodorou, D. (Project Member), Ordozgoiti Rubio, B. (Project Member) & Thejaswi, S. (Project Member)

    01/01/201830/06/2022

    Project: Academy of Finland: Other research funding

Cite this