Concise and interpretable multi-label rule sets

Martino Ciaperoni*, Han Xiao, Aristides Gionis

*Tämän työn vastaava kirjoittaja

Tutkimustuotos: LehtiartikkeliArticleScientificvertaisarvioitu

1 Sitaatiot (Scopus)
80 Lataukset (Pure)

Abstrakti

Multi-label classification is becoming increasingly ubiquitous, but not much attention has been paid to interpretability. In this paper, we develop a multi-label classifier that can be represented as a concise set of simple “if-then” rules, and thus, it offers better interpretability compared to black-box models. Notably, our method is able to find a small set of relevant patterns that lead to accurate multi-label classification, while existing rule-based classifiers are myopic and wasteful in searching rules, requiring a large number of rules to achieve high accuracy. In particular, we formulate the problem of choosing multi-label rules to maximize a target function, which considers not only discrimination ability with respect to labels, but also diversity. Accounting for diversity helps to avoid redundancy, and thus, to control the number of rules in the solution set. To tackle the said maximization problem, we propose a 2-approximation algorithm, which circumvents the exponential-size search space of rules using a novel technique to sample highly discriminative and diverse rules. In addition to our theoretical analysis, we provide a thorough experimental evaluation and a case study, which indicate that our approach offers a trade-off between predictive performance and interpretability that is unmatched in previous work.

AlkuperäiskieliEnglanti
Sivut5657-5694
Sivumäärä38
JulkaisuKnowledge and Information Systems
Vuosikerta65
Numero12
Varhainen verkossa julkaisun päivämäärä2023
DOI - pysyväislinkit
TilaJulkaistu - jouluk. 2023
OKM-julkaisutyyppiA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä

Rahoitus

This research is supported by the Academy of Finland project MLDB (325117), the ERC Advanced Grant REBOUND (834862), the EC H2020 RIA project SoBigData++ (871042), and the Wallenberg AI, Autonomous Systems and Software Program (WASP) funded by the Knut and Alice Wallenberg Foundation.

Sormenjälki

Sukella tutkimusaiheisiin 'Concise and interpretable multi-label rule sets'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.
  • -: SoBigData-PlusPlus

    Roy, C. (Projektin jäsen), Kaski, K. (Projektin jäsen) & Bhattacharya, K. (Projektin jäsen)

    01/01/202031/12/2025

    Projekti: EU H2020 Framework program

  • MLDB: Model Management Systems: Machine learning meets Database Systems (MLDB)

    Gionis, A. (Vastuullinen johtaja), Ciaperoni, M. (Projektin jäsen), Xiao, H. (Projektin jäsen), Thejaswi, S. (Projektin jäsen), Matakos, A. (Projektin jäsen) & Aslay, C. (Projektin jäsen)

    01/09/201931/08/2023

    Projekti: Academy of Finland: Other research funding

Siteeraa tätä