Projects per year
Abstract
Multi-label classification is becoming increasingly ubiquitous, but not much attention has been paid to interpretability. In this paper, we develop a multi-label classifier that can be represented as a concise set of simple 'if-then' rules, and thus, it offers better interpretability compared to black-box models. Notably, our method is able to find a small set of relevant patterns that lead to accurate multi-label classification, while existing rule-based classifiers are myopic and wasteful in searching rules, requiring a large number of rules to achieve high accuracy. In particular, we formulate the problem of choosing multi-label rules to maximize a target function, which considers not only discrimination ability with respect to labels, but also diversity. Accounting for diversity helps to avoid redundancy, and thus, to control the number of rules in the solution set. To tackle the said maximization problem we propose a 2-approximation algorithm, which relies on a novel technique to sample high-quality rules. In addition to our theoretical analysis, we provide a thorough experimental evaluation, which indicates that our approach offers a trade-off between predictive performance and interpretability that is unmatched in previous work.
Original language | English |
---|---|
Title of host publication | Proceedings - 22nd IEEE International Conference on Data Mining, ICDM 2022 |
Editors | Xingquan Zhu, Sanjay Ranka, My T. Thai, Takashi Washio, Xindong Wu |
Publisher | IEEE |
Pages | 71-80 |
Number of pages | 10 |
ISBN (Electronic) | 978-1-6654-5099-7 |
DOIs | |
Publication status | Published - 2022 |
MoE publication type | A4 Conference publication |
Event | IEEE International Conference on Data Mining - Hilton Hotel, Hilton Orlando, 6001 Destination Pkwy, Orlando, United States Duration: 28 Nov 2022 → 1 Dec 2022 Conference number: 22 https://icdm22.cse.usf.edu/registration.html |
Publication series
Name | IEEE International Conference on Data Mining |
---|---|
Volume | 2022-November |
ISSN (Print) | 1550-4786 |
Conference
Conference | IEEE International Conference on Data Mining |
---|---|
Abbreviated title | ICDM |
Country/Territory | United States |
City | Orlando |
Period | 28/11/2022 → 01/12/2022 |
Internet address |
Fingerprint
Dive into the research topics of 'Concise and interpretable multi-label rule sets'. Together they form a unique fingerprint.-
SoBigDataPlusPlus: Integrated Infrastructure for Social Mining and Big Data Analytics
Lampinen, J. (Principal investigator), Roy, C. (Project Member) & Bhattacharya, K. (Project Member)
01/01/2020 → 31/12/2024
Project: EU: Framework programmes funding
-
MLDB: Model Management Systems: Machine learning meets Database Systems
Gionis, A. (Principal investigator), Aslay, C. (Project Member), Ciaperoni, M. (Project Member), Xiao, H. (Project Member), Matakos, A. (Project Member) & Muniyappa, S. (Project Member)
01/09/2019 → 31/08/2023
Project: Academy of Finland: Other research funding
-
Adaptive and intelligent data
Gionis, A. (Principal investigator), Ordozgoiti Rubio, B. (Project Member), Zhang, G. (Project Member) & Muniyappa, S. (Project Member)
01/01/2018 → 30/06/2022
Project: Academy of Finland: Other research funding