Abstract
We introduce a novel problem for diversity-aware clustering. We assume that the potential cluster centers belong to a set of groups defined by protected attributes, such as ethnicity, gender, etc. We then ask to find a minimum-cost clustering of the data into $k$ clusters so that a specified minimum number of cluster centers are chosen from each group. We thus require that all groups are represented in the clustering solution as cluster centers, according to specified requirements. More precisely, we are given a set of clients $C$, a set of facilities, a collection $F = \{F_1, \dots, F_t\}$ of facility groups, a budget $k$, and a set of lower-bound thresholds $R = \{r_1, \dots, r_t\}$, one for each group in $F$. The diversity-aware k-median problem asks to find a set $S$ of $k$ facilities such that $|S \cap F_i| \ge r_i$, that is, at least $r_i$ centers in $S$ are from group $F_i$, and the k-median cost $\sum_{c \in C} \min_{s \in S} d(c, s)$ is minimized. We show that in the general case where the facility groups may overlap, the diversity-aware k-median problem is NP-hard, fixed-parameter intractable with respect to parameter $k$, and inapproximable to any multiplicative factor. On the other hand, when the facility groups are disjoint, approximation algorithms can be obtained by reduction to the matroid median and red-blue median problems. Experimentally, we evaluate our approximation methods for the tractable cases, and present a relaxation-based heuristic for the theoretically intractable case, which can provide high-quality and efficient solutions for real-world datasets.
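To make the problem statement in the abstract concrete, the following is a minimal sketch of the objective and the group constraint, solved by exhaustive search on a toy instance. It is not one of the paper's algorithms: the brute force is exponential in k and only illustrates the definition. All function names, the one-dimensional toy data, and the absolute-difference distance are assumptions introduced here for illustration.

```python
from itertools import combinations


def kmedian_cost(clients, centers, dist):
    """k-median cost: sum over clients of the distance to the nearest center."""
    return sum(min(dist(c, s) for s in centers) for c in clients)


def satisfies_thresholds(centers, groups, thresholds):
    """Check |S ∩ F_i| >= r_i for every facility group F_i."""
    chosen = set(centers)
    return all(len(chosen & g) >= r for g, r in zip(groups, thresholds))


def brute_force_diverse_kmedian(clients, facilities, groups, thresholds, k, dist):
    """Try every k-subset of facilities (illustrative only, exponential time)."""
    best_set, best_cost = None, float("inf")
    for candidate in combinations(facilities, k):
        if not satisfies_thresholds(candidate, groups, thresholds):
            continue  # violates a lower-bound requirement, skip
        cost = kmedian_cost(clients, candidate, dist)
        if cost < best_cost:
            best_set, best_cost = candidate, cost
    return best_set, best_cost


# Hypothetical toy instance: points on a line, distance = absolute difference.
clients = [0, 1, 2, 10, 11, 12]
facilities = [1, 11, 5]
groups = [{1, 11}, {5}]   # two disjoint facility groups F_1, F_2
thresholds = [1, 1]       # at least one center from each group
k = 2


def dist(a, b):
    return abs(a - b)


best_set, best_cost = brute_force_diverse_kmedian(
    clients, facilities, groups, thresholds, k, dist
)
print(best_set, best_cost)  # (11, 5) 14
```

In this toy instance the unconstrained optimum would pick facilities 1 and 11 at cost 4, but that choice leaves the second group unrepresented; the constrained optimum pays a higher cost of 14 to include facility 5.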
| Original language | English |
|---|---|
| Title | Machine Learning and Knowledge Discovery in Databases. Research Track - European Conference, ECML PKDD 2021, Proceedings |
| Editors | Nuria Oliver, Fernando Pérez-Cruz, Stefan Kramer, Jesse Read, Jose A. Lozano |
| Publisher | Springer |
| Pages | 765-780 |
| Number of pages | 16 |
| ISBN (print) | 978-3-030-86519-1 |
| DOI - permanent links | |
| Status | Published - 2021 |
| Ministry of Education and Culture (OKM) publication type | A4 Article in a conference publication |
| Event | European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases - Virtual, Online. Duration: 13 Sept 2021 → 17 Sept 2021 |
Publication series
| Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
|---|---|
| Publisher | Springer |
| Volume | 12976 LNAI |
| ISSN (print) | 0302-9743 |
| ISSN (electronic) | 1611-3349 |
Conference
| Conference | European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases |
|---|---|
| Abbreviated title | ECML PKDD |
| City | Virtual, Online |
| Period | 13/09/2021 → 17/09/2021 |
Funding
This research is supported by the Academy of Finland projects AIDA (317085) and MLDB (325117), the ERC Advanced Grant REBOUND (834862), the EC H2020 RIA project SoBigData (871042), and the Wallenberg AI, Autonomous Systems and Software Program (WASP) funded by the Knut and Alice Wallenberg Foundation.
Fingerprint
Dive into the research topics of 'Diversity-Aware k-median: Clustering with Fair Center Representation'. Together they form a unique fingerprint.
Projects
- 3 Finished
- SoBigData-PlusPlus
  Roy, C. (Project member), Kaski, K. (Project member) & Bhattacharya, K. (Project member)
  01/01/2020 → 31/12/2025
  Project: EU H2020 Framework program
- MLDB: Model Management Systems: Machine learning meets Database Systems (MLDB)
  Gionis, A. (Principal investigator), Ciaperoni, M. (Project member), Xiao, H. (Project member), Muniyappa, S. (Project member), Matakos, A. (Project member) & Aslay, C. (Project member)
  01/09/2019 → 31/08/2023
  Project: Academy of Finland: Other research funding
- Adaptive and Intelligent Data
  Gionis, A. (Principal investigator), Mahadevan, A. (Project member), Zhang, G. (Project member), Papatheodorou, D. (Project member), Ordozgoiti Rubio, B. (Project member) & Muniyappa, S. (Project member)
  01/01/2018 → 30/06/2022
  Project: Academy of Finland: Other research funding