Clustering with Fair-Center Representation: Parameterized Approximation Algorithms and Heuristics

Suhas Thejaswi, Ameet Gadekar, Bruno Ordozgoiti, Michal Osadnik

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

3 Citations (Scopus)

Abstract

We study a variant of classical clustering formulations in the context of algorithmic fairness, known as diversity-aware clustering. In this variant we are given a collection of facility subsets, and a solution must contain at least a specified number of facilities from each subset while simultaneously minimizing the clustering objective (k-median or k-means). We investigate the fixed-parameter tractability of these problems and show several negative hardness and inapproximability results, even when we afford exponential running time with respect to some parameters. Motivated by these results we identify natural parameters of the problem, and present fixed-parameter approximation algorithms with approximation ratios (1 + 2 over e +) and (1 + 8 over e + ) for diversity-aware k-median and diversity-aware k-means respectively, and argue that these ratios are essentially tight assuming the gap-exponential time hypothesis. We also present a simple and more practical bicriteria approximation algorithm with better running time bounds. We finally propose efficient and practical heuristics. We evaluate the scalability and effectiveness of our methods in a wide variety of rigorously conducted experiments, on both real and synthetic data.

Original languageEnglish
Title of host publicationKDD 2022 - Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
PublisherACM
Pages1749-1759
Number of pages11
ISBN (Electronic)978-1-4503-9385-0
DOIs
Publication statusPublished - 14 Aug 2022
MoE publication typeA4 Conference publication
EventACM SIGKDD International Conference on Knowledge Discovery and Data Mining - Washington, United States
Duration: 14 Aug 202218 Aug 2022
Conference number: 28
https://kdd.org/kdd2022/

Conference

ConferenceACM SIGKDD International Conference on Knowledge Discovery and Data Mining
Abbreviated titleKDD
Country/TerritoryUnited States
CityWashington
Period14/08/202218/08/2022
Internet address

Keywords

  • algorithmic fairness
  • clustering
  • fixed parameter tractability
  • parameterized approximation algorithms

Fingerprint

Dive into the research topics of 'Clustering with Fair-Center Representation: Parameterized Approximation Algorithms and Heuristics'. Together they form a unique fingerprint.
  • MLDB: Model Management Systems: Machine learning meets Database Systems

    Gionis, A. (Principal investigator), Aslay, C. (Project Member), Ciaperoni, M. (Project Member), Xiao, H. (Project Member), Matakos, A. (Project Member) & Muniyappa, S. (Project Member)

    01/09/201931/08/2023

    Project: Academy of Finland: Other research funding

  • ALGOCom: Novel Algorithmic Techniques through the Lens of Combinatorics

    Chalermsook, P. (Principal investigator), Jindal, G. (Project Member), Franck, M. (Project Member), Khodamoradi, K. (Project Member), Yingchareonthawornchai, S. (Project Member), Gadekar, A. (Project Member), Orgo, L. (Project Member), Jiamjitrak, W. (Project Member) & Spoerhase, J. (Project Member)

    01/02/201831/01/2024

    Project: EU: ERC grants

  • Adaptive and intelligent data

    Gionis, A. (Principal investigator), Ordozgoiti Rubio, B. (Project Member), Zhang, G. (Project Member) & Muniyappa, S. (Project Member)

    01/01/201830/06/2022

    Project: Academy of Finland: Other research funding

Cite this