Abstract
Kernel functions are a powerful tool to enhance the k-means clustering algorithm via the kernel trick. It is known that the parameters of the chosen kernel function can have a dramatic impact on the result. In supervised settings, these can be tuned via cross-validation, but for clustering this is not straightforward and heuristics are usually employed. In this paper we study the impact of kernel parameters on kernel k-means. In particular, we derive a lower bound, tight up to constant factors, below which the parameter of the RBF kernel will render kernel k-means meaningless. We argue that grid search can be ineffective for hyperparameter search in this context and propose an alternative algorithm for this purpose. In addition, we offer an efficient implementation based on fast approximate exponentiation with provable quality guarantees. Our experimental results demonstrate the ability of our method to efficiently reveal a rich and useful set of hyperparameter values.
| Original language | English |
|---|---|
| Title of host publication | Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2020, Proceedings |
| Editors | Frank Hutter, Kristian Kersting, Jefrey Lijffijt, Isabel Valera |
| Publisher | Springer |
| Pages | 399-415 |
| Number of pages | 17 |
| Edition | 1 |
| ISBN (Electronic) | 9783030676612 |
| ISBN (Print) | 9783030676605 |
| DOIs | |
| Publication status | Published - Feb 2021 |
| MoE publication type | A4 Conference publication |
| Event | European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases - Virtual, Online Duration: 14 Sept 2020 → 18 Sept 2020 https://ecmlpkdd2020.net/ |
Publication series
| Name | Lecture Notes in Computer Science |
|---|---|
| Publisher | Springer |
| Volume | 12458 |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases |
|---|---|
| Abbreviated title | ECML-PKDD |
| City | Virtual, Online |
| Period | 14/09/2020 → 18/09/2020 |
| Internet address |
Fingerprint
Dive into the research topics of 'Off-the-grid: Fast and Effective Hyperparameter Search for Kernel Clustering'. Together they form a unique fingerprint.Projects
- 1 Finished
-
Adaptive and intelligent data
Gionis, A. (Principal investigator), Mahadevan, A. (Project Member), Zhang, G. (Project Member), Papatheodorou, D. (Project Member), Ordozgoiti Rubio, B. (Project Member) & Muniyappa, S. (Project Member)
01/01/2018 → 30/06/2022
Project: Academy of Finland: Other research funding
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver