Siirry päänavigointiin Siirry hakuun Siirry pääsisältöön

Insightful dimensionality reduction with very low rank variable subsets

  • Bruno Ordozgoiti
  • , Sachith Pai
  • , Marta Kolczynska
  • University of Helsinki
  • Polish Academy of Sciences

Tutkimustuotos: Artikkeli kirjassa/konferenssijulkaisussaConference article in proceedingsScientificvertaisarvioitu

1 Sitaatiot (Scopus)
175 Lataukset (Pure)

Abstrakti

Dimensionality reduction techniques can be employed to produce robust, cost-effective predictive models, and to enhance interpretability in exploratory data analysis. However, the models produced by many of these methods are formulated in terms of abstract factors or are too high-dimensional to facilitate insight and fit within low computational budgets. In this paper we explore an alternative approach to interpretable dimensionality reduction. Given a data matrix, we study the following question: are there subsets of variables that can be primarily explained by a single factor? We formulate this challenge as the problem of finding submatrices close to rank one. Despite its potential, this topic has not been sufficiently addressed in the literature, and there exist virtually no algorithms for this purpose that are simultaneously effective, efficient and scalable. We formalize the task as two problems which we characterize in terms of computational complexity, and propose efficient, scalable algorithms with approximation guarantees. Our experiments demonstrate how our approach can produce insightful findings in data, and show our algorithms to be superior to strong baselines.

AlkuperäiskieliEnglanti
OtsikkoProceedings of the Web Conference, WWW 2021
KustantajaACM
Sivut3066-3075
Sivumäärä10
ISBN (elektroninen)9781450383127
DOI - pysyväislinkit
TilaJulkaistu - 3 kesäk. 2021
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisussa
TapahtumaThe Web Conference - Ljubljana, Slovenia
Kesto: 19 huhtik. 202123 huhtik. 2021

Conference

ConferenceThe Web Conference
LyhennettäWWW
Maa/AlueSlovenia
KaupunkiLjubljana
Ajanjakso19/04/202123/04/2021

Rahoitus

This work was supported by the Academy of Finland project AIDA (317085), the EC H2020RIA project “SoBigData++” (871042), and the Polish National Agency for Academic Exchange within the Bekker programme, number PPN/BEK/2019/1/00133.

Sormenjälki

Sukella tutkimusaiheisiin 'Insightful dimensionality reduction with very low rank variable subsets'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.
  • -: SoBigData-PlusPlus

    Roy, C. (Projektin jäsen), Kaski, K. (Projektin jäsen) & Bhattacharya, K. (Projektin jäsen)

    01/01/202031/12/2025

    Projekti: EU H2020 Framework program

  • Adaptiivinen ja älykäs data

    Gionis, A. (Vastuullinen johtaja), Mahadevan, A. (Projektin jäsen), Zhang, G. (Projektin jäsen), Papatheodorou, D. (Projektin jäsen), Ordozgoiti Rubio, B. (Projektin jäsen) & Muniyappa, S. (Projektin jäsen)

    01/01/201830/06/2022

    Projekti: Academy of Finland: Other research funding

Siteeraa tätä