Multiple output regression with latent noise

Leo Gillberg, Pekka Marttinen, Matti Pirinen, Antti J. Kangas, Pasi Soininen, Mehreen Ali, Aki S. Havulinna, Marjo Riitta Järvelin, Mika Ala-Korpela, Samuel Kaski

Research output: Contribution to journal › Article › Scientific › peer-review

10 Citations (Scopus)
119 Downloads (Pure)

Abstract

In high-dimensional data, structured noise caused by observed and unobserved factors that affect multiple target variables simultaneously poses a serious challenge for modeling by masking the often weak signal. Therefore, (1) explaining away the structured noise in multiple-output regression is of paramount importance. Additionally, (2) assumptions about the correlation structure of the regression weights are needed. We note that both can be formulated in a natural way in a latent variable model in which both the interesting signal and the noise are mediated through the same latent factors. Under this assumption, the signal model borrows strength from the noise model by encouraging similar effects on correlated targets. We introduce a hyperparameter for the latent signal-to-noise ratio, which turns out to be important for modeling weak signals, and an ordered infinite-dimensional shrinkage prior that resolves the rotational unidentifiability in reduced-rank regression models. Simulations and prediction experiments with metabolite, gene expression, fMRI measurement, and macroeconomic time series data show that our model equals or exceeds state-of-the-art performance and, in particular, outperforms the standard approach of assuming independent noise and signal models.
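The shared-latent-factor assumption described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' implementation: the function name, the matrix names (Psi, Gamma, Omega, E), the dimensions, and the use of a plain variance ratio as a stand-in for the latent signal-to-noise ratio are all assumptions made for illustration. It only shows the generative idea that signal and structured noise pass through the same loading matrix, so that the regression weights are low rank and correlated targets receive similar effects.

```python
import numpy as np


def simulate_latent_noise(n=200, p=10, k=5, s=2, snr=0.1, seed=0):
    """Draw data from Y = (X @ Psi + Omega) @ Gamma + E.

    All names and defaults here are hypothetical, chosen for illustration.
    n: samples, p: covariates, k: targets, s: latent factors.
    """
    rng = np.random.default_rng(seed)
    X = rng.standard_normal((n, p))        # observed covariates
    Psi = rng.standard_normal((p, s))      # covariates -> latent factors
    Gamma = rng.standard_normal((s, k))    # latent factors -> targets (shared)
    Omega = rng.standard_normal((n, s))    # latent (structured) noise
    E = 0.1 * rng.standard_normal((n, k))  # independent residual noise

    # Assumption: treat the latent signal-to-noise ratio as a simple ratio of
    # empirical variances in latent space, and rescale Psi to match `snr`.
    Psi *= np.sqrt(snr * Omega.var() / (X @ Psi).var())

    # Signal and structured noise are both mapped through the same Gamma.
    Y = (X @ Psi + Omega) @ Gamma + E
    return X, Y, Psi, Gamma, Omega


X, Y, Psi, Gamma, Omega = simulate_latent_noise()
weights = Psi @ Gamma           # p x k regression weights, rank at most s
noise_cov = Gamma.T @ Gamma     # k x k covariance induced by the latent noise
```

In this sketch the weights `Psi @ Gamma` and the structured noise covariance `Gamma.T @ Gamma` share the factor `Gamma`, which is one way to read the abstract's statement that the signal model borrows strength from the noise model. A small `snr` corresponds to the weak-signal regime the abstract emphasizes.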

Original language: English
Pages (from-to): 1-35
Number of pages: 35
Journal: Journal of Machine Learning Research
Volume: 17
Publication status: Published - 1 Jun 2016
MoE publication type: A1 Journal article-refereed

Keywords

  • Bayesian reduced-rank regression
  • Latent signal-to-noise ratio
  • Latent variable models
  • Multiple-output regression
  • Nonparametric Bayes
  • Shrinkage priors
  • Structured noise
  • Weak effects

Projects

  • Interactive machine learning from multiple biodata sources

    Kaski, S. (Principal investigator) & Filstroff, L. (Project Member)

    01/01/2016 → 31/08/2021

    Project: Academy of Finland: Other research funding

  • Interactive machine learning from multiple biodata sources

    Kaski, S. (Principal investigator), Reinvall, J. (Project Member), Chen, Y. (Project Member), Daee, P. (Project Member), Qin, X. (Project Member), Jälkö, J. (Project Member), Pesonen, H. (Project Member), Blomstedt, P. (Project Member), Eranti, P. (Project Member), Hegde, P. (Project Member), Siren, J. (Project Member), Peltola, T. (Project Member), Celikok, M. M. (Project Member), Sundin, I. (Project Member), Kangas, J.-K. (Project Member), Afrabandpey, H. (Project Member), Honkamaa, J. (Project Member), Shen, Z. (Project Member) & Aushev, A. (Project Member)

    01/01/2016 → 31/12/2018

    Project: Academy of Finland: Other research funding

  • Computational models and methods for deciphering evolutionary patterns in bacterial genomic data

    Marttinen, P. (Principal investigator), Kumar, Y. (Project Member) & Poyraz, O. (Project Member)

    01/09/2015 → 31/08/2020

    Project: Academy of Finland: Other research funding
