Skip to main navigation Skip to search Skip to main content

Bayesian Hierarchical Stacking: Some Models Are (Somewhere) Useful

  • Yuling Yao*
  • , Gregor Pirš
  • , Aki Vehtari
  • , Andrew Gelman
  • *Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

29 Citations (Scopus)
293 Downloads (Pure)

Abstract

Stacking is a widely used model averaging technique that asymptotically yields optimal predictions among linear averages. We show that stacking is most effective when model predictive performance is heterogeneous in inputs, and we can further improve the stacked mixture with a hierarchical model. We generalize stacking to Bayesian hierarchical stacking. The model weights are varying as a function of data, partially-pooled, and inferred using Bayesian inference. We further incorporate discrete and continuous inputs, other structured priors, and time series and longitudinal data. To verify the performance gain of the proposed method, we derive theory bounds, and demonstrate on several applied problems.

Original languageEnglish
Pages (from-to)1043-1071
Number of pages29
JournalBayesian Analysis
Volume17
Issue number4
DOIs
Publication statusPublished - Dec 2022
MoE publication typeA1 Journal article-refereed

Funding

∗The authors thank the National Science Foundation, Institute of Education Sciences, Office of Naval Research, National Institutes of Health, Sloan Foundation, Schmidt Futures, and the Academy of Finland Flagship programme: Finnish Center for Artificial Intelligence (FCAI) for partial financial support. Gregor Pirˇs is supported by the Slovenian Research Agency young researcher grant. †Flatiron Institute, New York, USA, [email protected] ‡Faculty of Computer and Information Science, University of Ljubljana, Ljubljana, Slovenia §Department of Computer Science, Aalto University, Espoo, Finland ¶Department of Statistics and Political Science, Columbia University, New York, USA

Keywords

  • Bayesian hierarchical modeling
  • Conditional prediction
  • Covariate shift
  • Model averaging
  • Prior construction
  • Stacking

Fingerprint

Dive into the research topics of 'Bayesian Hierarchical Stacking: Some Models Are (Somewhere) Useful'. Together they form a unique fingerprint.

Cite this