Rethinking Inference in Gaussian Processes: A Dual Parameterization Approach

Research output: ThesisDoctoral ThesisCollection of Articles

Abstract

Uncertainty quantification is a vital aspect of machine learning, especially when accurate estimates of uncertainty are crucial for making informed decisions. Gaussian Processes (GPs), known for their versatility as function space priors, find wide-ranging applications in diverse fields, including climate modelling and epidemiology. GPs are particularly useful due to their non-linearity, allowing them to adapt to various data patterns and their ability to integrate domain-specific knowledge. As probabilistic models, they offer predictions and quantify the uncertainty within these predictions, an essential feature in scenarios demanding reliable forecasts.  This thesis focuses on applying GPs to large-scale, non-Gaussian sequential data. Due to their non-parametric nature, GPs face increasing computational demands as data size grows. The requirement for a conjugate Gaussian likelihood for computational tractability presents further challenges. Therefore, it is common to use approximate inference for applying GPs to non-Gaussian likelihoods alongside scalable model formulations to handle complex data distributions in real- world applications.  The theme connecting the papers is an innovative approach to parameterizing the optimization problem in approximate inference, centring on a forgotten parameterization termed the dual parameters. This fresh perspective offers methods to enhance the efficiency of GPs in applying them to large and complex datasets, particularly in the context of sequential data. This approach addresses the pivotal challenges of tractability and scalability inherent in GPs in the sequential setting. The concept of dual parameters serves as a unifying framework, linking all approximate inference techniques through their various likelihood approximations.  In addition, the thesis shows the application of dual parameterization methods in a range of GP model formulations and problem settings. It introduces a new algorithm for inference and learning in non-Gaussian time series data and the sparse GP framework. The applications discussed extend to areas such as Bayesian optimization and continual learning, highlighting the adaptability and potential of GPs in contemporary machine learning.
Translated title of the contributionRethinking Inference in Gaussian Processes: A Dual Parameterization Approach
Original languageEnglish
QualificationDoctor's degree
Awarding Institution
  • Aalto University
Supervisors/Advisors
  • Solin, Arno, Supervising Professor
Publisher
Print ISBNs978-952-64-2125-4
Electronic ISBNs978-952-64-2126-1
Publication statusPublished - 2024
MoE publication typeG5 Doctoral dissertation (article)

Keywords

  • machine learning
  • Gaussian Processes
  • climate modelling
  • epidemiology

Fingerprint

Dive into the research topics of 'Rethinking Inference in Gaussian Processes: A Dual Parameterization Approach'. Together they form a unique fingerprint.

Cite this