Abstrakti

Preferential Bayesian Optimization (PBO) is a sample-efficient method to learn latent user utilities from preferential feedback over a pair of designs. It relies on a statistical surrogate model for the latent function, usually a Gaussian process, and an acquisition strategy to select the next candidate pair to get user feedback on. Due to the non-conjugacy of the associated likelihood, every PBO step requires a significant amount of computations with various approximate inference techniques. This computational overhead is incompatible with the way humans interact with computers, hindering the use of PBO in real-world cases. Building on the recent advances of amortized BO, we propose to circumvent this issue by fully amortizing PBO, meta-learning both the surrogate and the acquisition function. Our method comprises a novel transformer neural process architecture, trained using reinforcement learning and tailored auxiliary losses. On a benchmark composed of synthetic and real-world datasets, our method is several orders of magnitude faster than the usual Gaussian process-based strategies and often outperforms them in accuracy.

AlkuperäiskieliEnglanti
Otsikko13th International Conference on Learning Representations, ICLR 2025
KustantajaCurran Associates Inc.
Sivut98689-98713
Sivumäärä25
ISBN (elektroninen)9798331320850
TilaJulkaistu - 2025
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisussa
TapahtumaInternational Conference on Learning Representations - Singapore, Singapore, Singapore
Kesto: 24 huhtik. 202528 huhtik. 2025
Konferenssinumero: 13
https://iclr.cc/

Conference

ConferenceInternational Conference on Learning Representations
LyhennettäICLR
Maa/AlueSingapore
KaupunkiSingapore
Ajanjakso24/04/202528/04/2025
www-osoite

Rahoitus

XZ, DH and SK were supported by the Research Council of Finland (Flagship programme: Finnish Center for Artificial Intelligence FCAI and decision 341763). SK was also supported by the UKRI Turing AI World-Leading Researcher Fellowship, [EP/W002973/1]. The authors wish to thank Aalto Science-IT project, for the computational and data storage resources provided.

Sormenjälki

Sukella tutkimusaiheisiin 'PABBO: Preferential Amortized Black-Box Optimization'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.

Siteeraa tätä