Abstract
We introduce a novel positional encoding strategy for Transformer-style models, addressing the shortcomings of existing, often ad hoc, approaches. Our framework implements a flexible mapping from the algebraic specification of a domain to a positional encoding scheme where positions are interpreted as orthogonal operators. This design preserves the structural properties of the source domain, thereby ensuring that the end-model upholds them. The framework can accommodate various structures, including sequences, grids and trees, as well as their compositions. We conduct a series of experiments demonstrating the practical applicability of our method. Our results suggest performance on par with or surpassing the current state of the art, without hyper-parameter optimizations or "task search" of any kind. Code is available through https://aalto-quml.github.io/ape/.
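As a rough illustration of "positions interpreted as orthogonal operators", the sketch below covers only the simplest case, a sequence. It is not taken from the authors' code: the names `rotation`, `position_operator` and the angle `theta` are hypothetical, and a single 2x2 rotation stands in for a general orthogonal matrix. Under these assumptions, position p maps to the p-th power of one orthogonal matrix, so composing operators mirrors adding positions, and the product of one operator's transpose with another depends only on their relative offset.

```python
import math


def rotation(theta):
    """Return the 2x2 rotation matrix for angle theta (an orthogonal matrix)."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s], [s, c]]


def matmul(a, b):
    """Multiply two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def transpose(a):
    """Transpose a 2x2 matrix."""
    return [[a[j][i] for j in range(2)] for i in range(2)]


def position_operator(p, theta=0.3):
    """Map integer position p to W^p, where W = rotation(theta).

    theta is an arbitrary illustrative value, not a setting from the paper.
    """
    return rotation(p * theta)


def close(a, b, tol=1e-9):
    """Element-wise comparison of two 2x2 matrices up to a tolerance."""
    return all(abs(a[i][j] - b[i][j]) < tol
               for i in range(2) for j in range(2))


# Structure preservation: composing operators corresponds to adding positions.
homomorphism_holds = close(
    matmul(position_operator(2), position_operator(3)),
    position_operator(5),
)

# Relative positions: W^p transposed times W^q depends only on q - p.
relative_holds = close(
    matmul(transpose(position_operator(2)), position_operator(5)),
    position_operator(3),
)
```

Both checks follow from the angle-addition identities for rotations. Grids, trees and their compositions, which the abstract also mentions, would need other operator families and are outside this sketch.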
Original language | English |
---|---|
Title | Advances in Neural Information Processing Systems 37 (NeurIPS 2024) |
Editors | A. Globerson, L. Mackey, D. Belgrave, A. Fan, U. Paquet, J. Tomczak, C. Zhang |
Publisher | Curran Associates Inc. |
ISBN (print) | 9798331314385 |
Status | Published - 2025 |
OKM (Ministry of Education and Culture) publication type | A4 Article in conference proceedings |
Event | Conference on Neural Information Processing Systems - Vancouver, Canada Duration: 10 Dec 2024 → 15 Dec 2024 Conference number: 38 https://neurips.cc/Conferences/2024 |
Publication series
Name | Advances in Neural Information Processing Systems |
---|---|
Publisher | Curran Associates Inc. |
Volume | 37 |
ISSN (print) | 1049-5258 |
Conference
Conference | Conference on Neural Information Processing Systems |
---|---|
Abbreviated title | NeurIPS |
Country/Territory | Canada |
City | Vancouver |
Period | 10/12/2024 → 15/12/2024 |
Web address |
Fingerprint
Dive into the research topics of 'Algebraic Positional Encodings'. Together they form a unique fingerprint.
Projects
- 1 Active
- HEALED/Garg: Human-steered next-generation machine learning for reviving drug design
Garg, V. (Principal investigator)
01/09/2021 → 31/08/2025
Project: RCF Academy Project