Projects per year
Abstract
We introduce a novel positional encoding strategy for Transformer-style models, addressing the shortcomings of existing, often ad hoc, approaches. Our framework implements a flexible mapping from the algebraic specification of a domain to a positional encoding scheme where positions are interpreted as orthogonal operators. This design preserves the structural properties of the source domain, thereby ensuring that the end-model upholds them. The framework can accommodate various structures, including sequences, grids and trees, but also their compositions. We conduct a series of experiments demonstrating the practical applicability of our method. Our results suggest performance on par with or surpassing the current state of the art, without hyper-parameter optimizations or task search'' of any kind.Code is available through https://aalto-quml.github.io/ape/.
Original language | English |
---|---|
Title of host publication | Advances in Neural Information Processing Systems 37 (NeurIPS 2024) |
Editors | A. Globerson, L. Mackey, D. Belgrave, A. Fan, U. Paquet, J. Tomczak, C. Zhang |
Publisher | Curran Associates Inc. |
ISBN (Print) | 9798331314385 |
Publication status | Published - 2025 |
MoE publication type | A4 Conference publication |
Event | Conference on Neural Information Processing Systems - Vancouver, Canada, Vancouver , Canada Duration: 10 Dec 2024 → 15 Dec 2024 Conference number: 38 https://neurips.cc/Conferences/2024 |
Publication series
Name | Advances in Neural Information Processing Systems |
---|---|
Publisher | Curran Associates Inc. |
Volume | 37 |
ISSN (Print) | 1049-5258 |
Conference
Conference | Conference on Neural Information Processing Systems |
---|---|
Abbreviated title | NeurIPS |
Country/Territory | Canada |
City | Vancouver |
Period | 10/12/2024 → 15/12/2024 |
Internet address |
Fingerprint
Dive into the research topics of 'Algebraic Positional Encodings'. Together they form a unique fingerprint.Projects
- 1 Active
-
HEALED/Garg: Human-steered next-generation machine learning for reviving drug design
Garg, V. (Principal investigator)
01/09/2021 → 31/08/2025
Project: RCF Academy Project