EyeFormer: Predicting Personalized Scanpaths with Transformer-Guided Reinforcement Learning

Yue Jiang, Zixin Guo, Hamed Rezazadegan Tavakoli, Luis A. Leiva, Antti Oulasvirta

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

2 Citations (Scopus)
36 Downloads (Pure)

Abstract

From a visual-perception perspective, modern graphical user interfaces (GUIs) comprise a complex graphics-rich two-dimensional visuospatial arrangement of text, images, and interactive objects such as buttons and menus. While existing models can accurately predict regions and objects that are likely to attract attention “on average”, no scanpath model has been capable of predicting scanpaths for an individual. To close this gap, we introduce EyeFormer, which utilizes a Transformer architecture as a policy network to guide a deep reinforcement learning algorithm that predicts gaze locations. Our model offers the unique capability of producing personalized predictions when given a few user scanpath samples. It can predict full scanpath information, including fixation positions and durations, across individuals and various stimulus types. Additionally, we demonstrate applications in GUI layout optimization driven by our model.
Original languageEnglish
Title of host publicationProceedings of the 37th Annual ACM Symposium on User Interface Software and Technology
Place of PublicationNew York
PublisherACM
Number of pages15
ISBN (Electronic)9798400706288
ISBN (Print)979-8-4007-0628-8
DOIs
Publication statusPublished - 13 Oct 2024
MoE publication typeA4 Conference publication
EventACM Symposium on User Interface Software and Technology - Pittsburgh, United States
Duration: 13 Oct 202416 Oct 2024
Conference number: 37

Publication series

NameUIST '24
PublisherAssociation for Computing Machinery

Conference

ConferenceACM Symposium on User Interface Software and Technology
Abbreviated titleUIST
Country/TerritoryUnited States
CityPittsburgh
Period13/10/202416/10/2024

Fingerprint

Dive into the research topics of 'EyeFormer: Predicting Personalized Scanpaths with Transformer-Guided Reinforcement Learning'. Together they form a unique fingerprint.
  • Science-IT

    Hakala, M. (Manager)

    School of Science

    Facility/equipment: Facility

Cite this