Theory of Mind Based Models in Human-AI Interaction

Mustafa Celikok

Tutkimustuotos: Master's thesis


Humans are social animals. They have goals, they make plans, they collaborate and compete. The richness of human-human interaction is immense. Yet, the way modern AI systems model their interaction with human users does not take these aspects into account. Often times human feedback is modelled as samples from an unknown but fixed probability distribution. These models are not able to capture the active planning aspect of real humans. The underlying motivation of this thesis is that the performance of human-AI collaboration is limited by the parties' ability of modelling each others' minds. In human-human interaction, this ability is called the theory of mind, and it is shown to be a limiting factor in human teams' task performance by cognitive science studies. In order to examine the effects of having theory of mind based user models, we define a multi-armed bandit setting where the system takes into account that the user is able to anticipate the system's behaviour multiple steps ahead, and strategically plan her feedback. We compare the performance of our proposed setting to the standard multi-armed bandit setting where the feedback is assumed to be samples from an unknown probability distribution. Empirical results demonstrate that better reward performance and ranking of arms are achieved when users can behave strategically and the system takes this into account. The results indicate
that the performance of human-AI teams increase based on how well the parties can model each other and use their models to plan their interaction.
Myöntävä instituutio
  • Aalto-yliopisto
Myöntöpäivämäärä10 jouluk. 2018
TilaJulkaistu - 10 jouluk. 2018
OKM-julkaisutyyppiG2 Pro gradu, diplomityö, ylempi amk-opinnäytetyö


Sukella tutkimusaiheisiin 'Theory of Mind Based Models in Human-AI Interaction'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.

Siteeraa tätä