User Modelling for Avoiding Overfitting in Interactive Knowledge Elicitation for Prediction

Pedram Daee, Tomi Peltola, Aki Vehtari, Samuel Kaski

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

15 Citations (Scopus)
184 Downloads (Pure)

Abstract

In human-in-the-loop machine learning, the user provides information beyond that in the training data. Many algorithms and user interfaces have been designed to optimize and facilitate this human--machine interaction; however, fewer studies have addressed the potential defects the designs can cause. Effective interaction often requires exposing the user to the training data or its statistics. The design of the system is then critical, as this can lead to double use of data and overfitting, if the user reinforces noisy patterns in the data. We propose a user modelling methodology, by assuming simple rational behaviour, to correct the problem. We show, in a user study with 48 participants, that the method improves predictive performance in a sparse linear regression sentiment analysis task, where graded user knowledge on feature relevance is elicited. We believe that the key idea of inferring user knowledge with probabilistic user models has general applicability in guarding against overfitting and improving interactive machine learning.
Original languageEnglish
Title of host publicationIUI 2018 - Proceedings of the 23rd International Conference on Intelligent User Interfaces
PublisherACM
Pages305-310
Number of pages6
ISBN (Electronic)978-1-4503-4945-1
DOIs
Publication statusPublished - 8 Mar 2018
MoE publication typeA4 Conference publication
EventInternational Conference on Intelligent User Interfaces - Tokyo, Japan
Duration: 7 Mar 201811 Mar 2018
Conference number: 23
http://iui.acm.org/2018/index.html

Conference

ConferenceInternational Conference on Intelligent User Interfaces
Abbreviated titleIUI
Country/TerritoryJapan
CityTokyo
Period07/03/201811/03/2018
Internet address

Keywords

  • Interactive machine learning
  • Probabilistic modeling
  • Bayesian Inference
  • overfitting
  • expert prior elicitation
  • human-in-the-loop machine learning

Fingerprint

Dive into the research topics of 'User Modelling for Avoiding Overfitting in Interactive Knowledge Elicitation for Prediction'. Together they form a unique fingerprint.
  • Data-Driven Decision Support for Digital Health

    Kaski, S. (Principal investigator), Vuollekoski, H. (Project Member), Strahl, J. (Project Member), Niinimäki, T. (Project Member), Sundin, I. (Project Member), Blomstedt, P. (Project Member), Hegde, P. (Project Member), Daee, P. (Project Member) & Eranti, P. (Project Member)

    01/01/201630/06/2018

    Project: Academy of Finland: Other research funding

  • Interactive machine learning from multiple biodata sources

    Kaski, S. (Principal investigator) & Filstroff, L. (Project Member)

    01/01/201631/08/2021

    Project: Academy of Finland: Other research funding

  • Interactive machine learning from multiple biodata sources

    Kaski, S. (Principal investigator), Reinvall, J. (Project Member), Chen, Y. (Project Member), Daee, P. (Project Member), Qin, X. (Project Member), Jälkö, J. (Project Member), Pesonen, H. (Project Member), Blomstedt, P. (Project Member), Eranti, P. (Project Member), Hegde, P. (Project Member), Siren, J. (Project Member), Peltola, T. (Project Member), Celikok, M. M. (Project Member), Sundin, I. (Project Member), Kangas, J.-K. (Project Member), Afrabandpey, H. (Project Member), Honkamaa, J. (Project Member), Shen, Z. (Project Member) & Aushev, A. (Project Member)

    01/01/201631/12/2018

    Project: Academy of Finland: Other research funding

Cite this