Projects per year
Abstract
Large pre-trained models are essential in paralinguistic systems, demonstrating effectiveness in tasks like emotion recognition and stuttering detection. In this paper, we employ large pre-trained models for the ACM Multimedia Computational Paralinguistics Challenge, addressing the Requests and Emotion Share tasks. We explore audio-only and hybrid solutions leveraging audio and text modalities. Our empirical results consistently show the superiority of the hybrid approaches over the audio-only models. Moreover, we introduce a Bayesian layer as an alternative to the standard linear output layer. The multimodal fusion approach achieves an 85.4% UAR on HC-Requests and 60.2% on HC-Complaints. The ensemble model for the Emotion Share task yields the best 𝜌 value of .614. The Bayesian wav2vec2 approach, explored in this study, allows us to easily build ensembles, at the cost of fine-tuning only one model. Moreover, we can have usable confidence values instead of the usual overconfident posterior probabilities.
Original language | English |
---|---|
Title of host publication | MM '23: Proceedings of the 31st ACM International Conference on Multimedia |
Publisher | ACM |
Pages | 9477-9481 |
Number of pages | 5 |
ISBN (Electronic) | 979-8-4007-0108-5 |
DOIs | |
Publication status | Published - 27 Oct 2023 |
MoE publication type | A4 Conference publication |
Event | ACM International Conference on Multimedia - Ottawa, Canada Duration: 29 Oct 2023 → 29 Oct 2023 Conference number: 31 |
Conference
Conference | ACM International Conference on Multimedia |
---|---|
Abbreviated title | MM |
Country/Territory | Canada |
City | Ottawa |
Period | 29/10/2023 → 29/10/2023 |
Fingerprint
Dive into the research topics of 'Advancing Audio Emotion and Intent Recognition with Large Pre-Trained Models and Bayesian Inference'. Together they form a unique fingerprint.Projects
- 1 Active
-
USSEE: Understanding Speech and Scene with Ears and Eyes
Kurimo, M. (Principal investigator), Virkkunen, A. (Project Member) & Grósz, T. (Project Member)
01/01/2022 → 31/12/2024
Project: Academy of Finland: Other research funding