Abstract
Large-scale, pre-trained models have revolutionized the field of sentiment analysis and enabled multimodal systems to be developed quickly. In this paper, we address two challenges posed by the Multimodal Sentiment Analysis (MuSe) 2023 competition by focusing on automatically detecting cross-cultural humor and predicting three continuous emotion targets from user-generated videos. Multiple methods in the literature already demonstrate the importance of embedded features generated by popular pre-trained neural solutions. Based on their success, we can assume that the embedded space consists of several sub-spaces relevant to different tasks. Our aim is to automatically identify the task-specific sub-spaces of various embeddings by interpreting the baseline neural models. Once the relevant dimensions are located, we train a new model using only those features, which leads to similar or slightly better results with a considerably smaller and faster model. The best Humor Detection model using only the relevant sub-space of audio embeddings contained approximately 54% fewer parameters than the one processing the whole encoded vector, required 48% less training time, and even outperformed the larger model. Our empirical results validate that, indeed, only a portion of the embedding space is needed to achieve good performance. Our solution could be considered a novel form of knowledge distillation, which enables new ways of transferring knowledge from one model into another.
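The idea summarized in the abstract (attribute a baseline model's predictions to individual embedding dimensions, then keep only the most relevant ones) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a toy logistic model with fixed weights as a stand-in for a trained baseline network, an all-zeros attribution baseline, and ranking by mean absolute Integrated Gradients attribution; the names `integrated_gradients` and `select_relevant_dims` are hypothetical.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def integrated_gradients(x, baseline, w, b, steps=50):
    """Integrated Gradients for a logistic model f(x) = sigmoid(w.x + b).

    Approximates the path integral of the gradient along the straight
    line from `baseline` to `x` with a midpoint Riemann sum.
    """
    alphas = (np.arange(steps) + 0.5) / steps            # midpoints in (0, 1)
    path = baseline + alphas[:, None] * (x - baseline)   # (steps, dim)
    p = sigmoid(path @ w + b)                            # (steps,)
    grads = (p * (1.0 - p))[:, None] * w                 # df/dx along the path
    return (x - baseline) * grads.mean(axis=0)


def select_relevant_dims(X, w, b, k):
    """Return the indices of the k dimensions with the largest mean |attribution|."""
    baseline = np.zeros(X.shape[1])  # assumption: all-zeros baseline
    attributions = np.stack(
        [integrated_gradients(x, baseline, w, b) for x in X]
    )
    importance = np.abs(attributions).mean(axis=0)
    return np.sort(np.argsort(importance)[-k:])


# Toy stand-in for a trained baseline model: only the first 4 of 16
# embedding dimensions influence the prediction (assumed weights).
rng = np.random.default_rng(0)
dim = 16
w = np.zeros(dim)
w[:4] = [2.0, -1.5, 1.0, 0.5]
b = 0.0
X = rng.normal(size=(200, dim))

selected = select_relevant_dims(X, w, b, k=4)
X_reduced = X[:, selected]  # features for a smaller follow-up model
print(selected, X_reduced.shape)
```

In this toy setup the selected indices are exactly the dimensions with non-zero weights, and a follow-up model would be trained on `X_reduced` only. With a real neural baseline, the analytic gradient would be replaced by automatic differentiation, and the choice of attribution baseline and of `k` would need to be validated empirically.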
Original language | English |
---|---|
Title | MuSe '23: Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and Workshop: Mimicked Emotions, Humour and Personalisation |
Publisher | ACM |
Pages | 27-34 |
Number of pages | 8 |
ISBN (print) | 979-8-4007-0270-9 |
DOI - permanent links | |
Status | Published - 2 Nov 2023 |
MoE publication type | A4 Article in conference proceedings |
Event | ACM International Conference on Multimedia - Ottawa, Canada; Duration: 29 Oct 2023 → 29 Oct 2023; Conference number: 31 |
Conference
Conference | ACM International Conference on Multimedia |
---|---|
Abbreviated title | MM |
Country/Territory | Canada |
City | Ottawa |
Period | 29/10/2023 → 29/10/2023 |
Fingerprint
Dive into the research topics of 'Discovering Relevant Sub-spaces of BERT, Wav2Vec 2.0, ELECTRA and ViT Embeddings for Humor and Mimicked Emotion Recognition with Integrated Gradients'. Together they form a unique fingerprint.
Projects
- 1 Finished
- USSEE: Understanding Speech and Scene with Ears and Eyes
Kurimo, M. (Principal investigator), Virkkunen, A. (Project member) & Grósz, T. (Project member)
01/01/2022 → 31/12/2024
Project: Academy of Finland: Other research funding