Projekteja vuodessa
Abstrakti
Traditional spoken emotion recognition solutions often process entire utterances all at once, ignoring the emotional variability within the speech. This shortcoming, especially plaguing end-to-end models, prompted us to investigate a segment-based technique processing only short parts of the audio, improving the recognition accuracy across three diverse emotion datasets. Furthermore, we employed a triplet loss to increase inter-class separability, demonstrating that combining it effectively with segment-based processing within our multi-task learning framework leads to improvements on both English and Finnish datasets. Our proposed method achieves 8.1% unweighted average recall improvement over the baseline on the IEMOCAP, 12% on the RAVDESS, and 7.2% on the FESC dataset. The results also indicate that vocalised emotions are strongly concentrated in short segments of speech, and new methods are needed to exploit this fact.
Alkuperäiskieli | Englanti |
---|---|
Otsikko | Proceedings of the 7th International Conference on Natural Language and Speech Processing (ICNLSP 2024) |
Kustantaja | Association for Computational Linguistics |
Sivut | 47-54 |
Sivumäärä | 8 |
ISBN (elektroninen) | 979-8-89176-165-0 |
Tila | Julkaistu - 22 lokak. 2024 |
OKM-julkaisutyyppi | A4 Artikkeli konferenssijulkaisussa |
Tapahtuma | International Conference on Natural Language and Speech Processing - Trento, Italia Kesto: 19 lokak. 2024 → 20 lokak. 2024 https://www.icnlsp.org/2024welcome/ |
Conference
Conference | International Conference on Natural Language and Speech Processing |
---|---|
Lyhennettä | ICNLSP |
Maa/Alue | Italia |
Kaupunki | Trento |
Ajanjakso | 19/10/2024 → 20/10/2024 |
www-osoite |
Sormenjälki
Sukella tutkimusaiheisiin 'Improved Spoken Emotion Recognition With Combined Segment-Based Processing And Triplet Loss'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.Projektit
- 1 Aktiivinen
-
LAREINA: LAREINA - Language Resource Infrastructure for AI
Kurimo, M. (Vastuullinen tutkija), Moisio, A. (Projektin jäsen), Getman, Y. (Projektin jäsen), Porjazovski, D. (Projektin jäsen), Rouhe, A. (Projektin jäsen) & Virkkunen, A. (Projektin jäsen)
01/01/2023 → 31/12/2025
Projekti: Business Finland: Strategic centres for science, technology and innovation (SHOK)