Projekteja vuodessa
Abstrakti
Traditional topic identification solutions from audio rely on an automatic speech recognition system (ASR) to produce transcripts used as input to a text-based model. These approaches work well in high-resource scenarios, where there are sufficient data to train both components of the pipeline. However, in low-resource situations, the ASR system, even if available, produces low-quality transcripts, leading to a bad text-based classifier. Moreover, spontaneous speech containing hesitations can further degrade the performance of the ASR model. In this paper, we investigate alternatives to the standard text-only solutions by comparing audio-only and hybrid techniques of jointly utilising text and audio features. The models evaluated on spontaneous Finnish speech demonstrate that purely audio-based solutions are a viable option when ASR components are not available, while the hybrid multi-modal solutions achieve the best results.
Alkuperäiskieli | Englanti |
---|---|
Otsikko | 2023 31st European Signal Processing Conference (EUSIPCO) |
Kustantaja | IEEE |
Sivut | 396-400 |
Sivumäärä | 5 |
ISBN (elektroninen) | 978-9-4645-9360-0 |
ISBN (painettu) | 979-8-3503-2811-0 |
DOI - pysyväislinkit | |
Tila | Julkaistu - 4 syysk. 2023 |
OKM-julkaisutyyppi | A4 Artikkeli konferenssijulkaisussa |
Tapahtuma | European Signal Processing Conference - Helsinki, Suomi Kesto: 4 syysk. 2023 → 8 syysk. 2023 Konferenssinumero: 31 https://eusipco2023.org/ |
Julkaisusarja
Nimi | European Signal Processing Conference |
---|---|
ISSN (elektroninen) | 2076-1465 |
Conference
Conference | European Signal Processing Conference |
---|---|
Lyhennettä | EUSIPCO |
Maa/Alue | Suomi |
Kaupunki | Helsinki |
Ajanjakso | 04/09/2023 → 08/09/2023 |
www-osoite |
Sormenjälki
Sukella tutkimusaiheisiin 'Topic Identification for Spontaneous Speech: Enriching Audio Features with Embedded Linguistic Information'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.Projektit
- 1 Päättynyt
-
USSEE: Understanding Speech and Scene with Ears and Eyes
Kurimo, M. (Vastuullinen tutkija), Virkkunen, A. (Projektin jäsen) & Grósz, T. (Projektin jäsen)
01/01/2022 → 31/12/2024
Projekti: Academy of Finland: Other research funding