Abstract
End-to-End speech recognition has become the center of attention in speech recognition research, but hybrid Hidden Markov Model / Deep Neural Network (HMM/DNN) systems remain a competitive approach in terms of performance. End-to-End models may be better at very large data scales, and HMM/DNN systems may have an advantage in low-resource scenarios, but the thousand-hour scale is particularly interesting for comparisons. At that scale, experiments have not been able to conclusively demonstrate which approach is best, or whether the heterogeneous approaches yield similar results. In this work, we work towards answering that question for Attention-based Encoder-Decoder models compared with HMM/DNN systems. We present two simple experimental design principles and show how to build systems adhering to those principles. We demonstrate how those principles remove confounding variables related to data as well as to neural architecture and training. We apply the principles in a set of experiments on three diverse thousand-hour-scale tasks. In our experiments, the HMM/DNN systems yield equal or better results in almost all cases.
Original language | English |
---|---|
Pages | 623-638 |
Number of pages | 16 |
Journal | IEEE/ACM Transactions on Audio, Speech, and Language Processing |
Volume | 32 |
Early online date | 24 Nov 2023 |
DOI - permanent links | |
Status | Published - 2024 |
Ministry of Education and Culture (OKM) publication type | A1 Original article in a scientific journal |
Fingerprint
Dive into the research topics of 'Principled Comparisons for End-to-End Speech Recognition: Attention vs Hybrid at the 1000-hour Scale'. Together they form a unique fingerprint.
Projects
- 2 Active
- LAREINA: LAREINA - Language Resource Infrastructure for AI
  Kurimo, M., Moisio, A., Getman, Y., Porjazovski, D., Rouhe, A. & Virkkunen, A.
  01/01/2023 → 31/12/2025
  Project: Business Finland: Strategic centres for science, technology and innovation (SHOK)
- USSEE: Understanding Speech and Scene with Ears and Eyes
  Kurimo, M., Virkkunen, A. & Grósz, T.
  01/01/2022 → 31/12/2024
  Project: Academy of Finland: Other research funding