Abstract
Standard end-to-end training of attention-based encoder-decoder (AED) ASR models uses only transcribed speech. When these models are compared with HMM/DNN systems, which additionally leverage a large corpus of text-only data and expert-crafted lexica, differences in modeling cannot be disentangled from differences in data. We propose an experimental setup in which only transcribed speech is used to train both model types. To highlight the difference that text-only data can make, we use Finnish, where an expert-crafted lexicon is not needed. With 1500 h of equal data, we find that both ASR paradigms perform similarly, but adding text data quickly improves the HMM/DNN system. On a smaller 160 h subset, we find that HMM/DNN models outperform AED models.
Original language | English |
---|---|
Title of host publication | Speech and Computer - 23rd International Conference, SPECOM 2021, Proceedings |
Editors | Alexey Karpov, Rodmonga Potapova |
Publisher | Springer |
Pages | 602-613 |
Number of pages | 12 |
ISBN (Print) | 9783030878016 |
Publication status | Published - 2021 |
MoE publication type | A4 Conference publication |
Event | International Conference on Speech and Computer (Virtual, Online); Duration: 27 Sept 2021 → 30 Sept 2021; Conference number: 23 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 12997 LNAI |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | International Conference on Speech and Computer |
---|---|
Abbreviated title | SPECOM |
City | Virtual, Online |
Period | 27/09/2021 → 30/09/2021 |
Keywords
- Attention-based Encoder-Decoder
- Equal data
- HMM/DNN
Fingerprint
Dive into the research topics of 'An Equal Data Setting for Attention-Based Encoder-Decoder and HMM/DNN Models: A Case Study in Finnish ASR'. Together they form a unique fingerprint.
Projects
- 1 Finished
- MeMAD: Methods for Managing Audiovisual Data: Combining Automatic Efficiency with Human Accuracy
Kurimo, M. (Principal investigator), Grönroos, S.-A. (Project Member), Brander, T. (Project Member), Porjazovski, D. (Project Member), Raitio, R. (Project Member), Grósz, T. (Project Member), Virkkunen, A. (Project Member) & Rouhe, A. (Project Member)
27/12/2017 → 31/03/2021
Project: EU: Framework programmes funding