Automatic Speech Recognition with Very Large Conversational Finnish and Estonian Vocabularies

Tutkimustuotos: Lehtiartikkelivertaisarvioitu

Tutkijat

Organisaatiot

Kuvaus

Today, the vocabulary size for language models in large vocabulary speech recognition is typically several hundreds of thousands of words. While this is already sufficient in some applications, the out-of-vocabulary words are still limiting the usability in others. In agglutinative languages the vocabulary for conversational speech should include millions of word forms to cover the spelling variations due to colloquial pronunciations, in addition to the word compounding and inflections. Very large vocabularies are also needed, for example, when the recognition of rare proper names is important.

Yksityiskohdat

AlkuperäiskieliEnglanti
Sivut2085-2097
Sivumäärä13
JulkaisuIEEE/ACM Transactions on Audio, Speech, and Language Processing
Vuosikerta25
Numero11
Varhainen verkossa julkaisun päivämäärä23 elokuuta 2017
TilaJulkaistu - marraskuuta 2017
OKM-julkaisutyyppiA1 Julkaistu artikkeli, soviteltu

Lataa tilasto

Ei tietoja saatavilla

ID: 14570386