TY - GEN
T1 - Using dependency grammar features in whole sentence maximum entropy language model for speech recognition
AU - Ruokolainen, Teemu
AU - Alumäe, Tanel
AU - Dobrinkat, Marcus
PY - 2010
Y1 - 2010
N2 - In automatic speech recognition, the standard choice for a language model is the well-known n-gram model. The n-grams are used to predict the probability of a word given its n-1 preceding words. However, the n-gram model is not able to explicitly learn grammatical relations of the sentence. In the present work, in order to augment the n-gram model with grammatical features, we apply the Whole Sentence Maximum Entropy framework. The grammatical features are head-modifier relations between pairs of words, together with the labels of the relationships, obtained with the dependency grammar. We evaluate the model in a large vocabulary speech recognition task with Wall Street Journal speech corpus. The results show a substantial improvement in both test set perplexity and word error rate.
AB - In automatic speech recognition, the standard choice for a language model is the well-known n-gram model. The n-grams are used to predict the probability of a word given its n-1 preceding words. However, the n-gram model is not able to explicitly learn grammatical relations of the sentence. In the present work, in order to augment the n-gram model with grammatical features, we apply the Whole Sentence Maximum Entropy framework. The grammatical features are head-modifier relations between pairs of words, together with the labels of the relationships, obtained with the dependency grammar. We evaluate the model in a large vocabulary speech recognition task with Wall Street Journal speech corpus. The results show a substantial improvement in both test set perplexity and word error rate.
KW - dependency grammar
KW - language modeling
KW - speech recognition
KW - whole sentence maximum entropy model
UR - http://www.scopus.com/inward/record.url?scp=78049256444&partnerID=8YFLogxK
U2 - 10.3233/978-1-60750-641-6-73
DO - 10.3233/978-1-60750-641-6-73
M3 - Conference article in proceedings
AN - SCOPUS:78049256444
SN - 9781607506409
VL - 219
T3 - Frontiers in Artificial Intelligence and Applications
SP - 73
EP - 79
BT - Human Language Technologies - The Baltic Perspective
PB - IOS Press
ER -