Transparent pronunciation scoring using articulatorily weighted phoneme edit distance

Reima Karhila, Anna Riikka Smolander, Sari Ylinen, Mikko Kurimo

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Scientific › peer-review



To study the effects of gamification on children's foreign language learning in the “Say It Again, Kid!” project, we developed a feedback paradigm that can drive gameplay in pronunciation learning games. We describe our scoring system, which is based on the difference between a reference phone sequence and the output of a multilingual CTC phoneme recogniser. We present a white-box scoring model of mapped weighted Levenshtein edit distance between reference and error, with error weights for articulatory differences computed from a training set of scored utterances. The system can produce a human-readable list of each detected mispronunciation's contribution to the utterance score. We compare our scoring method to established black-box methods.
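The scoring idea in the abstract can be illustrated with a minimal sketch of a weighted Levenshtein alignment that also reports each edit's contribution to the total. This is not the authors' implementation: the function names, the voicing-pair table, and all cost values below are hypothetical placeholders chosen by hand, whereas the paper computes its articulatory error weights from a training set of scored utterances. The recognised phone sequence is assumed to come from some phoneme recogniser and is simply passed in as a list.

```python
from typing import Callable, List, Tuple

# (operation, reference phone, recognised phone, cost)
Edit = Tuple[str, str, str, float]


def toy_substitution_cost(ref: str, hyp: str) -> float:
    """Hypothetical weights: a voicing-only confusion is cheaper than other errors."""
    if ref == hyp:
        return 0.0
    voicing_pairs = {frozenset(p) for p in (("p", "b"), ("t", "d"), ("k", "g"), ("s", "z"))}
    return 0.4 if frozenset((ref, hyp)) in voicing_pairs else 1.0


def weighted_alignment(
    ref: List[str],
    hyp: List[str],
    sub_cost: Callable[[str, str], float] = toy_substitution_cost,
    ins_cost: float = 1.0,  # assumed flat insertion cost
    del_cost: float = 1.0,  # assumed flat deletion cost
) -> Tuple[float, List[Edit]]:
    """Return the weighted edit distance and the list of edits that produce it."""
    n, m = len(ref), len(hyp)
    dist = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dist[i][0] = dist[i - 1][0] + del_cost
    for j in range(1, m + 1):
        dist[0][j] = dist[0][j - 1] + ins_cost
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            dist[i][j] = min(
                dist[i - 1][j - 1] + sub_cost(ref[i - 1], hyp[j - 1]),
                dist[i - 1][j] + del_cost,
                dist[i][j - 1] + ins_cost,
            )

    # Walk back through the table to recover which edits make up the score.
    edits: List[Edit] = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            cost = sub_cost(ref[i - 1], hyp[j - 1])
            if abs(dist[i][j] - (dist[i - 1][j - 1] + cost)) < 1e-9:
                if cost > 0:  # exact matches contribute nothing, so skip them
                    edits.append(("substitute", ref[i - 1], hyp[j - 1], cost))
                i, j = i - 1, j - 1
                continue
        if i > 0 and abs(dist[i][j] - (dist[i - 1][j] + del_cost)) < 1e-9:
            edits.append(("delete", ref[i - 1], "", del_cost))
            i -= 1
            continue
        edits.append(("insert", "", hyp[j - 1], ins_cost))
        j -= 1
    edits.reverse()
    return dist[n][m], edits


if __name__ == "__main__":
    total, edits = weighted_alignment(["k", "a", "t"], ["g", "a", "t", "s"])
    for op, r, h, cost in edits:
        print(f"{op:10s} ref={r or '-':2s} hyp={h or '-':2s} cost={cost:.1f}")
    print(f"total distance: {total:.1f}")
```

Because the total is the sum of the listed edit costs, each line of the printout shows how much one detected deviation adds to the utterance-level distance, which is the kind of transparency a white-box scorer aims for.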

Original language: English
Title of host publication: Proceedings of Interspeech
Publisher: International Speech Communication Association
Number of pages: 5
Publication status: Published - 1 Jan 2019
MoE publication type: A4 Article in a conference publication
Event: Interspeech - Graz, Austria
Duration: 15 Sep 2019 - 19 Sep 2019

Publication series

Name: Interspeech - Annual Conference of the International Speech Communication Association
ISSN (Electronic): 2308-457X


Keywords


  • Computer Assisted Pronunciation Training
  • Mispronunciation Detection
  • Multilingual Phoneme Recognition

