A system for dynamic 3D visualisation of speech recognition paths

Satumino Luz*, Masood Masoodian, Bill Rogers, Bo Zhang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingChapterScientificpeer-review

2 Citations (Scopus)

Abstract

This paper presents an interactive visualisation system that assists users of semi-automatic speech transcription systems to assess alternative recognition results in real time and provide feedback to the speech recognition back-end in an intuitive manner. This prototype uses the OpenGL libraries to implement an animated 3D visual representation of alternative recognition results generated by the Sphinx automatic speech recognition system. It is expected that displaying alternatives dynamically will facilitate early detection of recognition errors and encourage user interaction, which in turn can be used to improve future recognition performance.

Original languageEnglish
Title of host publicationAVI '08: Proceedings of the working conference on Advanced visual interfaces
Pages482-483
Number of pages2
ISBN (Electronic)978-1-60558-141-5
DOIs
Publication statusPublished - 2008
MoE publication typeA3 Part of a book or another research book
EventInternational Working Conference on Advanced Visual Interfaces - Naples, Italy
Duration: 28 May 200830 May 2008

Conference

ConferenceInternational Working Conference on Advanced Visual Interfaces
Abbreviated titleAVI
Country/TerritoryItaly
CityNaples
Period28/05/200830/05/2008

Keywords

  • Animated interfaces
  • Automatic speech transcription
  • Error correction
  • Interactive visualisation

Fingerprint

Dive into the research topics of 'A system for dynamic 3D visualisation of speech recognition paths'. Together they form a unique fingerprint.

Cite this