Abstract
Speech coding is the most commonly used application of speech processing. Accumulated layers of improvements have, however, made codecs so complex that optimizing individual modules has become increasingly difficult. This work introduces machine learning methodology to speech and audio coding, such that we can optimize quality in terms of overall entropy. We can then use conventional quantization, coding and perceptual models without modification, so that the codec adheres to conventional requirements on algorithmic complexity, latency and robustness to packet loss. Experiments demonstrate that end-to-end optimization of the quantization accuracy of the spectral envelope can be used for a lossless reduction in bitrate of 0.4 kbit/s.
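As a rough illustration of the relationship the abstract relies on (quantization accuracy determines entropy, and entropy determines bitrate), the following minimal sketch quantizes a synthetic Gaussian parameter at two step sizes and compares the empirical entropies. It is not the paper's method: the Gaussian source, the step sizes, the helper names and the 20 ms frame length are all assumptions made for this example.

```python
import math
import random
from collections import Counter


def quantize(x, step):
    """Uniform quantizer: index of the multiple of `step` nearest to x."""
    return round(x / step)


def empirical_entropy_bits(indices):
    """Shannon entropy (bits per symbol) of the empirical index distribution."""
    counts = Counter(indices)
    n = len(indices)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def bits_per_frame_to_kbps(bits_per_frame, frame_ms=20.0):
    """Convert bits per frame to kbit/s; the 20 ms default is an assumption."""
    frames_per_second = 1000.0 / frame_ms
    return bits_per_frame * frames_per_second / 1000.0


# Synthetic stand-in for one codec parameter (assumption, not real speech data).
rng = random.Random(0)
samples = [rng.gauss(0.0, 1.0) for _ in range(20000)]

# Halving the quantization accuracy (doubling the step) lowers the entropy.
fine = empirical_entropy_bits([quantize(x, 0.25) for x in samples])
coarse = empirical_entropy_bits([quantize(x, 0.5) for x in samples])
saving_bits = fine - coarse

print(f"fine step:   {fine:.2f} bits/symbol")
print(f"coarse step: {coarse:.2f} bits/symbol")
print(f"saving:      {saving_bits:.2f} bits/symbol")
```

Under the assumed 20 ms frame length, `bits_per_frame_to_kbps(8)` gives 0.4 kbit/s, so a saving of that size would correspond to about 8 bits per frame. The frame length used in the paper is not stated in this record.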
Original language | English |
---|---|
Title of host publication | Proceedings of Interspeech |
Publisher | International Speech Communication Association (ISCA) |
Pages | 3401-3405 |
Publication status | Published - Sept 2019 |
MoE publication type | A4 Conference publication |
Event | Interspeech, Graz, Austria. Duration: 15 Sept 2019 → 19 Sept 2019. https://www.interspeech2019.org/ |
Publication series
Name | Interspeech - Annual Conference of the International Speech Communication Association |
---|---|
ISSN (Electronic) | 2308-457X |
Conference
Conference | Interspeech |
---|---|
Country/Territory | Austria |
City | Graz |
Period | 15/09/2019 → 19/09/2019 |
Keywords
- speech and audio coding
- end-to-end optimization
- speech source modeling
Fingerprint
Dive into the research topics of 'End-to-End Optimization of Source Models for Speech and Audio Coding Using a Machine Learning Framework'. Together they form a unique fingerprint.
Projects
- 1 Finished
- Interdisciplinary research on statistical parametric speech synthesis
Alku, P. (Principal investigator), Bäckström, T. (Project Member), Juvela, L. (Project Member), Murtola, T. (Project Member), Nonavinakere Prabhakera, N. (Project Member), Bollepalli, B. (Project Member) & Airaksinen, M. (Project Member)
01/01/2018 → 31/12/2019
Project: Academy of Finland: Other research funding