Fundamental Frequency Model for Postfiltering at Low Bitrates in a Transform-Domain Speech and Audio Codec

Sneha Das, Tom Bäckström, Guillaume Fuchs

Research output: Article in book/conference proceedings (Conference article in proceedings), scientific, peer-reviewed

2 Citations (Scopus)
152 Downloads (Pure)

Abstract

Speech codecs can use postfilters to improve the quality of the decoded signal. While postfiltering is effective in reducing coding artifacts, such methods often involve processing in both the encoder and the decoder, rely on additional transmitted side information, or are highly dependent on other codec functions for optimal performance. We propose a low-complexity postfiltering method to improve the harmonic structure of the decoded signal, which models the fundamental frequency of the signal. In contrast to past approaches, the postfilter operates at the decoder as a standalone function and does not need the transmission of additional side information. It can thus be used to enhance the output of any codec. We tested the approach on a modified version of the EVS codec in TCX mode only, which is subject to more pronounced coding artifacts when used at its lowest bitrate. Listening test results show an average improvement of 7 MUSHRA points for decoded signals with the proposed harmonic postfilter.
Original language: English
Title of host publication: Proceedings of Interspeech
Publisher: International Speech Communication Association (ISCA)
Pages: 2837-2841
Number of pages: 5
Volume: 2020-October
Status: Published - 2020
MoE publication type: A4 Article in conference proceedings
Event: Interspeech - Shanghai, China
Duration: 25 Oct 2020 to 29 Oct 2020
Conference number: 21
http://www.interspeech2020.org/

Publication series

Name: Interspeech
ISSN (electronic): 1990-9772

Conference

Conference: Interspeech
Abbreviated title: INTERSPEECH
Country/Territory: China
City: Shanghai
Period: 25/10/2020 to 29/10/2020
