Entropy coding of spectral envelopes for speech and audio coding using distribution quantization

Srikanth Korse, Tobias Jähnel, Tom Bäckström

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

6 Citations (Scopus)

Abstract

Speech and audio codecs model the overall shape of the signal spectrum using envelope models. In speech coding the predominant approach is linear predictive coding, which offers high coding efficiency at the cost of computational complexity and a rigid systems design. Audio codecs are usually based on scale factor bands, whose calculation and coding is simple, but whose coding efficiency is lower than that of linear prediction. In the current work we propose an entropy coding approach for scale factor bands, with the objective of reaching the same coding efficiency as linear prediction, but simultaneously retaining a low computational complexity. The proposed method is based on quantizing the distribution of spectral mass using betadistributions. Our experiments show that the perceptual quality achieved with the proposed method is similar to that of linear predictive models with the same bit rate, while the design simultaneously allows variable bit-rate coding and can easily be scaled to different sampling rates. The algorithmic complexity of the proposed method is less than one third of traditional multi-stage vector quantization of linear predictive envelopes.

Original languageEnglish
Title of host publicationProceedings of the Annual Conference of the International Speech Communication Association
PublisherInternational Speech Communication Association
Pages2543-2547
Number of pages5
Volume08-12-September-2016
DOIs
Publication statusPublished - 1 Jan 2016
MoE publication typeA4 Article in a conference publication
EventInterspeech - San Francisco, United States
Duration: 8 Sep 201612 Sep 2016
Conference number: 17

Publication series

NameProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
PublisherInternational Speech Communication Association
ISSN (Print)2308-457X

Conference

ConferenceInterspeech
Country/TerritoryUnited States
CitySan Francisco
Period08/09/201612/09/2016

Keywords

  • Arithmetic coding
  • Spectral envelope
  • Speech and audio coding

Fingerprint

Dive into the research topics of 'Entropy coding of spectral envelopes for speech and audio coding using distribution quantization'. Together they form a unique fingerprint.

Cite this