GMM-Based Iterative Entropy Coding for Spectral Envelopes of Speech and Audio

Srikanth Korse, Guillaume Fuchs, Tom Bäckström

    Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

    4 Citations (Scopus)
    268 Downloads (Pure)

    Abstract

    Spectral envelope modelling is a central part of speech and audio codecs and is traditionally based on either vector quantization or scalar quantization followed by entropy coding. To bridge the coding performance of vector quantization with the low complexity of the scalar case, we propose an iterative approach for entropy coding the spectral envelope parameters. For each parameter, a univariate probability distribution is derived from a Gaussian mixture model of the joint distribution and the previously quantized parameters used as a-priori information. Parameters are then iteratively and individually scalar quantized and entropy coded. Unlike vector quantization, the complexity of proposed method does not increase exponentially with dimension and bitrate. Moreover, the coding resolution and dimension can be adaptively modified without retraining the model. Experimental results show that these important advantages do not impair coding efficiency compared to a state-of-art vector quantization scheme.
    Original languageEnglish
    Title of host publicationProceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
    PublisherIEEE
    Pages5689-5693
    ISBN (Electronic)978-1-5386-4658-8
    DOIs
    Publication statusPublished - 2018
    MoE publication typeA4 Conference publication
    EventIEEE International Conference on Acoustics, Speech, and Signal Processing - Calgary, Canada
    Duration: 15 Apr 201820 Apr 2018
    https://2018.ieeeicassp.org/

    Publication series

    NameProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
    ISSN (Electronic)2379-190X

    Conference

    ConferenceIEEE International Conference on Acoustics, Speech, and Signal Processing
    Abbreviated titleICASSP
    Country/TerritoryCanada
    CityCalgary
    Period15/04/201820/04/2018
    Internet address

    Keywords

    • Entropy Coding
    • Gaussian mixture models
    • Envelope Modelling
    • Speech Coding
    • Audio Coding

    Fingerprint

    Dive into the research topics of 'GMM-Based Iterative Entropy Coding for Spectral Envelopes of Speech and Audio'. Together they form a unique fingerprint.

    Cite this