Estimation of the Probability Distribution of Spectral Fine Structure in the Speech Source

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Researchers

Research units

Abstract

The efficiency of many speech processing methods rely on accurate modeling of the distribution of the signal spectrum and a majority of prior works suggest that the spectral components follow the Laplace distribution. To improve the probability distribution models based on our knowledge of speech source modeling, we argue that the model should in fact be a multiplicative mixture model, including terms for voiced and unvoiced utterances. While prior works have applied Gaussian mixture models, we demonstrate that a mixture of generalized Gaussian models more accurately follows the observations. The proposed estimation method is based on measuring the ratio of $L_p$-norms between spectral bands. Such ratios follow the Beta-distribution when the input signal is generalized Gaussian, whereby the estimated parameters can be used to determine the underlying parameters of the mixture of generalized Gaussian distributions.

Details

Original languageEnglish
Title of host publicationProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Publication statusPublished - Aug 2017
MoE publication typeA4 Article in a conference publication
EventInterspeech - Stockholm, Sweden
Duration: 20 Aug 201724 Aug 2017
Conference number: 18
http://www.interspeech2017.org/

Publication series

NameInterspeech: Annual Conference of the International Speech Communication Association
ISSN (Electronic)1990-9772

Conference

ConferenceInterspeech
CountrySweden
CityStockholm
Period20/08/201724/08/2017
Internet address

    Research areas

  • probability distribution mixture models, speech production modeling

Download statistics

No data available

ID: 15091685