Spectral Envelope Statistics for Source Modeling in Speech Enhancement

S. Das, A. Craciun, T. Jaehnel, T. Baeckstroem

Tutkimustuotos: Artikkeli kirjassa/konferenssijulkaisussaConference contributionScientificvertaisarvioitu

Abstrakti

Source modeling is an efficient tool in speech and audio coding, yet in enhancement applications it has been less extensively employed. Incorporating speech source models from coding to enhancement has been difficult because the models are based on linear prediction, which is non-linear in the frequency domain. In this paper we propose a speech source model based on distribution quantizer, which quantifies the coarse shape of the spectral envelope. The spectral envelope is thus described by a set of parameters whose probability distributions have a simple form. The source parameters are estimated using these probability distributions from a single-channel noisy observation by maximum likelihood. Our experiments show that the proposed method is able to track the signal-to-noise ratio with good accuracy. In addition, although trained only on English items, our method showed relatively good results for German items as well, which demonstrates the robustness of the estimated source models.
AlkuperäiskieliEnglanti
OtsikkoSpeech Communication; 12. ITG Symposium
KustantajaVDE Verlag
Sivut1-5
Sivumäärä5
ISBN (painettu)978-3-8007-4275-2
TilaJulkaistu - 2016
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisuussa
TapahtumaITG Symposium on Speech Communication: Speech Communication - Paderborn, Saksa
Kesto: 5 lokakuuta 20167 lokakuuta 2016
Konferenssinumero: 12

Conference

ConferenceITG Symposium on Speech Communication
MaaSaksa
KaupunkiPaderborn
Ajanjakso05/10/201607/10/2016

Sormenjälki Sukella tutkimusaiheisiin 'Spectral Envelope Statistics for Source Modeling in Speech Enhancement'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.

  • Siteeraa tätä

    Das, S., Craciun, A., Jaehnel, T., & Baeckstroem, T. (2016). Spectral Envelope Statistics for Source Modeling in Speech Enhancement. teoksessa Speech Communication; 12. ITG Symposium (Sivut 1-5). VDE Verlag.