Modeling Formant Dynamics in Speech Spectral Envelopes

Alexandra Craciun, Jouni Paulus, Gökhan Sevkin, Tom Bäckström

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

2 Citations (Scopus)

Abstract

The spectral envelope of a speech signal encodes information about the characteristics of the speech source. As a result, spectral envelope modeling is a central task in speech applications, where tracking temporal transitions in diphones and triphones is essential for efficient speech synthesis and recognition
algorithms. Temporal changes in the envelope structure are often derived from estimated formant tracks, an approach which is sensitive to estimation errors. In this paper we propose a speech source model which estimates frequency and amplitude movements in the spectral envelopes of speech signals and does not rely on formant tracking. The proposed model estimates the amplitude and frequency shifts for each sub-band and time frame of a speech signal using the information from the previous time frame. Our experiments demonstrate that the model captures temporal structures of spectral envelopes with high precision. The proposed model can thus be applied as an accurate low-order representation of temporal dynamics in speech spectral envelopes.
Original languageEnglish
Title of host publicationProceedings of the European Signal Processing Conference (EUSIPCO)
PublisherIEEE
Pages1669-1673
Number of pages5
ISBN (Electronic)978-0-9928626-7-1
DOIs
Publication statusPublished - Aug 2017
MoE publication typeA4 Article in a conference publication
EventEuropean Signal Processing Conference - Kos Island, Greece, Kos, Greece
Duration: 28 Aug 20172 Sep 2017
Conference number: 25
https://www.eusipco2017.org
https://www.eusipco2017.org/

Publication series

NameEuropean Signal Processing Conference
ISSN (Print)2219-5491
ISSN (Electronic)2076-1465

Conference

ConferenceEuropean Signal Processing Conference
Abbreviated titleEUSIPCO
CountryGreece
CityKos
Period28/08/201702/09/2017
Internet address

Fingerprint Dive into the research topics of 'Modeling Formant Dynamics in Speech Spectral Envelopes'. Together they form a unique fingerprint.

Cite this