Abstract
This study presents an automatic glottal inverse filtering (GIF) technique that separates the effect of the glottal main excitation from the impulse response of the vocal tract. The method relies on a non-negative matrix factorization (NMF) decomposition of an ultra-short-term spectrogram of the analyzed signal. Unlike other state-of-the-art GIF techniques, it does not require estimation of glottal closure instants. The method was evaluated objectively on two test sets of continuous synthetic speech created with a glottal vocoding analysis/synthesis procedure. Compared with a set of reference GIF methods, the proposed NMF technique shows improved estimation accuracy, especially for male voices.
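To make the central idea more concrete, the following is a minimal sketch of a generic NMF decomposition applied to a spectrogram computed from very short frames. It is not the authors' algorithm: the abstract does not specify the frame length, hop size, factorization rank, cost function, or update rule, so the values and the standard multiplicative updates used here are assumptions for illustration only. The test signal (an impulse train passed through a single decaying resonance) and the function names `short_time_magnitude` and `nmf` are likewise hypothetical.

```python
import numpy as np

def short_time_magnitude(signal, frame_len=32, hop=8):
    """Magnitude spectrogram (frequency x time) from short Hann-windowed frames."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)],
        axis=1,
    )
    return np.abs(np.fft.rfft(frames, axis=0))

def nmf(V, rank=2, n_iter=200, eps=1e-9, seed=0):
    """Generic NMF (V ~ W @ H) with multiplicative updates for the Frobenius cost."""
    rng = np.random.default_rng(seed)
    W = rng.random((V.shape[0], rank)) + eps  # spectral basis vectors
    H = rng.random((rank, V.shape[1])) + eps  # time-varying activations
    errors = []
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
        errors.append(np.linalg.norm(V - W @ H))
    return W, H, errors

# Assumed toy signal: 100 Hz impulse train at fs = 8 kHz through one resonance.
fs = 8000
excitation = np.zeros(fs // 4)
excitation[::80] = 1.0
n = np.arange(200)
impulse_response = np.exp(-n / 20.0) * np.sin(2 * np.pi * 700 * n / fs)
signal = np.convolve(excitation, impulse_response)[: len(excitation)]

V = short_time_magnitude(signal)       # frames much shorter than one pitch period
W, H, errors = nmf(V, rank=2)
```

With frames much shorter than a pitch period, the columns of `V` vary within each glottal cycle, which is the kind of time resolution a decomposition needs in order to tell excitation-related behavior apart from the vocal tract response. How the factors are then mapped to a glottal flow estimate is described in the paper itself and is not reproduced here.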
Original language | English |
---|---|
Title of host publication | Proceedings of the Annual Conference of the International Speech Communication Association |
Subtitle of host publication | Interspeech'16, San Francisco, USA, Sept. 8-12, 2016 |
Publisher | International Speech Communication Association |
Pages | 1039-1043 |
Number of pages | 5 |
Volume | 08-12-September-2016 |
ISBN (Electronic) | 978-1-5108-3313-5 |
DOIs | |
Publication status | Published - 2016 |
MoE publication type | A4 Article in a conference publication |
Event | Interspeech - San Francisco, United States; Duration: 8 Sept 2016 → 12 Sept 2016; Conference number: 17 |
Publication series
Name | Proceedings of the Annual Conference of the International Speech Communication Association |
---|---|
Publisher | International Speech Communication Association |
ISSN (Print) | 1990-9770 |
ISSN (Electronic) | 2308-457X |
Conference
Conference | Interspeech |
---|---|
Country/Territory | United States |
City | San Francisco |
Period | 08/09/2016 → 12/09/2016 |
Keywords
- Glottal inverse filtering
- Nonnegative matrix factorization
- Speech analysis