Abstrakti
This study presents an automatic glottal inverse filtering (GIF) technique based on separating the effect of the glottal main excitation from the impulse response of the vocal tract. The proposed method is based on a non-negative matrix factorization (NMF) based decomposition of an ultra short-term spectrogram of the analyzed signal. Unlike other state-of-theart GIF techniques, the proposed method does not require estimation of glottal closure instants. The proposed method was objectively evaluated with two test sets of continuous synthetic speech created with a glottal vocoding analysis/synthesis procedure. When compared to a set of reference GIF methods, the proposed NMF technique shows improved estimation accuracy especially for male voices.
Alkuperäiskieli | Englanti |
---|---|
Otsikko | Proceedings of the Annual Conference of the International Speech Communication Association |
Alaotsikko | Interspeech'16, San Francisco, USA, Sept. 8-12, 2016 |
Kustantaja | International Speech Communication Association (ISCA) |
Sivut | 1039-1043 |
Sivumäärä | 5 |
Vuosikerta | 08-12-September-2016 |
ISBN (elektroninen) | 978-1-5108-3313-5 |
DOI - pysyväislinkit | |
Tila | Julkaistu - 2016 |
OKM-julkaisutyyppi | A4 Artikkeli konferenssijulkaisussa |
Tapahtuma | Interspeech - San Francisco, Yhdysvallat Kesto: 8 syysk. 2016 → 12 syysk. 2016 Konferenssinumero: 17 |
Julkaisusarja
Nimi | Proceedings of the Annual Conference of the International Speech Communication Association |
---|---|
Kustantaja | International Speech Communication Association |
ISSN (painettu) | 1990-9770 |
ISSN (elektroninen) | 2308-457X |
Conference
Conference | Interspeech |
---|---|
Maa/Alue | Yhdysvallat |
Kaupunki | San Francisco |
Ajanjakso | 08/09/2016 → 12/09/2016 |