Abstract
This study proposes an approach for glottal inverse filtering of acoustic speech signals using quadratic programming (QPR). The method aims to jointly model the effect of vocal tract and lip radiation with a single filter whose coefficients are optimized using QPR. This optimization is based on the principles of closed phase analysis, where the contribution of the glottal source is attenuated in optimizing the inverse model of the vocal tract. By expressing the optimization problem in terms of the output of a filter, we can apply physically motivated optimization such as flatness of the closed phase. The proposed method was objectively evaluated using a synthetic Liljencrants-Fant model based test set of sustained vowels, as well as a real speech test set where the glottal flow estimates' closed phases were compared in terms of their flatness. The results based on synthetic speech indicate that the proposed method is robust to changes in f0, and state-of-the-art quality results were obtained for high-pitched voices, when f0 is in the range 330-450 Hz. The results based on real speech indicate that the proposed method produces glottal flow estimates that have flatter closed phases with less formant ripple in comparison to estimates computed with known reference methods.
Original language | English |
---|---|
Article number | 7636994 |
Pages (from-to) | 929-939 |
Number of pages | 11 |
Journal | IEEE/ACM Transactions on Audio, Speech, and Language Processing |
Volume | 25 |
Issue number | 5 |
DOIs | |
Publication status | Published - 1 May 2017 |
MoE publication type | A1 Journal article-refereed |
Keywords
- Closed phase analysis
- glottal inverse filtering
- quadratic programming (QPR)