TY - GEN
T1 - Frequency-warped time-weighted linear prediction for glottal vocoding
AU - Airaksinen, Manu
AU - Bollepalli, Bajibabu
AU - Pohjalainen, Jouni
AU - Alku, Paavo
PY - 2017/6/16
Y1 - 2017/6/16
N2 - Auto-regressive modeling is a prevalent source-filter separation method of speech. Conventional linear prediction (LP) and its derivatives such as weighted linear prediction (WeLP) produce parametric spectral models within a linear frequency scale, whereas frequency-warped linear prediction (WaLP) can be used to take into account the frequency sensitivity of the human auditory system. From the perspective of glottal vocoding, the principles behind WeLP have been found to be beneficial for an accurate separation of the glottal source signal and the vocal tract transfer function, but this approach can not utilize the auditory benefits of frequency warping. On the other hand, the WaLP approach suffers from less accurate source-filter separation properties. In this study, a generalized frequency-warped time-weighted linear prediction (WWLP) analysis is proposed. Experiments with WWLP are performed within the context of glottal vocoding. The subjective listening test results show that WWLP-based spectral envelope modeling is able to increase quality over previously developed methods in some of the test cases.
AB - Auto-regressive modeling is a prevalent source-filter separation method of speech. Conventional linear prediction (LP) and its derivatives such as weighted linear prediction (WeLP) produce parametric spectral models within a linear frequency scale, whereas frequency-warped linear prediction (WaLP) can be used to take into account the frequency sensitivity of the human auditory system. From the perspective of glottal vocoding, the principles behind WeLP have been found to be beneficial for an accurate separation of the glottal source signal and the vocal tract transfer function, but this approach can not utilize the auditory benefits of frequency warping. On the other hand, the WaLP approach suffers from less accurate source-filter separation properties. In this study, a generalized frequency-warped time-weighted linear prediction (WWLP) analysis is proposed. Experiments with WWLP are performed within the context of glottal vocoding. The subjective listening test results show that WWLP-based spectral envelope modeling is able to increase quality over previously developed methods in some of the test cases.
KW - glottal inverse filtering
KW - Linear prediction
KW - speech synthesis
KW - vocoder
UR - http://www.scopus.com/inward/record.url?scp=85023744845&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2017.7953234
DO - 10.1109/ICASSP.2017.7953234
M3 - Conference article in proceedings
AN - SCOPUS:85023744845
T3 - Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
SP - 5630
EP - 5634
BT - 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017 - Proceedings
PB - IEEE
T2 - IEEE International Conference on Acoustics, Speech, and Signal Processing
Y2 - 5 March 2017 through 9 March 2017
ER -