Time-varying quasi-closed-phase weighted linear prediction analysis of speech for accurate formant detection and tracking

Dhananjaya Gowda, Paavo Alku

Research output: Chapter in book/conference proceedings; Conference article in proceedings; Scientific; peer-reviewed

1 Citation (Scopus)

Abstract

In this paper, we propose a new method for accurate detection, estimation and tracking of formants in speech signals using time-varying quasi-closed phase (TVQCP) analysis. The proposed method combines two analysis methods, namely time-varying linear prediction (TVLP) and quasi-closed phase (QCP) analysis. TVLP improves the tracking of formant frequencies by imposing a time-continuity constraint on the linear prediction (LP) coefficients. QCP analysis, a type of weighted LP (WLP), improves the estimation accuracy of the formant frequencies by applying a carefully designed weight function to the error signal that is minimized. The QCP weight function emphasizes the closed-phase region of the glottal cycle and downweights the regions around the main excitations. This reduces the influence of both the coupling with the subglottal cavity and the excitation source on the formant estimates. Experimental results on natural speech signals show that the proposed method performs considerably better than the detect-and-track approach used in popular tools such as Wavesurfer or Praat.
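The combination described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that the time-continuity constraint is realized by expanding each LP coefficient as a low-order polynomial in time, and that the weighted error is minimized in the least-squares sense. The function names (`tv_weighted_lp`, `simple_excitation_weights`, `formants_at`) and the rectangular weight shape are hypothetical; the actual QCP weight function is derived from the glottal cycle and is not specified in this record.

```python
import numpy as np


def simple_excitation_weights(n_samples, excitation_idx, half_width=3, floor=0.01):
    """Hypothetical stand-in for a QCP-style weight function (an assumption).

    Weight is 1 everywhere, reduced to `floor` within `half_width` samples
    of each given excitation instant.
    """
    w = np.ones(n_samples)
    for g in excitation_idx:
        lo, hi = max(0, g - half_width), min(n_samples, g + half_width + 1)
        w[lo:hi] = floor
    return w


def tv_weighted_lp(x, order, poly_order, weights=None):
    """Time-varying weighted LP by weighted least squares.

    Model: x[n] ~ sum_k a_k(n) x[n-k], with a_k(n) = sum_i b[k, i] * t(n)**i
    and t(n) the sample index normalized to [0, 1]. Minimizes
    sum_n w[n] * e[n]**2 and returns b with shape (order, poly_order + 1).
    """
    x = np.asarray(x, dtype=float)
    n = np.arange(order, len(x))
    t = n / (len(x) - 1)
    w = np.ones(len(x)) if weights is None else np.asarray(weights, dtype=float)
    # Column k-1 of `past` holds the delayed signal x[n-k].
    past = np.stack([x[n - k] for k in range(1, order + 1)], axis=1)
    # Column i of `basis` holds t**i.
    basis = np.stack([t ** i for i in range(poly_order + 1)], axis=1)
    # Each regressor is a delayed sample multiplied by a basis function.
    design = (past[:, :, None] * basis[:, None, :]).reshape(len(n), -1)
    # Scaling rows by sqrt(w) turns ordinary into weighted least squares.
    sw = np.sqrt(w[n])
    coef, *_ = np.linalg.lstsq(design * sw[:, None], x[n] * sw, rcond=None)
    return coef.reshape(order, poly_order + 1)


def formants_at(b, t, fs):
    """Formant candidates in Hz from the predictor at normalized time t."""
    a = b @ (t ** np.arange(b.shape[1]))
    # Roots of z^p - a_1 z^(p-1) - ... - a_p, i.e. the poles of 1/A(z).
    roots = np.roots(np.concatenate(([1.0], -a)))
    roots = roots[roots.imag > 0]
    return np.sort(np.angle(roots) * fs / (2 * np.pi))
```

With `poly_order=0` and uniform weights this reduces to conventional covariance-style LP on the frame. In practice the excitation instants passed to the weight function would come from a separate glottal closure instant detector, which is outside the scope of this sketch.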

Original language: English
Title of host publication: Proceedings of the Annual Conference of the International Speech Communication Association
Subtitle: Interspeech'16, San Francisco, USA, Sept. 8-12, 2016
Publisher: International Speech Communication Association (ISCA)
Pages: 1760-1764
Number of pages: 5
Volume: 08-12-September-2016
ISBN (electronic): 978-1-5108-3313-5
Publication status: Published - 2016
Ministry of Education and Culture (OKM) publication type: A4 Article in conference proceedings
Event: Interspeech - San Francisco, United States
Duration: 8 Sept 2016 to 12 Sept 2016
Conference number: 17

Publication series

Name: Proceedings of the Annual Conference of the International Speech Communication Association
Publisher: International Speech Communication Association
ISSN (print): 1990-9770
ISSN (electronic): 2308-457X

Conference

Conference: Interspeech
Country/Territory: United States
City: San Francisco
Period: 08/09/2016 to 12/09/2016
