Mel-frequency cepstral coefficients of voice source waveforms for classification of phonation types in speech

Sudarsana Reddy Kadiri, Paavo Alku

Tutkimustuotos: Artikkeli kirjassa/konferenssijulkaisussaConference article in proceedingsScientificvertaisarvioitu

12 Sitaatiot (Scopus)
400 Lataukset (Pure)

Abstrakti

Voice source characteristics in different phonation types vary due to the tension of laryngeal muscles along with the respiratory effort. This study investigates the use of mel-frequency cepstral coefficients (MFCCs) derived from voice source waveforms for classification of phonation types in speech. The cepstral coefficients are computed using two source waveforms: (1) glottal flow waveforms estimated by the quasi-closed phase (QCP) glottal inverse filtering method and (2) approximate voice source waveforms obtained using the zero frequency filtering (ZFF) method. QCP estimates voice source waveforms based on the source-filter decomposition while ZFF yields source waveforms without explicitly computing the source-filter decomposition. Experiments using MFCCs computed from the two source waveforms show improved accuracy in classification of phonation types compared to the existing voice source features and conventional MFCC features. Further, it is observed that the proposed features have complimentary information to the existing features.

AlkuperäiskieliEnglanti
OtsikkoProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
KustantajaInternational Speech Communication Association (ISCA)
Sivut2508-2512
Sivumäärä5
Vuosikerta2019-September
DOI - pysyväislinkit
TilaJulkaistu - 1 tammik. 2019
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisussa
TapahtumaInterspeech - Graz, Itävalta
Kesto: 15 syysk. 201919 syysk. 2019
https://www.interspeech2019.org/

Julkaisusarja

NimiInterspeech - Annual Conference of the International Speech Communication Association, INTERSPEECH
ISSN (elektroninen)2308-457X

Conference

ConferenceInterspeech
Maa/AlueItävalta
KaupunkiGraz
Ajanjakso15/09/201919/09/2019
www-osoite

Sormenjälki

Sukella tutkimusaiheisiin 'Mel-frequency cepstral coefficients of voice source waveforms for classification of phonation types in speech'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.

Siteeraa tätä