Breathy to tense voice discrimination using zero-time windowing cepstral coefficients (ZTWCCs)

Sudarsana Reddy Kadiri, B. Yegnanarayana

Tutkimustuotos: Artikkeli kirjassa/konferenssijulkaisussaConference article in proceedingsScientificvertaisarvioitu

24 Sitaatiot (Scopus)

Abstrakti

In this paper, we consider breathy to tense voices, which are often considered to be opposite ends of a voice quality continuum. Along with these, other aspects of a speaker's voice play an important role to convey the information to the listener such as mood, attitude and emotional state. The glottal pulse characteristics in different phonation types vary due to the tension of laryngeal muscles together with the respiratory effort. In the present study, we are deriving the features that can capture effects of excitation on the vocal tract system through a signal processing method, called as zero-time windowing (ZTW) method. The ZTW method gives the instantaneous spectrum which captures the changes in the speech production mechanism, providing higher spectral resolution. The cepstral coefficients derived from ZTW method are used for the classification of phonation types. Along with zero-time windowing cepstral coefficients (ZTWCCs), we use the excitation source features derived from zero frequency filtering (ZFF) method. The excitation features used are: strength of excitation, energy of excitation, loudness measure and ZFF signal energy. Classification experiments using ZTWCC and excitation features reveal a significant improvement in the detection of phonation type compared to the existing voice quality features and MFCC features.

AlkuperäiskieliEnglanti
OtsikkoInterspeech
KustantajaInternational Speech Communication Association (ISCA)
Sivut232-236
Sivumäärä5
Vuosikerta2018-September
DOI - pysyväislinkit
TilaJulkaistu - 2018
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisussa
TapahtumaInterspeech - Hyderabad International Convention Centre, Hyderabad, Intia
Kesto: 2 syysk. 20186 syysk. 2018
http://interspeech2018.org/

Julkaisusarja

NimiInterspeech
KustantajaInternational Speech Communication Association
ISSN (painettu)2308-457X

Conference

ConferenceInterspeech
Maa/AlueIntia
KaupunkiHyderabad
Ajanjakso02/09/201806/09/2018
www-osoite

Sormenjälki

Sukella tutkimusaiheisiin 'Breathy to tense voice discrimination using zero-time windowing cepstral coefficients (ZTWCCs)'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.

Siteeraa tätä