Abstract
Telephone speech is one of the degradations that must be handled when building speech systems for practical environments. The usefulness of such systems depends on speech analysis algorithms that can handle the acoustic variations and degradations often found in human speech communication. Detection of epochs (glottal closure instants, GCIs) is typically required in such analysis stages. An epoch is the instant of significant excitation of the vocal tract system in voiced speech. In this paper, the effect of telephone channel speech on the accuracy of epoch detection using state-of-the-art epoch extraction methods is investigated. Most existing epoch extraction algorithms have been shown to perform very well on speech data collected in laboratory environments. The effectiveness of these algorithms on telephone-quality speech is studied quantitatively, and the strengths and weaknesses of the methods are discussed. The methods are evaluated on six large databases containing speech and simultaneous electroglottography (EGG) recordings, which serve as the ground truth. The state-of-the-art epoch extraction algorithms compared in this study are ZFF, YAGA, DYPSA, SEDREAMS, SE-VQ and MMF. The performance of the algorithms is evaluated in terms of both reliability and accuracy measures.
Original language | English |
---|---|
Title of host publication | 44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019; Brighton; United Kingdom; 12-17 May 2019 : Proceedings |
Publisher | IEEE |
Pages | 6500-6504 |
Number of pages | 5 |
ISBN (electronic) | 9781479981311 |
DOI - permanent links | |
Status | Published - 1 May 2019 |
MoE publication type | A4 Article in conference proceedings |
Event | IEEE International Conference on Acoustics, Speech, and Signal Processing - Brighton, United Kingdom Duration: 12 May 2019 → 17 May 2019 Conference number: 44 |
Publication series
Name | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing |
---|---|
Volume | 2019-May |
ISSN (print) | 1520-6149 |
ISSN (electronic) | 2379-190X |
Conference
Conference | IEEE International Conference on Acoustics, Speech, and Signal Processing |
---|---|
Abbreviated title | ICASSP |
Country/Territory | United Kingdom |
City | Brighton |
Period | 12/05/2019 → 17/05/2019 |
Fingerprint
Dive into the research topics of 'A Quantitative Comparison of Epoch Extraction Algorithms for Telephone Speech'. Together they form a unique fingerprint.
Projects
- 1 Finished
- Interdisciplinary research project on parametric speech synthesis
Alku, P. (Principal investigator), Bäckström, T. (Project member), Juvela, L. (Project member), Murtola, T. (Project member), Nonavinakere Prabhakera, N. (Project member), Bollepalli, B. (Project member) & Airaksinen, M. (Project member)
01/01/2018 → 31/12/2019
Project: Academy of Finland: Other research funding