Spectral modification for recognition of children’s speech under mismatched conditions

Research output: Chapter in book/conference proceedings › Conference contribution › Scientific › Peer-reviewed

Abstract

In this paper, we propose a spectral modification method, which sharpens formants and reduces spectral tilt, for recognizing children’s speech with automatic speech recognition (ASR) systems developed using adult speech. Under this mismatched condition, ASR performance degrades because of the acoustic and linguistic mismatch between child and adult speakers. The proposed method improves speech intelligibility and thereby enhances recognition of children’s speech with an acoustic model trained on adult speech. In the experiments, WSJCAM0 and PFSTAR are used as the databases for adults’ and children’s speech, respectively. The proposed technique gives a significant improvement with a DNN-HMM-based ASR system. Furthermore, we validate the robustness of the technique by showing that it also performs well in mismatched noise conditions.
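
One ingredient named in the abstract, reducing spectral tilt, can be illustrated with a first-order pre-emphasis filter. The sketch below is only an illustration under stated assumptions and is not the authors' method: the filter form, the coefficient 0.97, the 16 kHz sampling rate, and all function names are chosen here for the example and do not come from the paper. Formant sharpening is not covered.

```python
import math


def pre_emphasis(signal, coeff=0.97):
    """Apply y[n] = x[n] - coeff * x[n-1] (assumed stand-in for tilt reduction)."""
    if not signal:
        return []
    out = [signal[0]]  # first sample has no predecessor, so pass it through
    for n in range(1, len(signal)):
        out.append(signal[n] - coeff * signal[n - 1])
    return out


def rms(signal):
    """Root-mean-square level of a non-empty sequence of samples."""
    return math.sqrt(sum(s * s for s in signal) / len(signal))


def sine(freq_hz, sample_rate=16000, n_samples=1600):
    """Unit-amplitude test tone (hypothetical helper for this example)."""
    return [math.sin(2 * math.pi * freq_hz * n / sample_rate)
            for n in range(n_samples)]


def gain_db(freq_hz, coeff=0.97, sample_rate=16000):
    """Measure the filter's gain in dB on a pure tone at freq_hz."""
    x = sine(freq_hz, sample_rate)
    y = pre_emphasis(x, coeff)
    return 20 * math.log10(rms(y) / rms(x))


if __name__ == "__main__":
    # Low frequencies are attenuated and high frequencies slightly boosted,
    # which flattens the downward slope (tilt) of a typical speech spectrum.
    for f in (200, 1000, 4000):
        print(f, "Hz:", round(gain_db(f), 1), "dB")
```

Comparing `gain_db(200)` with `gain_db(4000)` shows the low-frequency tone coming out well below the high-frequency one, which is the flattening effect that tilt reduction aims for.
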
Original language: English
Title of host publication: Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa)
Place of publication: Sweden
Publisher: Linköping University Electronic Press
Pages: 94–100
Number of pages: 7
ISBN (electronic): 978-91-7929-614-8
Publication status: Published - 31 May 2021
MoE publication type: A4 Article in conference proceedings
Event: Nordic Conference on Computational Linguistics - Reykjavik, Iceland
Duration: 31 May 2021 to 2 June 2021

Publication series

Name: Linköping electronic conference proceedings
Number: 178
ISSN (print): 1650-3686
ISSN (electronic): 1650-3740

Conference

Conference: Nordic Conference on Computational Linguistics
Abbreviated title: NoDaLiDa
Country: Iceland
City: Reykjavik
Period: 31/05/2021 to 02/06/2021
