Estimating the spectral tilt of the glottal source from telephone speech using a deep neural network

Emma Jokinen*, Paavo Alku

*Corresponding author for this work

Research output: Contribution to journal › Article › Scientific › peer-review


Abstract

Estimating the spectral tilt of the glottal source has several applications in speech analysis and modification. However, estimating the tilt directly from telephone speech is challenging because of vocal tract resonances and the distortion caused by speech compression. In this study, a deep neural network estimates the tilt from telephone speech; the network is trained with tilt estimates computed by glottal inverse filtering. An objective evaluation shows that the proposed technique gives more accurate spectral tilt estimates than previously used techniques that estimate the tilt directly from telephone speech without glottal inverse filtering.
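
For readers unfamiliar with the quantity being estimated, the following is a minimal sketch of one common way to quantify spectral tilt: the slope of a least-squares line fitted to the log-magnitude spectrum. It is not the method of the article (it involves neither a neural network nor glottal inverse filtering), and it measures the tilt of whatever signal it is given rather than of the glottal source. The function name `spectral_tilt_db_per_octave`, the 300-3400 Hz band, and the dB-per-octave unit are assumptions chosen for illustration.

```python
import numpy as np


def spectral_tilt_db_per_octave(frame, fs, fmin=300.0, fmax=3400.0):
    """Slope (dB/octave) of a line fitted to the magnitude spectrum of one frame.

    Illustrative only: the band limits are assumed defaults, not values
    taken from the article.
    """
    frame = np.asarray(frame, dtype=float)

    # Window the frame to limit spectral leakage, then take the magnitude spectrum.
    windowed = frame * np.hanning(len(frame))
    magnitude = np.abs(np.fft.rfft(windowed))
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / fs)

    # Keep only the analysis band (this also excludes the DC bin).
    band = (freqs >= fmin) & (freqs <= fmax)
    level_db = 20.0 * np.log10(magnitude[band] + 1e-12)

    # Fit level_db = slope * log2(f) + intercept; the slope is in dB per octave.
    slope, _intercept = np.polyfit(np.log2(freqs[band]), level_db, 1)
    return slope
```

A more negative value means that energy falls off faster with increasing frequency. For example, cumulatively summed white noise should yield a clearly more negative tilt than the white noise itself.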

Original language: English
Pages (from-to): EL327-EL330
Number of pages: 4
Journal: Journal of the Acoustical Society of America
Volume: 141
Issue number: 4
Publication status: Published - 1 Apr 2017
MoE publication type: A1 Journal article-refereed

