Projects per year
Abstract
Linguistic prominence in speech is known to correlate with the acoustic measures of energy, F0, and duration. In contrast, the role of spectral tilt in the realization of prominence has remained more inconsistent between previous empirical investigations. This may be partially due to the lack of a standard method for quantifying spectral tilt or due to difficulties in estimating the acoustical source of spectral tilt, the glottal flow, from continuous speech. These issues have rendered interpretations and comparisons between studies difficult. In addition, (i) little is known about the robustness of tilt estimators for prominence detection in the case when speech is not clean but corrupted, as in real life, by environmental noise or telephone transmission (i.e. degradation caused by bandpass filtering and quantization noise). Moreover, (ii) little attention has been paid to multidimensional representations of source spectrum that can potentially incorporate more information about the phonation style than purely scalar measures. In this work, we study spectral tilt in signaling prominence in spoken Dutch and French under different levels of additive noise, and for telephone-band coded speech, and compare several one-dimensional tilt measures that have been previously encountered in the literature as well as multidimensional tilt measures. We also compare spectral tilt measures with other standard acoustic correlates for prominence, namely, energy, F0, and duration. Our results provide further empirical support for the finding that tilt is a systematic correlate of prominence in Dutch, that the role is smaller in French, and that energy, F0, and duration appear still to be the most robust features for discriminating prominent and non-prominent words. In addition, our results show that there are notable differences between different tilt measures at different levels of noise, and that multidimensional representations for tilt improve class separability from the scalar measures.
Original language | English |
---|---|
Pages (from-to) | 11-26 |
Number of pages | 16 |
Journal | Speech Communication |
Volume | 103 |
DOIs | |
Publication status | Published - 1 Oct 2018 |
MoE publication type | A1 Journal article-refereed |
Keywords
- Prosody
- Sentence prominence
- Acoustic measures
- Spectral tilt
- Noise robustness
- DNN
Fingerprint
Dive into the research topics of 'Comparison of spectral tilt measures for sentence prominence in speech — Effects of dimensionality and adverse noise conditions'. Together they form a unique fingerprint.Projects
- 4 Finished
-
Interdisciplinary research on statistical parametric speech synthesis
Alku, P., Bäckström, T., Bollepalli, B., Nonavinakere Prabhakera, N., Airaksinen, M., Murtola, T. & Juvela, L.
01/01/2018 → 31/12/2019
Project: Academy of Finland: Other research funding
-
ACLEW: Analyzing Child Language Experiences Around the World
Räsänen, O. & Seshadri, S.
01/06/2017 → 31/05/2020
Project: Academy of Finland: Other research funding
-
Computational modeling of language acquisition: bridging the gap between physical sensory input and high-level linguistic representations
Räsänen, O.
01/09/2014 → 31/08/2017
Project: Academy of Finland: Other research funding