Abstract
Vocal intensity is quantified by the sound pressure level (SPL). The SPL can be measured either by using a sound level meter or by comparing the energy of the recorded speech signal with the energy of a recorded calibration tone of known SPL. Neither approach can be used if speech is recorded in real-life conditions with a device that is not calibrated for SPL measurements. To measure the SPL from non-calibrated recordings, where speech is presented on a normalized amplitude scale, this study investigates machine learning (ML)-based estimation of the SPL. Several ML-based systems, each consisting of a feature extraction stage and a regression stage, were built. For the former, four conventional acoustic features, two state-of-the-art pre-trained features, and their combined feature set were compared. For the latter, three regression models were compared. The systems were trained using healthy speech from an open repository. They were evaluated using both pathological speech produced by patients suffering from heart failure and speech produced by healthy controls. The results showed that the best combination of feature and regression model provided a mean absolute error of about 2 dB in the SPL estimation task.
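The calibration-tone approach and the error metric mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names are hypothetical, mean power (rather than total energy) is assumed as the energy measure so that recordings of different durations are comparable, and the 94 dB reference level in the usage example is only an illustrative value.

```python
import math


def rms(samples):
    """Root-mean-square amplitude of a sequence of samples."""
    return math.sqrt(sum(x * x for x in samples) / len(samples))


def spl_from_calibration(speech, calibration_tone, calibration_spl_db):
    """Estimate the SPL of `speech` in dB (hypothetical helper).

    Assumes both signals were recorded with the same device and gain,
    and that the SPL of the calibration tone is known.
    """
    # A power ratio in dB, 10*log10(P_s / P_c), equals 20*log10(rms_s / rms_c).
    return calibration_spl_db + 20.0 * math.log10(rms(speech) / rms(calibration_tone))


def mean_absolute_error(reference_db, estimated_db):
    """Mean absolute error between reference and estimated SPL values, in dB."""
    errors = [abs(r - e) for r, e in zip(reference_db, estimated_db)]
    return sum(errors) / len(errors)


# Usage: a 1 kHz tone sampled at 16 kHz, assumed to have been recorded at 94 dB SPL.
tone = [0.1 * math.sin(2 * math.pi * 1000 * n / 16000) for n in range(16000)]
# A signal with half the amplitude of the tone lies about 6 dB below it.
speech = [0.5 * x for x in tone]
spl = spl_from_calibration(speech, tone, 94.0)
```

Once a recording has been amplitude-normalized, the gain relation between the speech and the calibration tone is lost, so this computation no longer applies. That is the case the ML-based estimation in the study addresses.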
| Original language | English |
|---|---|
| Pages | 1726-1741 |
| Number of pages | 16 |
| Journal | Journal of the Acoustical Society of America |
| Volume | 157 |
| Issue number | 3 |
| DOIs | |
| Status | Published - 13 Mar 2025 |
| MoE publication type | A1 Original article in a scientific journal |
Fingerprint
Dive into the research topics of 'The machine learning-based prediction of the sound pressure level from pathological and healthy speech signals'. Together they form a unique fingerprint.
Datasets
- AVID: Aalto Vocal Intensity Database
Alku, P. (Creator), Kodali, M. (Creator) & Kadiri, S. R. (Creator), Zenodo, 18 May 2023
DOI: 10.5281/zenodo.7948299, https://zenodo.org/record/7948300, https://zenodo.org/records/10524873
Dataset: Dataset
Projects
- 1 Finished
- HEART: Speech-based biomarking of heart failure
Alku, P. (Principal investigator), Javanmardi, F. (Project Member), Yagnavajjula, M. (Project Member), Pohjalainen, H. (Project Member), Kadiri, S. (Project Member), Kodali, M. (Project Member), Tirronen, S. (Project Member) & Mittapalle, K. (Project Member)
01/09/2020 → 31/08/2024
Project: RCF Academy Project