Parameterization of a computational physical model for glottal flow using inverse filtering and high-speed videoendoscopy

Research output: Contribution to journalArticleScientificpeer-review

Standard

Parameterization of a computational physical model for glottal flow using inverse filtering and high-speed videoendoscopy. / Murtola, Tiina; Alku, Paavo; Malinen, Jarmo; Geneid, Ahmed.

In: Speech Communication, Vol. 96, 01.02.2018, p. 67-80.

Research output: Contribution to journalArticleScientificpeer-review

Harvard

APA

Vancouver

Author

Bibtex - Download

@article{5cde36120cd94bb6b16a351dd1df1bc0,
title = "Parameterization of a computational physical model for glottal flow using inverse filtering and high-speed videoendoscopy",
abstract = "High-speed videoendoscopy, glottal inverse filtering, and physical modeling can be used to obtain complementary information about speech production. In this study, the three methodologies are combined to pursue a better understanding of the relationship between the glottal air flow and glottal area. Simultaneously acquired high-speed video and glottal inverse filtering data from three male and three female speakers were used. Significant correlations were found between the quasi-open and quasi-speed quotients of the glottal area (extracted from the high-speed videos) and glottal flow (estimated using glottal inverse filtering), but only the quasi-open quotient relationship could be represented as a linear model. A simple physical glottal flow model with three different glottal geometries was optimized to match the data. The results indicate that glottal flow skewing can be modeled using an inertial vocal/subglottal tract load and that estimated inertia within the glottis is sensitive to the quality of the data. Parameter optimisation also appears to favour combining the simplest glottal geometry with viscous losses and the more complex glottal geometries with entrance/exit effects in the glottis.",
keywords = "Glottal flow, Physical model, Speech production, Vocal fold imaging",
author = "Tiina Murtola and Paavo Alku and Jarmo Malinen and Ahmed Geneid",
year = "2018",
month = "2",
day = "1",
doi = "10.1016/j.specom.2017.11.007",
language = "English",
volume = "96",
pages = "67--80",
journal = "Speech Communication",
issn = "0167-6393",
publisher = "Elsevier",

}

RIS - Download

TY - JOUR

T1 - Parameterization of a computational physical model for glottal flow using inverse filtering and high-speed videoendoscopy

AU - Murtola, Tiina

AU - Alku, Paavo

AU - Malinen, Jarmo

AU - Geneid, Ahmed

PY - 2018/2/1

Y1 - 2018/2/1

N2 - High-speed videoendoscopy, glottal inverse filtering, and physical modeling can be used to obtain complementary information about speech production. In this study, the three methodologies are combined to pursue a better understanding of the relationship between the glottal air flow and glottal area. Simultaneously acquired high-speed video and glottal inverse filtering data from three male and three female speakers were used. Significant correlations were found between the quasi-open and quasi-speed quotients of the glottal area (extracted from the high-speed videos) and glottal flow (estimated using glottal inverse filtering), but only the quasi-open quotient relationship could be represented as a linear model. A simple physical glottal flow model with three different glottal geometries was optimized to match the data. The results indicate that glottal flow skewing can be modeled using an inertial vocal/subglottal tract load and that estimated inertia within the glottis is sensitive to the quality of the data. Parameter optimisation also appears to favour combining the simplest glottal geometry with viscous losses and the more complex glottal geometries with entrance/exit effects in the glottis.

AB - High-speed videoendoscopy, glottal inverse filtering, and physical modeling can be used to obtain complementary information about speech production. In this study, the three methodologies are combined to pursue a better understanding of the relationship between the glottal air flow and glottal area. Simultaneously acquired high-speed video and glottal inverse filtering data from three male and three female speakers were used. Significant correlations were found between the quasi-open and quasi-speed quotients of the glottal area (extracted from the high-speed videos) and glottal flow (estimated using glottal inverse filtering), but only the quasi-open quotient relationship could be represented as a linear model. A simple physical glottal flow model with three different glottal geometries was optimized to match the data. The results indicate that glottal flow skewing can be modeled using an inertial vocal/subglottal tract load and that estimated inertia within the glottis is sensitive to the quality of the data. Parameter optimisation also appears to favour combining the simplest glottal geometry with viscous losses and the more complex glottal geometries with entrance/exit effects in the glottis.

KW - Glottal flow

KW - Physical model

KW - Speech production

KW - Vocal fold imaging

UR - http://www.scopus.com/inward/record.url?scp=85034819485&partnerID=8YFLogxK

U2 - 10.1016/j.specom.2017.11.007

DO - 10.1016/j.specom.2017.11.007

M3 - Article

VL - 96

SP - 67

EP - 80

JO - Speech Communication

JF - Speech Communication

SN - 0167-6393

ER -

ID: 16291027