Donate Speech: Collecting and Sharing a Large-Scale Speech Database for Social Sciences, Humanities and Artificial Intelligence Research and Innovation

Krister Lindén, Tommi Jauhiainen, Mietta Lennes, Mikko Kurimo, Aleksi Rossi, Tommi Kurki, Olli Pitkänen

Research output: Article in book/conference proceedings › Chapter › Scientific › Peer-reviewed

55 Downloads (Pure)

Abstract

The Donate Speech campaign aimed to collect 10 000 hours of ordinary, casual Finnish speech to be used for studying language as well as for developing technology and services that can be readily used in the languages spoken in Finland. In this project, particular attention has been paid to allowing for both academic and commercial use of the material. Even though the ambitious target currently seems to evade us, the Donate Speech campaign has managed to collect an extensive resource of more than 3500 h of Finnish colloquial speech with more than 200 000 speech recordings by roughly 50 000 speakers from all over Finland in just a few months.
Original language: English
Title of host publication: CLARIN : the infrastructure for language resources
Publisher: De Gruyter
Number of pages: 30
ISBN (electronic): 978-3-11-076737-7
ISBN (print): 978-3-11-076734-6
Status: Published - Oct 2022
MoE publication type: A3 Part of a book or other edited volume

Publication series

Name: Digital Linguistics
Volume: 1
ISSN (electronic): 2751-1278
