FinChat: Corpus and evaluation setup for Finnish chat conversations on everyday topics

Katri Leino, Juho Leinonen, Mittul Singh, Sami Virpioja, Mikko Kurimo

Research output: Article in book/conference proceedings, Conference contribution, Scientific, peer-reviewed

Abstract

Creating open-domain chatbots requires large amounts of conversational data and related benchmark tasks to evaluate them. Standardized evaluation tasks are crucial for creating automatic evaluation metrics for model development; otherwise, comparing the models would require resource-expensive human evaluation. While chatbot challenges have recently managed to provide a plethora of such resources for English, resources in other languages are not yet available. In this work, we provide a starting point for Finnish open-domain chatbot research. We describe our collection efforts to create the Finnish chat conversation corpus FinChat, which is made available publicly. FinChat includes unscripted conversations on seven topics from people of different ages. Using this corpus, we also construct a retrieval-based evaluation task for Finnish chatbot development. We observe that off-the-shelf chatbot models trained on conversational corpora do not perform better than chance at choosing the right answer based on automatic metrics, while humans can do the same task almost perfectly. Similarly, in a human evaluation, responses to questions from the evaluation set generated by the chatbots are predominantly marked as incoherent. Thus, FinChat provides a challenging evaluation set, meant to encourage chatbot development in Finnish.
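As an illustration of what a retrieval-based evaluation with a chance baseline can look like, the sketch below scores each candidate response for a context and checks whether the top-ranked one is the true response. The abstract does not state the exact metric or the number of candidates per question, so the hits@1 measure, the `Example` layout, the `word_overlap` scorer, and the toy data are all assumptions made for this sketch and are not taken from FinChat.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical example layout: (context, candidate responses, index of the true response).
Example = Tuple[str, Sequence[str], int]


def hits_at_1(examples: Sequence[Example], score: Callable[[str, str], float]) -> float:
    """Fraction of examples where the top-scored candidate is the true response."""
    correct = 0
    for context, candidates, true_index in examples:
        scores = [score(context, candidate) for candidate in candidates]
        # Index of the highest-scoring candidate (first one wins on ties).
        predicted = max(range(len(candidates)), key=scores.__getitem__)
        correct += int(predicted == true_index)
    return correct / len(examples)


def chance_level(examples: Sequence[Example]) -> float:
    """Expected hits@1 of a scorer that picks a candidate uniformly at random."""
    return sum(1 / len(candidates) for _, candidates, _ in examples) / len(examples)


def word_overlap(context: str, candidate: str) -> float:
    """Toy scorer (illustrative only): number of shared lowercase word types."""
    return float(len(set(context.lower().split()) & set(candidate.lower().split())))


# Invented toy data, not drawn from the FinChat corpus.
toy_examples: List[Example] = [
    ("do you like football", ["yes I like football", "it rained today"], 0),
    ("what music do you listen to", ["my cat sleeps a lot", "I listen to rock music"], 1),
]

print(hits_at_1(toy_examples, word_overlap), chance_level(toy_examples))
```

A model whose hits@1 stays close to `chance_level` is, under this setup, not distinguishing the true response from the distractors.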
Original language: English
Title of host publication: Proceedings of Interspeech
Publisher: International Speech Communication Association
Pages: 429-433
Number of pages: 5
Volume: 2020-October
Status: Published - 2020
Ministry of Education and Culture (OKM) publication type: A4 Article in conference proceedings
Event: Interspeech - Shanghai, China
Duration: 25 October 2020 to 29 October 2020
Conference number: 21
http://www.interspeech2020.org/

Publication series

Name: Interspeech
ISSN (electronic): 1990-9772

Conference

Conference: Interspeech
Abbreviated title: INTERSPEECH
Country/Territory: China
City: Shanghai
Period: 25/10/2020 to 29/10/2020
