Siirry päänavigointiin Siirry hakuun Siirry pääsisältöön

Exploring adaptation techniques of large speech foundation models for low-resource ASR: a case study on Northern Sámi

Tutkimustuotos: Artikkeli kirjassa/konferenssijulkaisussaConference article in proceedingsScientificvertaisarvioitu

6 Sitaatiot (Scopus)
72 Lataukset (Pure)

Abstrakti

Speech foundation models such as wav2vec 2.0 have made it possible to develop highly accurate models for low-resourced languages using a limited amount of speech data. For optimal results, the pre-training should already include data from the target language, but unfortunately, none of the available foundation models include Northern Sámi. In this work, we explore various ways of preparing the foundation model for the Northern Sámi, including continued pre-training with a small untranscribed corpus and our new extended fine-tuning method. The extended fine-tuning starts from an already fine-tuned ASR model and augments it with new output units for the unique Sámi characters before new fine-tuning with transcribed Sámi data. Our results demonstrate the benefits of these advanced adaptation techniques, as both approaches lead to better performance than the direct fine-tuning-based adaptation.

AlkuperäiskieliEnglanti
OtsikkoInterspeech 2024
KustantajaInternational Society for Computers and Their Applications (ISCA)
Sivut2539-2543
Sivumäärä5
DOI - pysyväislinkit
TilaJulkaistu - 2024
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisussa
TapahtumaInterspeech - Kos Island, Kreikka
Kesto: 1 syysk. 20245 syysk. 2024

Julkaisusarja

NimiProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
KustantajaInternational Speech Communication Association (ISCA)
ISSN (painettu)2308-457X

Conference

ConferenceInterspeech
Maa/AlueKreikka
KaupunkiKos Island
Ajanjakso01/09/202405/09/2024

Sormenjälki

Sukella tutkimusaiheisiin 'Exploring adaptation techniques of large speech foundation models for low-resource ASR: a case study on Northern Sámi'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.

Siteeraa tätä