Datasets
- 22 results
Search results
-
nikolay-banar/OpenNMT-py: Character-level Transformer
Rush, S. (Creator), Nguyen, V. (Creator), Peters, B. (Creator), Gehrmann, S. (Creator), Zhan, J. (Creator), Tardy, P. (Creator), Deng, Y. (Creator), McCann, B. (Creator), Klein, G. (Creator), Chintala, S. (Creator), Hernandez, F. (Creator), Lerer, A. (Creator), Li, J. (Creator), Hoang, V. N. T. (Creator), Grönroos, S. (Creator), Ma, X. (Creator), Paszke, A. (Creator), Linxiao, Z. (Creator), Senellart, J. (Creator), Gowda, T. (Creator), Wenniger, G. M. D. B. (Creator), Yahmed, T. (Creator), Gangi, M. D. (Creator), Gross, S. (Creator) & Lei, T. (Creator), Zenodo, 2020
DOI: 10.5281/zenodo.3988435, https://zenodo.org/record/3989305
Dataset: Software or code
-
Models for MeMAD language identification pipeline
Virkkunen, A. (Creator) & Lindgren, M. (Creator), Zenodo, 2021
DOI: 10.5281/zenodo.4486872, https://zenodo.org/record/4486873
Dataset
-
wav2vec 2.0 Base LP (continued PT) 1500h model checkpoint from Getman et al. "What happens in continued pre-training? Analysis of self-supervised speech models with continued pre-training for colloquial Finnish ASR"
Getman, Y. (Creator), Zenodo, 11 Jun 2024
DOI: 10.5281/zenodo.11573213, https://zenodo.org/records/11573214
Dataset: Software or code
-
wav2vec 2.0 Base LP (PT from scratch) 1500h model checkpoint from Getman et al. "What happens in continued pre-training? Analysis of self-supervised speech models with continued pre-training for colloquial Finnish ASR"
Getman, Y. (Creator), Zenodo, 11 Jun 2024
DOI: 10.5281/zenodo.11572956, https://zenodo.org/records/11572957
Dataset
-
wav2vec 2.0 Large LP (PT from scratch) model checkpoint from Getman et al. "What happens in continued pre-training? Analysis of self-supervised speech models with continued pre-training for colloquial Finnish ASR"
Getman, Y. (Creator), Zenodo, 11 Jun 2024
DOI: 10.5281/zenodo.11573671, https://zenodo.org/records/11573672
Dataset: Software or code
-
Lahjoita puhetta baseline Kaldi ASR model
Moisio, A. (Creator), Zenodo, 2022
DOI: 10.5281/zenodo.6539428, https://zenodo.org/record/6539429 and 2 more links, https://zenodo.org/record/7101543, https://zenodo.org/record/7101461 (show fewer)
Dataset: Software or code
-
wav2vec 2.0 Large LP (continued PT) 1500h model checkpoint from Getman et al. "What happens in continued pre-training? Analysis of self-supervised speech models with continued pre-training for colloquial Finnish ASR"
Getman, Y. (Creator), Zenodo, 11 Jun 2024
DOI: 10.5281/zenodo.11574055, https://zenodo.org/records/11574056
Dataset: Software or code
-
ITE typing dataset
Leino, K. (Creator), Zenodo, 28 Jun 2024
DOI: 10.5281/zenodo.12528162, https://zenodo.org/records/12528163
Dataset
-
wav2vec 2.0 Base VP-Finnish 1500h model checkpoint from Getman et al. "What happens in continued pre-training? Analysis of self-supervised speech models with continued pre-training for colloquial Finnish ASR"
Getman, Y. (Creator), Zenodo, 11 Jun 2024
DOI: 10.5281/zenodo.11571810, https://zenodo.org/records/11571811
Dataset: Software or code
-
wav2vec 2.0 Large LP (PT from scratch) 1500h model checkpoint from Getman et al. "What happens in continued pre-training? Analysis of self-supervised speech models with continued pre-training for colloquial Finnish ASR"
Getman, Y. (Creator), Zenodo, 11 Jun 2024
DOI: 10.5281/zenodo.11573886, https://zenodo.org/records/11573887
Dataset
-
wav2vec 2.0 Large VP-Uralic 1500h model checkpoint from Getman et al. "What happens in continued pre-training? Analysis of self-supervised speech models with continued pre-training for colloquial Finnish ASR"
Getman, Y. (Creator), Zenodo, 11 Jun 2024
DOI: 10.5281/zenodo.11573577, https://zenodo.org/records/11573578
Dataset: Software or code
-
Lahjoita puhetta semisupervised baseline Kaldi ASR model
Grósz, T. (Creator), Zenodo, 2022
Dataset: Software or code
-
Speech recognition alignments for Finnish parliament data
Virkkunen, A. (Creator), Mansikkaniemi, A. (Creator) & Kurimo, M. (Creator), Zenodo, 2021
DOI: 10.5281/zenodo.4581940, https://zenodo.org/record/4581941
Dataset
-
Finnish conversational chat corpus, source
Kurimo, M. (Creator), Language Bank of Finland, 2022
http://urn.fi/urn:nbn:fi:lb-2022060801
Dataset
-
Finnish parliament session 2
Mansikkaniemi, A. (Creator), Aalto University, 2019
http://hdl.handle.net/11304/6e927a1b-2222-488d-be78-1cf661691fd4
Dataset
-
Wav2Vec2 model checkpoint from Moisio et al. "Lahjoita puhetta – a large-scale corpus of spoken Finnish with some benchmarks"
Getman, Y. (Creator), Zenodo, 2022
Dataset: Software or code
-
wav2vec 2.0 Large LP (continued PT) model checkpoint from Getman et al. "What happens in continued pre-training? Analysis of self-supervised speech models with continued pre-training for colloquial Finnish ASR"
Getman, Y. (Creator), Zenodo, 11 Jun 2024
DOI: 10.5281/zenodo.11573973, https://zenodo.org/records/11573974
Dataset: Software or code
-
MeMAD multimodal image caption translation model
Grönroos, S. (Creator), Huet, B. (Contributor), Kurimo, M. (Contributor), Laaksonen, J. (Contributor), Pham, P. (Contributor), Sjöberg, M. (Contributor), Sulubacak, U. (Contributor), Tiedemann, J. (Contributor), Troncy, R. (Contributor) & Vázquez, R. (Contributor), Zenodo, 2020
DOI: 10.5281/zenodo.4038443, https://zenodo.org/record/4038444
Dataset
-
SIAK Corpus
Kurimo, M. (Creator), Language Bank of Finland, 2018
http://urn.fi/urn:nbn:fi:lb-2017081501
Dataset
-
Trained models for the paper: Topic Identification for Spontaneous Speech: Enriching Audio Features with Embedded Linguistic Information
Porjazovski, D. (Creator), Zenodo, Aug 2023
DOI: 10.23919/EUSIPCO58844.2023.10289822, https://zenodo.org10158851
Dataset: Software or code
-
wav2vec 2.0 Base LP (PT from scratch) model checkpoint from Getman et al. "What happens in continued pre-training? Analysis of self-supervised speech models with continued pre-training for colloquial Finnish ASR"
Getman, Y. (Creator), Zenodo, 11 Jun 2024
DOI: 10.5281/zenodo.11572683, https://zenodo.org/records/11572684
Dataset: Software or code
-
wav2vec 2.0 Base LP (continued PT) model checkpoint from Getman et al. "What happens in continued pre-training? Analysis of self-supervised speech models with continued pre-training for colloquial Finnish ASR"
Getman, Y. (Creator), Zenodo, 11 Jun 2024
DOI: 10.5281/zenodo.11573133, https://zenodo.org/records/11573134
Dataset: Software or code