Okko Räsänen

  • Rakentajanaukio 2 C

20082020

Research output per year

If you made any changes in Pure these will be visible here soon.

Research Output

Filter
Conference contribution
2019

A computational model of early language acquisition from audiovisual experiences of young infants

Räsänen, O. & Khorrami, K., 1 Jan 2019, Proceedings of Interspeech. International Speech Communication Association, Vol. 2019-September. p. 3594-3598 5 p. (Interspeech - Annual Conference of the International Speech Communication Association).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File
29 Downloads (Pure)

Augmented CycleGANs for continuous scale normal-to-Lombard speaking style conversion

Seshadri, S., Juvela, L., Alku, P. & Räsänen, O., 2019, Proceedings of Interspeech. International Speech Communication Association, p. 2838-2842 (Interspeech - Annual Conference of the International Speech Communication Association).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File
3 Citations (Scopus)
51 Downloads (Pure)

Cycle-consistent adversarial networks for non-parallel vocal effort based speaking style conversion

Seshadri, S., Juvela, L., Yamagishi, J., Räsänen, O. & Alku, P., 1 May 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, p. 6835 - 6839 5 p. 8682648. (Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File
4 Citations (Scopus)
116 Downloads (Pure)

Data augmentation strategies for neural network F0 estimation

Airaksinen, M., Juvela, L., Alku, P. & Räsänen, O., 1 May 2019, 44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019; Brighton; United Kingdom; 12-17 May 2019 : Proceedings. IEEE, p. 6485 - 6489 5 p. 8683041. ( IEEE International Conference on Acoustics Speech and Signal Processing).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File
170 Downloads (Pure)
2018

Comparison of syllabification algorithms and training strategies for robust word count estimation across different languages and recording conditions

Räsänen, O., Seshadri, S. & Casillas, M., 1 Jan 2018, Proceedings of Interspeech. International Speech Communication Association, Vol. 2018-September. p. 1200-1204 5 p. (Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File
2 Citations (Scopus)
109 Downloads (Pure)

Time-regularized linear prediction for noise-robust extraction of the spectral envelope of speech

Airaksinen, M., Juvela, L., Räsänen, O. & Alku, P., 2 Sep 2018, Proceedings of Interspeech. International Speech Communication Association, p. 701-705 5 p. (Interspeech - Annual Conference of the International Speech Communication Association).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File
1 Citation (Scopus)
144 Downloads (Pure)
2017

Blind phoneme segmentation with temporal prediction errors

Michel, P., Räsänen, O., Thiolliere, R. & Dupoux, E., 2017, Proceedings of the Student Research Workshop at the Annual Meeting of the Association for Computational Linguistics. p. 62-68 7 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File
1 Citation (Scopus)
134 Downloads (Pure)

Comparison of Non-parametric Bayesian Mixture Models for Syllable Clustering and Zero-Resource Speech Processing

Seshadri, S., Remes, U. & Räsänen, O., Aug 2017, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. International Speech Communication Association, Vol. 2017-August. p. 2744-2748 5 p. (Interspeech: Annual Conference of the International Speech Communication Association).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File
1 Citation (Scopus)
146 Downloads (Pure)

Connecting stimulus-driven attention to the properties of infant-directed speech – Is exaggerated intonation also more surprising?

Räsänen, O., Kakouros, S. & Soderstrom, M., 2017, Proceedings of the 39th Annual Conference of the Cognitive Science Society. COGNITIVE SCIENCE SOCIETY, p. 998-1003

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Dirichlet process mixture models for clustering i-vector data

Seshadri, S., Remes, U. & Rasanen, O., 16 Jun 2017, 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017 - Proceedings. IEEE, p. 5470-5474 5 p. 7953202. (Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

1 Citation (Scopus)

Evaluation of Spectral Tilt Measures for Sentence Prominence Under Different Noise Conditions

Kakouros, S., Räsänen, O. & Alku, P., Aug 2017, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. International Speech Communication Association, Vol. 2017-August. p. 3211-3215 5 p. (Interspeech: Annual Conference of the International Speech Communication Association).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File
10 Citations (Scopus)
161 Downloads (Pure)

Language is Not About Language: Towards Formalizing the Role of Extra-Linguistic Factors in Human and Machine Language Acquisition and Communication

Räsänen, O., 25 Aug 2017, Proceedings of Workshop on Grounding Language Understanding (GLU). Salvi, G. & Dupont, S. (eds.). KTH Royal Institute of Technology, p. 37-41

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access

Speaking style conversion from normal to Lombard speech using a glottal vocoder and Bayesian GMMs

Ramirez Lopez, A., Seshadri, S., Juvela, L., Räsänen, O. & Alku, P., Aug 2017, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. International Speech Communication Association, Vol. 2017-August. p. 1363-1367 5 p. (Interspeech: Annual Conference of the International Speech Communication Association).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
File
12 Citations (Scopus)
263 Downloads (Pure)
2016

Analyzing distributional learning of Phonemic Categories in Unsupervised Deep Neural Networks

Räsänen, O., Nagamine, T. & Mesgarani, N., 10 Aug 2016, Proceedings of the 38th Annual Conference of the Cognitive Science Society, CogSci 2016. Papafragou, A., Grodner, D., Mirman, D. & Trueswell, J. C. (eds.). Austin, TX: COGNITIVE SCIENCE SOCIETY, p. 1757–1762 6 p.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access

Analyzing the Contribution of Top-Down Lexical and Bottom-Up Acoustic Cues in the Detection of Sentence Prominence

Kakouros, S., Pelemans, J., Verwimp, L., Wambacq, P. & Räsänen, O., 2016, Proceedings of the Annual Conference of the International Speech Communication Association: Interspeech'16, San Francisco, USA, Sept. 8-12, 2016. International Speech Communication Association, p. 1074-1078 (Proceedings of the Annual Conference of the International Speech Communication Association).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

5 Citations (Scopus)

Statistical Learning of Prosodic Patterns and Reversal of Perceptual Cues for Sentence Prominence

Kakouros, S. & Räsänen, O., 2016, Proceedings of the 38th Annual Conference of the Cognitive Science Society, CogSci 2016. Papafragou, A., Grodner, D., Mirman, D. & Trueswell, J. C. (eds.). COGNITIVE SCIENCE SOCIETY, p. 2489-2494

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
2015

Analyzing the Predictability of Lexeme-specific Prosodic Features as a Cue to Sentence Prominence

Kakouros, S. & Räsänen, O., 2015, 37th Annual Conference of the Cognitive Science Society, Pasadena, California, July 23-25, 2015. Noelle, D. C., Dale, R., Warlaumont, A. S., Yoshimi, J., Matlock, T., Jennings, C. D. & Maglio, P. P. (eds.). p. 1039-1044

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Automatic Detection of Sentence Prominence in Speech Using Predictability of Word-level Acoustic Features

Kakouros, S. & Räsänen, O., 2015, Interspeech-2015, Dresden, Germany, September 2016. p. 568-572

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

4 Citations (Scopus)

Computational evidence for effects of memory decay, familiarity preference and mutual exclusivity in cross-situational learning

Rasilo, H. & Räsänen, O., 2015, 37th Annual Conference of the Cognitive Science Society, Pasadena, California, July 2325, 2015. p. 19551960

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Cross-situational cues are relevant for early word segmentation

Räsänen, O. & Rasilo, H., 2015, 37th Annual Conference of the Cognitive Science Society, Pasadena, California, July 2325, 2015. p. 19491954

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Data-driven metric representing the maturation of preterm EEG

Koolen, N., Dereymaeker, A., Räsänen, O., Jansen, K., Vervish, J., Matic, V., De Vos, M., Naulaers, G., Van Huffel, S. & Vanhatalo, S., 2015, 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Milan, Italy, August 2529. p. 1492-1495

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

6 Citations (Scopus)

Generating Hyperdimensional Distributed Representations from Continuous-Valued Multivariate Sensory Input

Räsänen, O., 2015, 37th Annual Conference of the Cognitive Science Society, Pasadena, California, July 2325, 2015. p. 19431948

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Unsupervised word discovery from speech using automatic segmentation into syllable-like units

Räsänen, O., Doyle, G. & Frank, M., 2015, Interspeech-2015, Dresden, Germany, September 610. p. 32043208

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

30 Citations (Scopus)

Weakly-supervised word learning is improved by an active online algorithm

Rasilo, H. & Räsänen, O., 2015, Interspeech-2015, Dresden, Germany, September 610. p. 15611565

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

1 Citation (Scopus)
2014

Basic cuts revisited: Temporal segmentation of speech into phone-like units with statistical learning at a pre-linguistic level

Räsänen, O., 2014, 36th Annual Conference of the Cognitive Science Society, Quebec, Canada, July.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Perception of Sentence Stress in English Infant Directed Speech

Kakouros, S. & Räsänen, O., 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 2014. p. 1821-1825

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

4 Citations (Scopus)

Statistical Unpredictability of F0 Trajectories as a Cue to Sentence Stress

Kakouros, S. & Räsänen, O., 2014, Proceedings of the 36th Annual Conference of the Cognitive Science Society, CogSci 2014. p. 1246-1251

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
2013

Attention based temporal filtering of sensory signals for data redundancy reduction

Kakouros, S., Räsänen, O. & Laine, U. K., 18 Oct 2013, 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 - Proceedings. p. 3188-3192 5 p. 6638246

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

2 Citations (Scopus)

Automatic self-supervised learning of associations between speech and text

Knuuttila, J., Räsänen, O. & Laine, U., 2013, Interspeech 2013, Lyon, France, August 2529. p. 465-469 (Interspeech).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Random subset feature selection in automatic recognition of developmental disorders, affective states, and level of conflict from speech

Räsänen, O. & Pohjalainen, J., 2013, Interspeech'2013, Lyon, France, August 2529. p. 210-214 (Interspeech).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

47 Citations (Scopus)

Virtual infant's online acquisition of vowel categories and their mapping between dissimilar bodies

Rasilo, H., Räsänen, O. & de Boer, B., 2013, Workshop on Speech Production in Automatic Speech Recognition, Lyon, France, August 30, 2013.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

2012

Acoustic analysis supports the existence of a single distributional learning mechanism in structural rule learning from an artificial language

Räsänen, O. & Rasilo, H., 2012, Proceedings of the 34th Annual Meeting of the Cognitive Science Society, CogSci 2012: Building Bridges Across Cognitive Sciences. Miyake, N., Peebles, D. & Cooper, R. (eds.). Austin, Texas: COGNITIVE SCIENCE SOCIETY, p. 887-892

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access

Average Spectrotemporal Structure of Continuous Speech Matches with the Frequency Resolution of Human Hearing

Räsänen, O., 2012, Proceedings of Interspeech'2012. Speech Communication Association, I. (ed.). Portland, Oregon: International Speech Communication Association, p. 1444-1447 4 p. (Proceedings of the Annual Conference of the International Speech Association).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
2 Citations (Scopus)

Context induced merging of synonymous word models in computational modeling of early language acquisition

Räsänen, O., 2012, Proceedings of the 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012). Kyoto, Japan, p. 5037-5040

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

5 Citations (Scopus)

Feature Selection for Speaker Traits

Pohjalainen, J., Kadioglu, S. & Räsänen, O., 2012, Interspeech 2012, Portland, Oregon, USA, Sept. 9-13, 2012.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

14 Citations (Scopus)

Hierarchical unsupervised discovery of user context from multivariate sensory data

Räsänen, O., 2012, Proceedings of the 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012). Kyoto, Japan, p. 2105-2108

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

1 Citation (Scopus)

Modeling spoken language acquisition with a generic cognitive architecture for associative learning

Räsänen, O., Rasilo, H. & Laine, U., 2012, Proceedings of Interspeech'2012. Speech Communication Association, I. (ed.). International Speech Communication Association, p. 919-922 4 p. (Proceedings of the Annual Conference of the International Speech Association).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
1 Citation (Scopus)

Non-auditory cognitive capabilities in computational modeling of early language acquisition

Räsänen, O., 2012, Proceedings of Interspeech'2012. Speech Communication Association, I. (ed.). International Speech Communication Association, p. 915-918 4 p. (Proceedings of the Annual Conference of the International Speech Association).

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Open Access
1 Citation (Scopus)
2011

Comparison of Classifiers in Audio and Acceleration Based Context Classification in Mobile Phones

Räsänen, O., Leppänen, J., Laine, U. & Saarinen, J., 2011, EUSIPCO The 2011 European Signal Processing Conference (EUSIPCO-2011), Barcelona, Spain, August 29 - September 2, 2011.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

6 Citations (Scopus)

Method for Speech Inversion with Large Scale Statistical Evaluation

Rasilo, H., Laine, U., Räsänen, O. & Altosaar, T., 2011, 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, August 28-31, 2011. p. 2693-2696

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

2 Citations (Scopus)
2010

Estimation studies of vocal tract shape trajectory using a variable length and lossy Kelly-Lochbaum model

Rasilo, H., Laine, U. & Räsänen, O., 2010, Interspeech`10, Chiba, Japan, September 26-30, 2010. p. 2414-2417

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

2 Citations (Scopus)

Fully Unsupervised Word Learning from Continuous Speech Using Transitional Probabilities of Atomic Acoustic Events

Räsänen, O., 2010, Interspeech 2010, Makuhari, Japan, September 26-30, 2010. p. 2922-2925

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

1 Citation (Scopus)
2009

A comparison and combination of segmental and fixed-frame signal representations in NMF-based word recognition

Räsänen, O. & Diersen, J., 2009, The 17th Nordic Conference on Computational Linguistics, NODALIDA 2009, Odense Denmark, May 14-16 2009.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

An Improved Speech Segmentation Quality Measure: the R-value

Räsänen, O., Laine, U. & Altosaar, T., 2009, 10th Interspeech Conference, Brighton, UK, September 6-10, 2009.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

32 Citations (Scopus)

A noise robust method for pattern discovery in quantized time series: the concept matrix approach

Räsänen, O., Laine, U. K. & Altosaar, T., 2009, 10th Interspeech Conference, Brighton, UK, September 6-10, 2009..

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

8 Citations (Scopus)

Discovering Keywords from Cross-Modal Input: Ecological vs. Engineering Methods for Enhancing Acoustic Repetitions

Aimetti, G., Moore, R., Bosch, L. T., Räsänen, O. & Laine, U. K., 2009, 10th Interspeech Conference, Brighton, UK, September 6-10, 2009.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

1 Citation (Scopus)

Do Multiple Caregivers Speed up Language Acquisition?

Bosch, L. T., Räsänen, O., Driesen, J., Aimetti, G., Altosaar, T. & Boves, L., 2009, 10th Interspeech Conference, Brighton, UK, September 6-10, 2009..

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

10 Citations (Scopus)

Indirect estimation of formant frequencies through mean spectral variance with application to automatic gender recognition

Laine, U. K. & Räsänen, O., 2009, 6th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications (MAVEBA2009), Firenze, Italy, December 14-16, 2009..

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Learning meaningful units from multimodal input - the effect of interaction strategies

Bosch, L. T., Boves, L. & Räsänen, O., 2009, Workshop on Child, Computer and Interaction 2009, Boston, MA, USA, November 5, 2009.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

2 Citations (Scopus)

Self-learning Vector Quantization for Pattern Discovery from Speech

Räsänen, O., Laine, U. K. & Altosaar, T., 2009, 10th Interspeech Conference, Brighton, UK, September 6-10, 2009.

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

11 Citations (Scopus)