Subword Representations Successfully Decode Brain Responses to Morphologically Complex Written Words

Tero Hakala, Tiina Lindh-Knuutila, Annika Hulten, Minna Lehtonen, Riitta Salmelin*

*Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

13 Downloads (Pure)

Abstract

This study extends the idea of decoding word-evoked brain activations using a corpus-semantic vector space to multimorphemic words in the agglutinative Finnish language. The corpus-semantic models are trained on word segments, and decoding is carried out with word vectors that are composed of these segments. We tested several alternative vector-space models using different segmentations: no segmentation (whole word), linguistic morphemes, statistical morphemes, random segmentation, and character-level 1-, 2- and 3-grams, and paired them with recorded MEG responses to multimorphemic words in a visual word recognition task. For all variants, the decoding accuracy exceeded the standard word-label permutation-based significance thresholds at 350–500 ms after stimulus onset. However, the critical segment-label permutation test revealed that only those segmentations that were morphologically aware reached significance in the brain decoding task. The results suggest that both whole-word forms and morphemes are represented in the brain and show that neural decoding using corpus-semantic word representations derived from compositional subword segments is applicable also for multimorphemic word forms. This is especially relevant for languages with complex morphology, because a large proportion of word forms are rare and it can be difficult to find statistically reliable surface representations for them in any large corpus.
Original languageEnglish
Pages (from-to)844-863
Number of pages20
JournalNeurobiology of language
Volume5
Issue number4
DOIs
Publication statusPublished - 2024
MoE publication typeA1 Journal article-refereed

Keywords

  • decoding
  • MEG
  • multimorphemic words
  • statistical morphemes
  • word2vec

Fingerprint

Dive into the research topics of 'Subword Representations Successfully Decode Brain Responses to Morphologically Complex Written Words'. Together they form a unique fingerprint.

Cite this