Abstract
This paper is a pilot study that aims to explore the viability of annotation projection from one language to another as well as to evaluate the multilingual data set we have created for emotion analysis. We study different language pairs based on parallel corpora for sentiment and emotion annotations and explore annotator agreement. We show that the data source is a possible one for reliable L1 data to be used in annotation projection from high-resource languages, such as English, into low-resource languages and that this is a reliable way of creating data sets for fine-grained sentiment analysis and emotion detection.
Original language | English |
---|---|
Pages (from-to) | 38-50 |
Number of pages | 13 |
Journal | CEUR WORKSHOP PROCEEDINGS |
Volume | 2612 |
Publication status | Published - 2020 |
MoE publication type | A4 Conference publication |
Event | Digital Humanities in the Nordic Countries - Riga, Latvia Duration: 21 Oct 2020 → 23 Oct 2020 Conference number: 5 |
Keywords
- Annotation Projection
- Sentiment Analysis
- Translation Studies