This pilot study explores the viability of annotation projection from one language to another and evaluates the multilingual data set we have created for emotion analysis. We study different language pairs based on parallel corpora for sentiment and emotion annotations and explore annotator agreement. We show that the data source can provide reliable L1 data for annotation projection from high-resource languages, such as English, into low-resource languages, and that such projection is a reliable way of creating data sets for fine-grained sentiment analysis and emotion detection.
|CEUR Workshop Proceedings
|Published - 2020
|5th Conference on Digital Humanities in the Nordic Countries, DHN 2020 - Riga, Latvia
Duration: 21 Oct 2020 → 23 Oct 2020