Discovering latent country words: A step towards cross-cultural emotional communication

Heeryon Cho*, Toru Ishida

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Knowing what concepts are substantial to each country can be helpful in enhancing emotional communication between two countries. As a concrete example of identifying substantial country concepts, we focus on a task of finding latent country words from cross-cultural texts of two countries. We do this by combining word embedding and tensor decomposition: common words that appear in both countries’ texts are selected; their country specific word embeddings are learned; a three-way tensor consisting of word factor, word embedding factor, and country factor are constructed; and CANDECOMP/PARAFAC decomposition is performed on the three-way tensor while fixing the country factor values of the decomposed result. We tested our method on a motivating example of finding latent country words from J-pop lyrics from Japan and K-pop lyrics from South Korea. We found that J-pop lyrics words feature nature related motifs such as ‘petal’, ‘cloud’, ‘universe’, ‘star’, and ‘sky’, whereas K-pop lyrics words highlight human body related motifs such as ‘style’, ‘shirt’, ‘head’, ‘foot’, and ‘skin’.

Original languageEnglish
Title of host publicationCollaboration Technologies and Social Computing - 25th International Conference, CRIWG+CollabTech 2019, Proceedings
EditorsHideyuki Nakanishi, Hironori Egi, Irene-Angelica Chounta, Hideyuki Takada, Satoshi Ichimura, Ulrich Hoppe
PublisherSpringer Verlag
Pages232-241
Number of pages10
ISBN (Print)9783030280109
DOIs
Publication statusPublished - 2019
Event25th International Conference on Collaboration Technologies and Social Computing, CRIWG+CollabTech 2019 - Kyoto, Japan
Duration: 2019 Sept 42019 Sept 6

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11677 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference25th International Conference on Collaboration Technologies and Social Computing, CRIWG+CollabTech 2019
Country/TerritoryJapan
CityKyoto
Period19/9/419/9/6

Keywords

  • Cross-cultural text analysis
  • Tensor decomposition
  • Word embedding

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint

Dive into the research topics of 'Discovering latent country words: A step towards cross-cultural emotional communication'. Together they form a unique fingerprint.

Cite this