TY - GEN
T1 - Automatic prediction of misconceptions in multilingual computer-mediated communication
AU - Yamashita, Naomi
AU - Ishida, Toru
PY - 2006
Y1 - 2006
N2 - Multilingual communities using machine translation to overcome language barriers are showing up with increasing frequency. However, when a large number of translation errors get mixed into conversations, users have difficulty completely understanding each other. In this paper, we focus on misconceptions found in high volume in actual online conversations using machine translation. We first examine the response patterns in machine translation-mediated communication and associate them with misconceptions. Analysis results indicate that response messages to include misconceptions posted via machine translation tend to be incoherent, often focusing on short phrases of the original message. Next, based on the analysis results, we propose a method that automatically predicts the occurrence of misconceptions in each dialogue. The proposed method assesses the tendency of each dialogue including misconceptions by calculating the gaps between the regular discussion thread (syntactic thread) and the discussion thread based on lexical cohesion (semantic thread). Verification results show significant positive correlation between actual misconception frequency and gaps between syntactic and semantic threads, which indicate the validity of the method.
AB - Multilingual communities using machine translation to overcome language barriers are showing up with increasing frequency. However, when a large number of translation errors get mixed into conversations, users have difficulty completely understanding each other. In this paper, we focus on misconceptions found in high volume in actual online conversations using machine translation. We first examine the response patterns in machine translation-mediated communication and associate them with misconceptions. Analysis results indicate that response messages to include misconceptions posted via machine translation tend to be incoherent, often focusing on short phrases of the original message. Next, based on the analysis results, we propose a method that automatically predicts the occurrence of misconceptions in each dialogue. The proposed method assesses the tendency of each dialogue including misconceptions by calculating the gaps between the regular discussion thread (syntactic thread) and the discussion thread based on lexical cohesion (semantic thread). Verification results show significant positive correlation between actual misconception frequency and gaps between syntactic and semantic threads, which indicate the validity of the method.
KW - Computer-Mediated Communication
KW - Machine Translation
KW - Misconception
KW - Multilingual Groups
UR - http://www.scopus.com/inward/record.url?scp=33746074110&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33746074110&partnerID=8YFLogxK
U2 - 10.1145/1111449.1111469
DO - 10.1145/1111449.1111469
M3 - Conference contribution
AN - SCOPUS:33746074110
SN - 1595932879
SN - 9781595932877
T3 - International Conference on Intelligent User Interfaces, Proceedings IUI
SP - 62
EP - 69
BT - IUI 06 - 2006 International Conference on Intelligent User Interfaces
T2 - IUI 06 - 2006 International Conference on Intelligent User Interfaces
Y2 - 29 January 2005 through 1 February 2005
ER -