TY - JOUR
T1 - A method for supporting document selection in cross-language information retrieval and its evaluation
AU - Suzuki, Masami
AU - Inoue, Naomi
AU - Hashimoto, Kazuo
PY - 2001/11
Y1 - 2001/11
N2 - Users need useful clues for selecting the desired content from the many retrieval results that a (usually) vague search request returns. Such support is indispensable, and much more significant than in monolingual retrieval, when filtering translingual retrieval results. This paper describes an attempt to provide appropriate translations of the major keywords in each document of a cross-language information retrieval (CLIR) result as browsing support for users. Our approach to determining appropriate translations of major keywords is based on word co-occurrence distributions in the translation target language, in view of the actual situation of WWW content, for which aligned parallel (multilingual) corpora are difficult to obtain. The proposed method provides higher-quality keyword translation and thus more effective support for identifying the target documents in the retrieval result. We report the advantage of this browsing support technique through evaluation experiments, including a comparison with conditions in which users refer to a translated document summary, and discuss related issues to be examined towards more effective cross-language information extraction.
KW - Browsing support
KW - Cross-language information retrieval
KW - Partial translation
KW - Term list
UR - http://www.scopus.com/inward/record.url?scp=2942739997&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=2942739997&partnerID=8YFLogxK
U2 - 10.1023/A:1011877503081
DO - 10.1023/A:1011877503081
M3 - Review article
AN - SCOPUS:2942739997
SN - 0010-4817
VL - 35
SP - 421
EP - 438
JO - Computers and the Humanities
JF - Computers and the Humanities
IS - 4
ER -