TY - GEN
T1 - Marker-based chunking in eleven European languages for analogy-based translation
AU - Takeya, Kota
AU - Lepage, Yves
N1 - Funding Information:
This paper is part of the outcome of research performed under a Waseda University Grant for Special Research Project (project number: 2010A-906).
PY - 2014
Y1 - 2014
AB - An example-based machine translation (EBMT) system based on proportional analogies requires numerous proportional analogies between linguistic units to work properly. Consequently, long sentences cannot be handled directly in such a framework. Cutting sentences into chunks would be a solution. Using different markers, we count the number of proportional analogies between chunks in 11 European languages. As expected, the number of proportional analogies between chunks found is very high. These results, and preliminary experiments in translation, are promising for the EBMT system that we intend to build.
UR - http://www.scopus.com/inward/record.url?scp=84905821718&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84905821718&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-08958-4_35
DO - 10.1007/978-3-319-08958-4_35
M3 - Conference contribution
AN - SCOPUS:84905821718
SN - 9783319089577
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 432
EP - 444
BT - Human Language Technology Challenges for Computer Science and Linguistics - 5th Language and Technology Conference, LTC 2011, Revised Selected Papers
PB - Springer Verlag
T2 - 5th Language and Technology Conference, LTC 2011
Y2 - 25 November 2011 through 27 November 2011
ER -