TY - GEN
T1 - Improving sampling-based alignment by investigating the distribution of N-grams in phrase translation tables
AU - Luo, Juan
AU - Lardilleux, Adrien
AU - Lepage, Yves
PY - 2011
Y1 - 2011
N2 - This paper describes an approach to improve the performance of sampling-based multilingual alignment on translation tasks by investigating the distribution of n-grams in the translation tables. This approach consists in enforcing the alignment of n-grams. The quality of phrase translation tables output by this approach and that of MGIZA++ is compared in statistical machine translation tasks. Significant improvements for this approach are reported. In addition, merging translation tables is shown to outperform state-of-the-art techniques.
AB - This paper describes an approach to improve the performance of sampling-based multilingual alignment on translation tasks by investigating the distribution of n-grams in the translation tables. This approach consists in enforcing the alignment of n-grams. The quality of phrase translation tables output by this approach and that of MGIZA++ is compared in statistical machine translation tasks. Significant improvements for this approach are reported. In addition, merging translation tables is shown to outperform state-of-the-art techniques.
KW - Alignment
KW - Phrase translation table
KW - Statistical machine translation
UR - http://www.scopus.com/inward/record.url?scp=84863876792&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84863876792&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84863876792
SN - 9784905166023
T3 - PACLIC 25 - Proceedings of the 25th Pacific Asia Conference on Language, Information and Computation
SP - 150
EP - 159
BT - PACLIC 25 - Proceedings of the 25th Pacific Asia Conference on Language, Information and Computation
T2 - 25th Pacific Asia Conference on Language, Information and Computation, PACLIC 25
Y2 - 16 December 2011 through 18 December 2011
ER -