Detecting language divergences and predicting possible sub-translations is one of the most essential issues in machine translation. Since the existence of translation divergences, it is impractical to straightforward translate from source sentence into target sentence while keeping the high degree of accuracy and without additional information. In this paper, we investigate the problem from an emerging and special point of view: bigrams and the corresponding translations. We first profile corpora and explore the constituents of bigrams in the source language. Then we translate unseen bigrams based on proportional analogy and filter the outputs using an Support Vector Machine (SVM) classifier. The experiment results also show that even a small set of features from analogous can provide meaningful information in translating by analogy.
|Published - 2015
|29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015 - Shanghai, China
継続期間: 2015 10月 30 → 2015 11月 1
|29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015
|15/10/30 → 15/11/1
ASJC Scopus subject areas