Hierarchical sub-sentential alignment with anymalign

Adrien Lardilleux, François Yvon, Yves Lepage

研究成果: Paper査読

11 被引用数 (Scopus)

抄録

We present a sub-sentential alignment algorithm that relies on association scores between words or phrases. This algorithm is inspired by previous work on alignment by recursive binary segmentation and on document clustering. We evaluate the resulting alignments on machine translation tasks and show that we can obtain state-of-the-art results, with gains up to more than 4 BLEU points compared to previous work, with a method that is simple, independent of the size of the corpus to be aligned, and directly computes symmetric alignments. This work also provides new insights regarding the use of "heuristic" alignment scores in statistical machine translation.

本文言語English
ページ279-286
ページ数8
出版ステータスPublished - 2012
イベント16th Annual Conference of the European Association for Machine Translation, EAMT 2012 - Trento, Italy
継続期間: 2012 5月 282012 5月 30

Other

Other16th Annual Conference of the European Association for Machine Translation, EAMT 2012
国/地域Italy
CityTrento
Period12/5/2812/5/30

ASJC Scopus subject areas

  • 言語および言語学
  • 人間とコンピュータの相互作用
  • ソフトウェア

フィンガープリント

「Hierarchical sub-sentential alignment with anymalign」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル