Abstract
This paper describes the NiCT-ATR statistical machine translation (SMT) system used for the IWSLT 2006 evaluation compaign. We participated in all four language pair translation tasks (CE, JE, AE and IE) and all two tracks (OPEN and CSTAR). We used a phrase-based SMT in the OPEN track and a hybrid multiple translation engine in the CSTAR track. We also equipped our system with some of new preprocessing and post-processing techniques for Chinese word segmentation, named entity translation, punctuation and capitalization, sentence splitting, and language model adaptation. Our experiments show these features significantly improved our system.
Original language | English |
---|---|
Pages | 83-90 |
Number of pages | 8 |
Publication status | Published - 2006 |
Externally published | Yes |
Event | 3rd International Workshop on Spoken Language Translation, IWSLT 2006 - Keihanna Science City, Japan Duration: 2006 Nov 27 → 2006 Nov 28 |
Conference
Conference | 3rd International Workshop on Spoken Language Translation, IWSLT 2006 |
---|---|
Country/Territory | Japan |
City | Keihanna Science City |
Period | 06/11/27 → 06/11/28 |
ASJC Scopus subject areas
- Language and Linguistics
- Human-Computer Interaction
- Linguistics and Language