Designing a collaborative process to create bilingual dictionaries of Indonesian ethnic languages

Arbi Haza Nasution, Yohei Murakami, Toru Ishida

研究成果: Conference contribution

8 被引用数 (Scopus)

抄録

The constraint-based approach has been proven useful for inducing bilingual dictionary for closely-related low-resource languages. When we want to create multiple bilingual dictionaries linking several languages, we need to consider manual creation by a native speaker if there are no available machine-readable dictionaries are available as input. To overcome the difficulty in planning the creation of bilingual dictionaries, the consideration of various methods and costs, plan optimization is essential. Utilizing both constraint-based approach and plan optimizer, we design a collaborative process for creating 10 bilingual dictionaries from every combination of 5 languages, i.e., Indonesian, Malay, Minangkabau, Javanese, and Sundanese. We further design an online collaborative dictionary generation to bridge spatial gap between native speakers. We define a heuristic plan that only utilizes manual investment by the native speaker to evaluate our optimal plan with total cost as an evaluation metric. The optimal plan outperformed the heuristic plan with a 63.3% cost reduction.

本文言語English
ホスト出版物のタイトルLREC 2018 - 11th International Conference on Language Resources and Evaluation
編集者Hitoshi Isahara, Bente Maegaard, Stelios Piperidis, Christopher Cieri, Thierry Declerck, Koiti Hasida, Helene Mazo, Khalid Choukri, Sara Goggi, Joseph Mariani, Asuncion Moreno, Nicoletta Calzolari, Jan Odijk, Takenobu Tokunaga
出版社European Language Resources Association (ELRA)
ページ3397-3404
ページ数8
ISBN(電子版)9791095546009
出版ステータスPublished - 2019
外部発表はい
イベント11th International Conference on Language Resources and Evaluation, LREC 2018 - Miyazaki, Japan
継続期間: 2018 5月 72018 5月 12

出版物シリーズ

名前LREC 2018 - 11th International Conference on Language Resources and Evaluation

Other

Other11th International Conference on Language Resources and Evaluation, LREC 2018
国/地域Japan
CityMiyazaki
Period18/5/718/5/12

ASJC Scopus subject areas

  • 言語学および言語
  • 教育
  • 図書館情報学
  • 言語および言語学

フィンガープリント

「Designing a collaborative process to create bilingual dictionaries of Indonesian ethnic languages」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル