TY - JOUR
T1 - Towards language service creation and customization for low-resource languages
AU - Lin, Donghui
AU - Murakami, Yohei
AU - Ishida, Toru
N1 - Funding Information:
Funding: This research was partially supported by a Grant-in-Aid for Scientific Research (A) (17H00759, 2017-2020), a Grant-in-Aid for Scientific Research (B) (18H03341, 2018-2020), and a Grant-in-Aid for Young Scientists (A) (17H04706, 2017–2020) from the Japan Society for the Promotion of Science (JSPS).
Funding Information:
This research was partially supported by a Grant-in-Aid for Scientific Research (A) (17H00759, 2017-2020), a Grant-in-Aid for Scientific Research (B) (18H03341, 2018-2020), and a Grant-in-Aid for Young Scientists (A) (17H04706, 2017-2020) from the Japan Society for the Promotion of Science (JSPS). The authors would like to thank Mairidan Wushouer and Arbi Haza Nasution for their empirical studies of bilingual dictionary induction of Turkic languages [43] and Indonesian ethnic languages [34] based on the concept described in this research.
Publisher Copyright:
© 2020 by the authors.
PY - 2020/2/1
Y1 - 2020/2/1
N2 - The most challenging issue with low-resource languages is the difficulty of obtaining enough language resources. In this paper, we propose a language service framework for low-resource languages that enables the automatic creation and customization of new resources from existing ones. To achieve this goal, we first introduce a service-oriented language infrastructure, the Language Grid; it realizes new language services by supporting the sharing and combining of language resources. We then show the applicability of the Language Grid to low-resource languages. Furthermore, we describe how we can now realize the automation and customization of language services. Finally, we illustrate our design concept by detailing a case study of automating and customizing bilingual dictionary induction for low-resource Turkic languages and Indonesian ethnic languages.
AB - The most challenging issue with low-resource languages is the difficulty of obtaining enough language resources. In this paper, we propose a language service framework for low-resource languages that enables the automatic creation and customization of new resources from existing ones. To achieve this goal, we first introduce a service-oriented language infrastructure, the Language Grid; it realizes new language services by supporting the sharing and combining of language resources. We then show the applicability of the Language Grid to low-resource languages. Furthermore, we describe how we can now realize the automation and customization of language services. Finally, we illustrate our design concept by detailing a case study of automating and customizing bilingual dictionary induction for low-resource Turkic languages and Indonesian ethnic languages.
KW - Bilingual dictionary
KW - Language resources
KW - Language service infrastructure
KW - Low-resource languages
KW - The language grid
UR - http://www.scopus.com/inward/record.url?scp=85081113999&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85081113999&partnerID=8YFLogxK
U2 - 10.3390/info11020067
DO - 10.3390/info11020067
M3 - Article
AN - SCOPUS:85081113999
SN - 2078-2489
VL - 11
JO - Information (Switzerland)
JF - Information (Switzerland)
IS - 2
M1 - 67
ER -