TY - JOUR
T1 - Organising lexica into analogical grids
T2 - a study of a holistic approach for morphological generation under various sizes of data in various languages
AU - Fam, Rashel
AU - Lepage, Yves
N1 - Publisher Copyright:
© 2022 Informa UK Limited, trading as Taylor & Francis Group.
PY - 2022
Y1 - 2022
N2 - Morphological generation is a task where given a lemma and a morphosyntactic description of the target form, we are asked to generate the target form. Knowing that the syntactic and semantic relations to other forms are reflected by the word form itself, we show how to exploit these relations between word forms, holistically, that is, as a whole, to derive the target form without even breaking them into morphemes. Experimental results show that by organising the lexica into analogical grids we are able to improve the accuracy of morphological generation by up to 8% in low data scenarios. Our holistic approach always performs better than a morpheme-based baseline. We also enquire possible improvements by using data augmentation for neural approaches, especially in low data scenarios. However, our system seems not to gain any advantage from having more data after some point in time.
AB - Morphological generation is a task where given a lemma and a morphosyntactic description of the target form, we are asked to generate the target form. Knowing that the syntactic and semantic relations to other forms are reflected by the word form itself, we show how to exploit these relations between word forms, holistically, that is, as a whole, to derive the target form without even breaking them into morphemes. Experimental results show that by organising the lexica into analogical grids we are able to improve the accuracy of morphological generation by up to 8% in low data scenarios. Our holistic approach always performs better than a morpheme-based baseline. We also enquire possible improvements by using data augmentation for neural approaches, especially in low data scenarios. However, our system seems not to gain any advantage from having more data after some point in time.
KW - Analogical grids
KW - language productivity
KW - morphological complexity
KW - morphological generation
KW - organisation of lexicon
UR - http://www.scopus.com/inward/record.url?scp=85131362906&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85131362906&partnerID=8YFLogxK
U2 - 10.1080/0952813X.2022.2078890
DO - 10.1080/0952813X.2022.2078890
M3 - Article
AN - SCOPUS:85131362906
SN - 0952-813X
JO - Journal of Experimental and Theoretical Artificial Intelligence
JF - Journal of Experimental and Theoretical Artificial Intelligence
ER -