TY - JOUR
T1 - Research and Implementation of Chinese Couplet Generation System with Attention-Based Transformer Mechanism
AU - Wang, Yufeng
AU - Zhang, Jiang
AU - Zhang, Bo
AU - Jin, Qun
N1 - Publisher Copyright:
© 2014 IEEE.
PY - 2022/8/1
Y1 - 2022/8/1
N2 - Couplet is a unique art form in Chinese traditional culture. The development of deep neural network (DNN) technology makes it possible for computers to automatically generate couplets. Especially, Transformer is a DNN-based 'Encoder-Decoder' framework, and widely used in natural language processing (NLP). However, the existed Transformer mechanism cannot fully exploit the essential linguistic knowledge in Chinese, including the special format and requirements of Chinese couplets. Therefore, this article adapts the Transformer mechanism to generate meaningful Chinese couplets. Specifically, the contributions of our work are threefold. First, considering the fact that the words in the corresponding positions of the antecedent clause and the subsequent clause in a Chinese couplet always have same part-of-speech (pos, i.e., word class), pos information is intentionally added into the Transformer to improve the accuracy of the conceived couplet. Second, to deal with the large number of unregistered and low-frequency words in Chinese couplet, a specific unregistered/low-frequency word processing mechanism (UWP) is designed and combined with the Transformer model. Third, to further improve the coherence of couplets, we incorporate the polish mechanisms (PMs) into Transformer model. In terms of three evaluation criteria including bilingual evaluation understudy (BLEU), perplexity, and human evaluation, the experimental results demonstrate the effectiveness of our designed Chinese couplet generation system.
AB - Couplet is a unique art form in Chinese traditional culture. The development of deep neural network (DNN) technology makes it possible for computers to automatically generate couplets. Especially, Transformer is a DNN-based 'Encoder-Decoder' framework, and widely used in natural language processing (NLP). However, the existed Transformer mechanism cannot fully exploit the essential linguistic knowledge in Chinese, including the special format and requirements of Chinese couplets. Therefore, this article adapts the Transformer mechanism to generate meaningful Chinese couplets. Specifically, the contributions of our work are threefold. First, considering the fact that the words in the corresponding positions of the antecedent clause and the subsequent clause in a Chinese couplet always have same part-of-speech (pos, i.e., word class), pos information is intentionally added into the Transformer to improve the accuracy of the conceived couplet. Second, to deal with the large number of unregistered and low-frequency words in Chinese couplet, a specific unregistered/low-frequency word processing mechanism (UWP) is designed and combined with the Transformer model. Third, to further improve the coherence of couplets, we incorporate the polish mechanisms (PMs) into Transformer model. In terms of three evaluation criteria including bilingual evaluation understudy (BLEU), perplexity, and human evaluation, the experimental results demonstrate the effectiveness of our designed Chinese couplet generation system.
KW - Deep neural network (DNN) based Transformer mechanism
KW - part-of-speech features
KW - polish-up mechanism
KW - unregistered and low-frequency words
UR - http://www.scopus.com/inward/record.url?scp=85104573648&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85104573648&partnerID=8YFLogxK
U2 - 10.1109/TCSS.2021.3072153
DO - 10.1109/TCSS.2021.3072153
M3 - Article
AN - SCOPUS:85104573648
SN - 2329-924X
VL - 9
SP - 1020
EP - 1028
JO - IEEE Transactions on Computational Social Systems
JF - IEEE Transactions on Computational Social Systems
IS - 4
ER -