TY - GEN
T1 - Evaluating Image-Inspired Poetry Generation
AU - Wu, Chao Chung
AU - Song, Ruihua
AU - Sakai, Tetsuya
AU - Cheng, Wen Feng
AU - Xie, Xing
AU - Lin, Shou De
N1 - Publisher Copyright:
© 2019, Springer Nature Switzerland AG.
PY - 2019
Y1 - 2019
N2 - Creative natural language generation, such as poetry generation, writing lyrics, and storytelling, is appealing but difficult to evaluate. We take the application of image-inspired poetry generation as a showcase and investigate two problems in evaluation: (1) how to evaluate the generated text when there are no ground truths, and (2) how to evaluate nondeterministic systems that output different texts given the same input image. Regarding the first problem, we first design a judgment tool to collect ratings of a few poems for comparison with the inspiring image shown to assessors. We then propose a novelty measurement that quantifies how different a generated text is compared to a known corpus. Regarding the second problem, we experiment with different strategies to approximate evaluating multiple trials of output poems. We also use a measure for quantifying the diversity of different texts generated in response to the same input image, and discuss their merits.
AB - Creative natural language generation, such as poetry generation, writing lyrics, and storytelling, is appealing but difficult to evaluate. We take the application of image-inspired poetry generation as a showcase and investigate two problems in evaluation: (1) how to evaluate the generated text when there are no ground truths, and (2) how to evaluate nondeterministic systems that output different texts given the same input image. Regarding the first problem, we first design a judgment tool to collect ratings of a few poems for comparison with the inspiring image shown to assessors. We then propose a novelty measurement that quantifies how different a generated text is compared to a known corpus. Regarding the second problem, we experiment with different strategies to approximate evaluating multiple trials of output poems. We also use a measure for quantifying the diversity of different texts generated in response to the same input image, and discuss their merits.
KW - AI-based creation
KW - Evaluation
KW - Image
KW - Natural language generation
KW - Poetry generation
UR - http://www.scopus.com/inward/record.url?scp=85075561188&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85075561188&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-32233-5_42
DO - 10.1007/978-3-030-32233-5_42
M3 - Conference contribution
AN - SCOPUS:85075561188
SN - 9783030322328
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 539
EP - 551
BT - Natural Language Processing and Chinese Computing - 8th CCF International Conference, NLPCC 2019, Proceedings
A2 - Tang, Jie
A2 - Kan, Min-Yen
A2 - Zhao, Dongyan
A2 - Li, Sujian
A2 - Zan, Hongying
PB - Springer
T2 - 8th CCF International Conference on Natural Language Processing and Chinese Computing, NLPCC 2019
Y2 - 9 October 2019 through 14 October 2019
ER -