TY - GEN
T1 - The effect of corpus size on case frame acquisition for discourse analysis
AU - Sasano, Ryohei
AU - Kawahara, Daisuke
AU - Kurohashi, Sadao
PY - 2009
Y1 - 2009
N2 - This paper reports the effect of corpus size on case frame acquisition for discourse analysis in Japanese. For this study, we collected a Japanese corpus consisting of up to 100 billion words, and constructed case frames from corpora of six different sizes. Then, we applied these case frames to syntactic and case structure analysis, and zero anaphora resolution. We obtained better results by using case frames constructed from larger corpora; the performance was not saturated even with a corpus size of 100 billion words.
UR - http://www.scopus.com/inward/record.url?scp=78649988126&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=78649988126&partnerID=8YFLogxK
U2 - 10.3115/1620754.1620830
DO - 10.3115/1620754.1620830
M3 - Conference contribution
AN - SCOPUS:78649988126
SN - 9781932432411
T3 - NAACL HLT 2009 - Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Proceedings of the Conference
SP - 521
EP - 529
BT - NAACL HLT 2009 - Human Language Technologies
PB - Association for Computational Linguistics (ACL)
T2 - Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, NAACL HLT 2009
Y2 - 31 May 2009 through 5 June 2009
ER -