Summarization of multiple documents with rhetorical annotation

Sohei Aya*, Yutaka Matsuo, Naoaki Okazaki, Kôiti Hasida, Mitsuru Ishizuka

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

In this paper, we propose a new algorithm of summarization which targets a new kind of structured contents. The structured content, which is to be created by semantic authoring, consists of sentenses and rhetorical relation among sentences: It is represented by a graph, where a node is a sentence and an edge is a rhetorical relation. We simulate creating this content graph by using news paper articles that are annotated rhetorical relations by a GDA tagset. Our summarization method basically uses spreading activation over the content graph, followed by particular postprocesses to increase readability of the resultant summary. Experimental evaluation shows our method is at least equal to or better than Lead method for summarizing news paper articles.

Original languageEnglish
Pages (from-to)149-158
Number of pages10
JournalTransactions of the Japanese Society for Artificial Intelligence
Volume20
Issue number3
DOIs
Publication statusPublished - 2005
Externally publishedYes

Keywords

  • Automatic summarization
  • Rhetorical structure theory
  • Semantic authoring
  • Spreading activation

ASJC Scopus subject areas

  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Summarization of multiple documents with rhetorical annotation'. Together they form a unique fingerprint.

Cite this