Query-biased summarization considering difference of paragraphs

Chikara Otani*, Yasushi Oda, Osamu Yoshie

*この研究の対応する著者

研究成果: Conference contribution

抄録

Most conventional query-biased summarization methods generate the summary using extracted sentences based on similarity measure between all sentences in a document and the query. If there are plural sentences having high similarity to the query in the document, these methods cannot decide the sentence which the summary should be from. This paper proposes an algorithm adopting new indicator that shows the difference between one paragraph and the others. In a word space which is composed of all words in the target document, the algorithm determines the axis that maximizes the difference when a paragraph and the others are projected onto it. There are many combinations of a paragraph and a set of other paragraphs. For each combination, the above-mentioned axis that maximizes the difference and gives a conformity degree to the given query is calculated. With these conformities, the algorithm decides one paragraph for generating the summary. To obtain the axis, topic distinctiveness factor analysis is applied. The basic idea for making final summary is concatenating the sentences extracted from the paragraph. The resultant summary is evaluated from the following points of view: readability, understandability and the easiness to judge whether the link works well or not.

本文言語English
ホスト出版物のタイトルiiWAS2010 - 12th International Conference on Information Integration and Web-Based Applications and Services
ページ535-541
ページ数7
DOI
出版ステータスPublished - 2010
イベント12th International Conference on Information Integration and Web-Based Applications and Services, iiWAS2010 - Paris, France
継続期間: 2010 11月 82010 11月 10

出版物シリーズ

名前iiWAS2010 - 12th International Conference on Information Integration and Web-Based Applications and Services

Conference

Conference12th International Conference on Information Integration and Web-Based Applications and Services, iiWAS2010
国/地域France
CityParis
Period10/11/810/11/10

ASJC Scopus subject areas

  • コンピュータ ネットワークおよび通信
  • コンピュータ サイエンスの応用
  • 情報システム

フィンガープリント

「Query-biased summarization considering difference of paragraphs」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル