Generic summaries for indexing in information retrieval

Tetsuya Sakai, Karen Sparck-Jones

Research output: Contribution to journalConference articlepeer-review

54 Citations (Scopus)

Abstract

This paper examines the use of generic summaries for indexing in information retrieval. Our main observations are that: (1) With or without pseudo-relevance feedback, a summary index may be as effective as the corresponding fulltext index for precision-oriented search of highly relevant documents. But a reasonably sophisticated summarizer, using a compression ratio of 10-30%, is desirable for this purpose. (2) In pseudo-relevance feedback, using a summary index at initial search and a fulltext index at final search is possibly effective for precision-oriented search, regardless of relevance levels. This strategy is significantly more effective than the one using the summary index only and probably more effective than using summaries as mere term selection filters. For this strategy, the summary quality is probably not a critical factor, and a compression ratio of 5-10% appears best.

Original languageEnglish
Pages (from-to)190-198
Number of pages9
JournalSIGIR Forum (ACM Special Interest Group on Information Retrieval)
DOIs
Publication statusPublished - 2001
Externally publishedYes
Event24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval - New Orleans, LA, United States
Duration: 2001 Sept 92001 Sept 13

ASJC Scopus subject areas

  • Management Information Systems
  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'Generic summaries for indexing in information retrieval'. Together they form a unique fingerprint.

Cite this