From rhetorical structures to document structure: Shallow pragmatic analysis for document engineering

Gersende Georg*, Hugo Hernault, Marc Cavazza, Helmut Prendinger, Mitsuru Ishizuka

*この研究の対応する著者

研究成果: Conference contribution

9 被引用数 (Scopus)

抄録

In this paper, we extend previous work on the automatic structuring of medical documents using content analysis. Our long-term objective is to take advantage of specific rhetoric markers encountered in specialized medical documents (clinical guidelines) to automatically structure free text according to its role in the document. This should enable to generate multiple views of the same document depending on the target audience, generate document summaries, as well as facilitating knowledge extraction from text. We have established in previous work that the structure of clinical guidelines could be refined through the identification of a limited set of deontic operators. We now propose to extend this approach by analyzing the text delimited by these operators using Rhetorical Structure Theory. The emphasis on causality and time in RST proves a powerful complement to the recognition of deontic structures while retaining the same philosophy of high-level recognition of sentence structure, which can be converted into applicationspecific mark-ups. Throughout the paper, we illustrate our findings through results produced by the automatic processing of English guidelines for the management of hypertension and Alzheimer disease.

本文言語English
ホスト出版物のタイトルDocEng'09 - Proceedings of the 2009 ACM Symposium on Document Engineering
ページ185-192
ページ数8
DOI
出版ステータスPublished - 2009
外部発表はい
イベント9th ACM Symposium on Document Engineering, DocEng'09 - Munich
継続期間: 2009 9月 152009 9月 18

Other

Other9th ACM Symposium on Document Engineering, DocEng'09
CityMunich
Period09/9/1509/9/18

ASJC Scopus subject areas

  • コンピュータ サイエンスの応用
  • ソフトウェア

引用スタイル