Summarizing Scientific Articles: Experiments with Relevance and Rhetorical Status
Top Cited Papers
- 1 December 2002
- journal article
- Published by MIT Press in Computational Linguistics
- Vol. 28 (4) , 409-445
- https://doi.org/10.1162/089120102762671936
Abstract
In this article we propose a strategy for the summarization of scientific articles that concentrates on the rhetorical status of statements in an article: Material for summaries is selected in such a way that summaries can highlight the new contribution of the source article and situate it with respect to earlier work.We provide a gold standard for summaries of this kind consisting of a substantial corpus of conference articles in computational linguistics annotated with human judgments of the rhetorical status and relevance of each sentence in the articles. We present several experiments measuring our judges' agreement on these annotations.We also present an algorithm that, on the basis of the annotated training material, selects content from unseen articles and classifies it into a fixed set of seven rhetorical categories. The output of this extraction and classification system can be viewed as a single-document summary in its own right; alternatively, it provides starting material for the generation of task-oriented and user-tailored summaries designed to give users an overview of a scientific field.Keywords
This publication has 20 references indexed in Scilit:
- Digital libraries and autonomous citation indexingComputer, 1999
- Persuasion and context: The pragmatics of academic metadiscourseJournal of Pragmatics, 1998
- Relevance: The whole historyJournal of the American Society for Information Science, 1997
- Automatic condensation of electronic publications by sentence selectionInformation Processing & Management, 1995
- ‘In this paper we report …'’: Speech acts and scientific factsJournal of Pragmatics, 1992
- The discourse-level structure of empirical abstracts: an exploratory studyInformation Processing & Management, 1991
- New Methods in Automatic ExtractingJournal of the ACM, 1969
- Automatic abstracting and indexing—survey and recommendationsCommunications of the ACM, 1961
- Machine-Made Index for Technical Literature—An ExperimentIBM Journal of Research and Development, 1958
- The Automatic Creation of Literature AbstractsIBM Journal of Research and Development, 1958