Detecting and Browsing Events in Unstructured text
- 11 August 2002
- proceedings article
- Published by Association for Computing Machinery (ACM)
Abstract
Previews and overviews of large, heterogeneous information resources help users comprehend the scope of collections and focus on particular subsets of interest. For narrative documents, questions of "what happened? where? and when?" are natural points of entry. Building on our earlier work at the Perseus Project with detecting terms, place names, and dates, we have exploited co-occurrences of dates and place names to detect and describe likely events in document collections. We compare statistical measures for determining the relative significance of various events. We have built interfaces that help users preview likely regions of interest for a given range of space and time by plotting the distribution and relevance of various collocations. Users can also control the amount of collocation information in each view. Once particular collocations are selected, the system can identify key phrases associated with each possible event to organize browsing of the documents themselves.Keywords
This publication has 11 references indexed in Scilit:
- Evaluating document clustering for interactive information retrievalPublished by Association for Computing Machinery (ACM) ,2001
- Drudgery and deep thoughtCommunications of the ACM, 2001
- Geospatial mapping and navigation of the webPublished by Association for Computing Machinery (ACM) ,2001
- Building a hypertextual digital library in the humanitiesPublished by Association for Computing Machinery (ACM) ,2001
- An evaluation corpus for temporal summarizationPublished by Association for Computational Linguistics (ACL) ,2001
- Automatic generation of overview timelinesPublished by Association for Computing Machinery (ACM) ,2000
- Extracting significant time varying features from textPublished by Association for Computing Machinery (ACM) ,1999
- A study of retrospective and on-line event detectionPublished by Association for Computing Machinery (ACM) ,1998
- Automatic text structuring and summarizationInformation Processing & Management, 1997
- Subtopic structuring for full-length document accessPublished by Association for Computing Machinery (ACM) ,1993