Experiments on the automatic construction of hypertext from texts
- 1 January 1995
- journal article
- research article
- Published by Taylor & Francis in New Review of Hypermedia and Multimedia
- Vol. 1 (1) , 23-39
- https://doi.org/10.1080/13614569508914659
Abstract
The problem of (semi-)automatically turning text into hypertext is one that has been identified as important to the growth and development of hypertext as a way of organising information. In this paper we describe an approach we have developed to semi-automatically generate a hypertext from linear texts. This is based on initially creating nodes and composite nodes composed of ‘mini-hypertexts’. Following this we then compute node-node similarity values using standard information retrieval techniques. These similarity measures are then used to selectively create node-node links based on the strength of similarity between nodes. What makes our process novel is that the link creation process also uses values from a dynamically computed metric which measures the topological compactness of the overall hypertext being generated. Thus link creation is a selective process based not only on node-node similarity but also on the overall layout of the hypertext. Experiments on generating a hypertext from a collection of 846 software product descriptions comprising 8.5 Mbytes of text are described. Our experiments with a variety of IR techniques and link creation approaches yield some guidelines on how the process should be automated. Finally, this text to hypertext conversion method is put into the context of an overall hypertext authoring tool currently under development.Keywords
This publication has 14 references indexed in Scilit:
- Automatic Analysis, Theme Generation, and Summarization of Machine-Readable TextsScience, 1994
- Information retrieval from hypertext using dynamically planned guided toursPublished by Association for Computing Machinery (ACM) ,1993
- Making use of hypertext links when retrieving informationPublished by Association for Computing Machinery (ACM) ,1993
- Information filtering and information retrievalCommunications of the ACM, 1992
- Converting a textbook to hypertextACM Transactions on Information Systems, 1992
- Progress in the Application of Natural Language Processing to Information Retrieval TasksThe Computer Journal, 1992
- Structural analysis of hypertextsACM Transactions on Information Systems, 1992
- Searching for information in a hypertext medical handbookCommunications of the ACM, 1988
- Reflections on NoteCards: seven issues for the next generation of hypermedia systemsCommunications of the ACM, 1988
- Hypertext and the Oxford English dictionaryCommunications of the ACM, 1988