Authoritative sources in a hyperlinked environment
- 1 September 1999
- journal article
- Published by Association for Computing Machinery (ACM) in Journal of the ACM
- Vol. 46 (5) , 604-632
- https://doi.org/10.1145/324133.324140
Abstract
The network structure of a hyperlinked environment can be a rich source of information about the content of the environment, provided we have effective means for understanding it. We develop a set of algorithmic tools for extracting information from the link structures of such environments, and report on experiments that demonstrate their effectiveness in a variety of context on the World Wide Web. The central issue we address within our framework is the distillation of broad search topics, through the discovery of “authorative” information sources on such topics. We propose and test an algorithmic formulation of the notion of authority, based on the relationship between a set of relevant authoritative pages and the set of “hub pages” that join them together in the link structure. Our formulation has connections to the eigenvectors of certain matrices associated with the link graph; these connections in turn motivate additional heuristrics for link-based analysis.Keywords
This publication has 33 references indexed in Scilit:
- Automatic resource compilation by analyzing hyperlink structure and associated textComputer Networks and ISDN Systems, 1998
- The World-Wide WebCommunications of the ACM, 1994
- Structural analysis of hypertextsACM Transactions on Information Systems, 1992
- Indexing by latent semantic analysisJournal of the American Society for Information Science, 1990
- Searching for information in a hypertext medical handbookCommunications of the ACM, 1988
- Co‐citation analysis and the invisible collegeJournal of the American Society for Information Science, 1984
- The Structure of Scientific Literatures I: Identifying and Graphing SpecialtiesScience Studies, 1974
- Co‐citation in the scientific literature: A new measure of the relationship between two documentsJournal of the American Society for Information Science, 1973
- An Input-Output Approach to Clique IdentificationSociometry, 1965
- Analysis of a complex of statistical variables into principal components.Journal of Educational Psychology, 1933