Information integration
- 1 September 1998
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Intelligent Systems and their Applications
- Vol. 13 (5) , 12-24
- https://doi.org/10.1109/5254.722342
Abstract
Despite the Web's current disorganized and anarchic state, many AI researchers believe that it will become the world's largest knowledge base. We examine a line of research whose final goal is to make disparate data sources work together to better serve users' information needs. This work is known as information integration. The authors talk about its application to datasets made available over the Web. A. Levy discusses the relationship between information-integration and traditional database systems. He then enumerates important issues in the field and demonstrates how the Information Manifold project has addressed some of these. C. Knoblock and S. Minton describe the Ariadne system. Two of its distinguishing features are its use of wrapper algorithms to extract structured information from semistructured data sources and its use of planning algorithms to determine how to integrate information efficiently and effectively across sources. W. Cohen describes an interesting variation on the theme, focusing on "informal" information integration. The idea is that, as in related fields that deal with uncertain and incomplete information, an information-integration system should be allowed to take chances and make mistakes. His Whirl system uses information-retrieval algorithms to find approximate matches between different databases, and as a consequence knits together data from quite diverse sources.Keywords
This publication has 14 references indexed in Scilit:
- Database techniques for the World-Wide WebACM SIGMOD Record, 1998
- Catching the boat with StrudelPublished by Association for Computing Machinery (ACM) ,1998
- Integration of heterogeneous databases without common domains using queries based on textual similarityPublished by Association for Computing Machinery (ACM) ,1998
- A Web-based information system that reasons with structured collections of textPublished by Association for Computing Machinery (ACM) ,1998
- InfomasterPublished by Association for Computing Machinery (ACM) ,1997
- A scalable comparison-shopping agent for the World-Wide WebPublished by Association for Computing Machinery (ACM) ,1997
- Customizing information capture and accessACM Transactions on Information Systems, 1997
- Answering queries using limited external query processors (extended abstract)Published by Association for Computing Machinery (ACM) ,1996
- Query reformulation for dynamic information integrationJournal of Intelligent Information Systems, 1996
- A softbot-based interface to the InternetCommunications of the ACM, 1994