Reconciling schemas of disparate data sources
Top Cited Papers
- 1 May 2001
- proceedings article
- Published by Association for Computing Machinery (ACM)
- Vol. 30 (2) , 509-520
- https://doi.org/10.1145/375663.375731
Abstract
A data-integration system provides access to a multitude of data sources through a single mediated schema. A key bottleneck in building such systems has been the laborious manual construction of semantic mappings between the source schemas and the mediated schema. We describe LSD, a system that employs and extends current machine-learning techniques to semi-automatically find such mappings. LSD first asks the user to provide the semantic mappings for a small set of data sources, then uses these ...Keywords
This publication has 10 references indexed in Scilit:
- A classifier for semi-structured documentsPublished by Association for Computing Machinery (ACM) ,2000
- SEMINT: A tool for identifying attribute correspondences in heterogeneous databases using neural networksData & Knowledge Engineering, 2000
- Wrapper induction: Efficiency and expressivenessArtificial Intelligence, 2000
- An adaptive query execution system for data integrationPublished by Association for Computing Machinery (ACM) ,1999
- Issues in Stacked GeneralizationJournal of Artificial Intelligence Research, 1999
- Scaling access to heterogeneous data sources with DISCOIEEE Transactions on Knowledge and Data Engineering, 1998
- The TSIMMIS Approach to Mediation: Data Models and LanguagesJournal of Intelligent Information Systems, 1997
- On the Optimality of the Simple Bayesian Classifier under Zero-One LossMachine Learning, 1997
- Stacked generalizationNeural Networks, 1992
- Correction to "A Formal Basis for the Heuristic Determination of Minimum Cost Paths"ACM SIGART Bulletin, 1972