Robust Identification of Fuzzy Duplicates
- 19 April 2005
- proceedings article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 865-876
- https://doi.org/10.1109/icde.2005.125
Abstract
No abstract availableThis publication has 14 references indexed in Scilit:
- Identification of common molecular subsequencesPublished by Elsevier ,2004
- Searching in metric spaces by spatial approximationThe VLDB Journal, 2002
- Learning domain-independent string transformation weights for high accuracy object identificationPublished by Association for Computing Machinery (ACM) ,2002
- Interactive deduplication using active learningPublished by Association for Computing Machinery (ACM) ,2002
- Eliminating Fuzzy Duplicates in Data WarehousesPublished by Elsevier ,2002
- Learning to match and cluster large high-dimensional data sets for data integrationPublished by Association for Computing Machinery (ACM) ,2002
- Integration of heterogeneous databases without common domains using queries based on textual similarityPublished by Association for Computing Machinery (ACM) ,1998
- Real-world Data is Dirty: Data Cleansing and The Merge/Purge ProblemData Mining and Knowledge Discovery, 1998
- Duplicate record elimination in large data filesACM Transactions on Database Systems, 1983
- A Theory for Record LinkageJournal of the American Statistical Association, 1969