Reclink: aplicativo para o relacionamento de bases de dados, implementando o método probabilistic record linkage
Open Access
- 1 June 2000
- journal article
- abstracts
- Published by FapUNIFESP (SciELO) in Cadernos de Saude Publica
- Vol. 16 (2) , 439-447
- https://doi.org/10.1590/s0102-311x2000000200014
Abstract
This paper presents a system for database linkage based on the probabilistic record linkage technique, developed in the C++ language with the Borland C++ Builder version 3.0 programming environment. The system was tested in the linkage of data sources of different sizes, evaluated both in terms of processing time and sensitivity for identifying true record pairs. Significantly less time was spent in record processing when the program was used, as compared to manual processing, especially in situations where larger databases were used. Manual and automatic processes had equivalent sensitivities in situations where we used databases with fewer records. However, as the number of records grew we noticed a clear reduction in the sensitivity of the manual process, but not in the automatic one. Although in its initial stage of development, the system performed well in terms of both processing speed and sensitivity. Although overall performance of algorithms was satisfactory, we intend to evaluate other routines in the attempt to improve the system's performance.Keywords
This publication has 5 references indexed in Scilit:
- Development of a Record Linkage Protocol for Use in the Dutch Cancer Registry for Epidemiological ResearchInternational Journal of Epidemiology, 1990
- Advances in Record-Linkage Methodology as Applied to Matching the 1985 Census of Tampa, FloridaJournal of the American Statistical Association, 1989
- Probabilistic methods in matching census samples to the National Death IndexJournal of Chronic Diseases, 1986
- A Theory for Record LinkageJournal of the American Statistical Association, 1969
- Automatic Linkage of Vital RecordsScience, 1959