Validation of a Hierarchical Deterministic Record-Linkage Algorithm Using Data From 2 Different Cohorts of Human Immunodeficiency Virus-Infected Persons and Mortality Databases in Brazil
Open Access
- 9 October 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in American Journal of Epidemiology
- Vol. 168 (11), 1326-1332
- https://doi.org/10.1093/aje/kwn249
Abstract
Loss to follow-up is a major source of bias in cohorts of patients with human immunodeficiency virus (HIV) and could lead to underestimation of mortality. The authors developed a hierarchical deterministic linkage algorithm to be used primarily with cohorts of HIV-infected persons to recover vital status information for patients lost to follow-up. Data from patients known to be deceased in 2 cohorts in Rio de Janeiro, Brazil, and data from the Rio de Janeiro State mortality database for 1999–2006 were used to validate the algorithm. A fully automated procedure yielded a sensitivity of 92.9% and specificity of 100% when no information was missing. When the automated procedure was combined with clerical review, in a scenario of 5% death prevalence and 20% missing mothers’ names, sensitivity reached 96.5% and specificity 100%. In a practical application, the algorithm significantly increased death rates and decreased the rate of loss to follow-up in the cohorts. The finding that 23.9% of matched records did not give HIV or acquired immunodeficiency syndrome as the cause of death reinforces the need to search all-cause mortality databases and alerts for possible underestimation of death rates. These results indicate that the algorithm is accurate enough to recover vital status information on patients lost to follow-up in cohort studies.
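The abstract does not spell out the matching rules, so the following is only a minimal sketch of what a hierarchical deterministic linkage could look like, together with the sensitivity and specificity calculation used to validate it. The `Person` fields, the `RULES` hierarchy, and the helper names `norm`, `link`, and `sensitivity_specificity` are all assumptions made for illustration and are not taken from the article.

```python
import unicodedata
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Person:
    # Hypothetical identifiers; the article's actual fields may differ.
    name: str
    mother_name: Optional[str]  # may be missing in either database
    birth_date: str             # ISO format "YYYY-MM-DD"


def norm(text):
    """Uppercase, strip accents, collapse whitespace; None stays None."""
    if text is None:
        return None
    decomposed = unicodedata.normalize("NFKD", text)
    no_accents = "".join(c for c in decomposed if not unicodedata.combining(c))
    return " ".join(no_accents.upper().split())


# Assumed hierarchy: strictest key first, progressively looser keys after.
RULES = [
    ("name+mother+birth",
     lambda p: (norm(p.name), norm(p.mother_name), p.birth_date)),
    ("name+birth", lambda p: (norm(p.name), p.birth_date)),
    ("name+mother", lambda p: (norm(p.name), norm(p.mother_name))),
]


def link(patient, deaths):
    """Return (rule_label, death_record) for the first rule that yields
    exactly one exact match, or None if no rule does."""
    for label, key in RULES:
        target = key(patient)
        if None in target:
            continue  # a missing field disables this rule for this patient
        hits = [d for d in deaths if key(d) == target]
        if len(hits) == 1:
            return label, hits[0]
    return None


def sensitivity_specificity(predicted, truth):
    """Compare linkage decisions (True = linked to a death record) with
    known vital status (True = deceased)."""
    pairs = list(zip(predicted, truth))
    tp = sum(1 for p, t in pairs if p and t)
    fn = sum(1 for p, t in pairs if not p and t)
    tn = sum(1 for p, t in pairs if not p and not t)
    fp = sum(1 for p, t in pairs if p and not t)
    sens = tp / (tp + fn) if tp + fn else float("nan")
    spec = tn / (tn + fp) if tn + fp else float("nan")
    return sens, spec
```

In this sketch, ambiguous results (more than one hit for a rule) fall through to the next rule and are otherwise left unlinked; in a workflow like the one the abstract describes, such cases would be natural candidates for clerical review.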