A strategy for assembling samples of adult twin pairs in the United States
- 30 September 1993
- journal article
- research article
- Published by Wiley in Statistics in Medicine
- Vol. 12 (18) , 1693-1702
- https://doi.org/10.1002/sim.4780121805
Abstract
In this paper we develop a methodology for the identification of large numbers of U.S. adult twin pairs. Data for this study derive from the U.S. Department of Defense and the Vietnam Era Twin )VET( Registry. The Department of Defense identified potential male twins )n = 10,002( using a computerized record linkage algorithm based on the same last name, same date of birth, and the same first five digits of the Social Security number. Twinship was confirmed by comparison with the Vietnam Era Twin Registry. We developed a logistic regression model that predicts the probability that a paired record identifies twins based on the absolute difference in the last four digits in the Social Security number, the age of issuance of the Social Security number, and the frequency of occurrence of the last name. We used the estimated coefficients derived from this regression model to assign predicted probabilities of being a twin to each matched record. There is a close correspondence between the observed and expected number of twins when evaluated across deciles of predicted probabilities of being a twin; the value of the Harrell's c index )c = 0·68 ∓ 0·0004( indicates the overall predictive accuracy of the regression equation. The results from this study demonstrate the feasibility of identifying adult male–male twin pairs from any large computerized database that contains name, date of birth and Social Security number. However, the selection criteria used in the creation of the computer database must be clearly specified to avoid constructing a biased sample of twins.Keywords
This publication has 18 references indexed in Scilit:
- A revised estimate of twin concordance in systemic lupus erythematosusArthritis & Rheumatism, 1992
- Using multidimensional scaling on data from pairs of relatives to explore the dimensionality of categorical multifactorial traitsGenetic Epidemiology, 1992
- A Genetic Study of Male Sexual OrientationArchives of General Psychiatry, 1991
- High risk of HIV-1 infection for first-born twinsThe Lancet, 1991
- Heterogeneity in the Inheritance of AlcoholismArchives of General Psychiatry, 1991
- The Minnesota Twin Family Registry: Some Initial FindingsActa geneticae medicae et gemellologiae: twin research, 1990
- THE DEVELOPMENT OF A NEW ZEALAND TWIN REGISTERCommunity Health Studies, 1986
- Differential Enrollment in Twin Registries: Its Effect on Prevalence and Concordance Rates and Estimates of Genetic ParametersActa geneticae medicae et gemellologiae: twin research, 1985
- Genetic covariation between neuroticism and the symptoms of anxiety and depressionGenetic Epidemiology, 1984
- A METHOD FOR ESTIMATING YEAR OF BIRTH USING SOCIAL SECURITY NUMBERAmerican Journal of Epidemiology, 1983