Validation of large data sets, an essential prerequisite for data analysis: an analytical survey of the Bone Marrow Donors Worldwide
- 1 March 1996
- journal article
- Published by Wiley in Tissue Antigens
- Vol. 47 (3), 169-178
- https://doi.org/10.1111/j.1399-0039.1996.tb02537.x
Abstract
Large data sets like the Bone Marrow Donors Worldwide (BMDW) data set can be used for population genetic analyses. The qualities of such data sets are unique. To be able to use the BMDW data for analyses, several problems, like limited size and selective DR typing, of the data have to be solved and the quality of the registry data subsets has to be examined. We describe these problems and methods to overcome them. Also, we give an overview of the qualities of the different registry subsets. Sixteen of the twenty‐nine examined subsets contain data that can be used for population genetic analysis. We will deal with these analyses in the future. Additionally, we present a method to calculate the minimum number of individuals required for reliable haplotype frequency estimation.