An Unbiased Estimator of Gene Diversity in Samples Containing Related Individuals
Open Access
- 6 November 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 26 (3) , 501-512
- https://doi.org/10.1093/molbev/msn254
Abstract
Gene diversity is sometimes estimated from samples that contain inbred or related individuals. If inbred or related individuals are included in a sample, then the standard estimator for gene diversity produces a downward bias caused by an inflation of the variance of estimated allele frequencies. We develop an unbiased estimator for gene diversity that relies on kinship coefficients for pairs of individuals with known relationship and that reduces to the standard estimator when all individuals are noninbred and unrelated. Applying our estimator to data simulated based on allele frequencies observed for microsatellite loci in human populations, we find that the new estimator performs favorably compared with the standard estimator in terms of bias and similarly in terms of mean squared error. For human population-genetic data, we find that a close linear relationship previously seen between gene diversity and distance from East Africa is preserved when adjusting for the inclusion of close relatives.Keywords
This publication has 23 references indexed in Scilit:
- Standardized Subsets of the HGDP‐CEPH Human Genome Diversity Cell Line Panel, Accounting for Atypical and Duplicated Samples and Pairs of Close RelativesAnnals of Human Genetics, 2006
- Clines, Clusters, and the Effect of Study Design on the Inference of Human Population StructurePLoS Genetics, 2005
- Novel Case-Control Test in a Founder Population Identifies P-Selectin as an Atopy-Susceptibility LocusAmerican Journal of Human Genetics, 2003
- Detecting recent positive selection in the human genome from haplotype structureNature, 2002
- Genomic Microsatellites as Evolutionary Chronometers: A Test in Wild CatsGenome Research, 2002
- Estimation of allele frequencies with data on sibshipsGenetic Epidemiology, 2001
- Linkage disequilibrium between amino acid sites in immunoglobulin genes and other multigene familiesGenetics Research, 1980
- Analysis of Gene Diversity in Subdivided PopulationsProceedings of the National Academy of Sciences, 1973
- Urbanization, Technology, and the Division of Labor: International PatternsAmerican Sociological Review, 1962
- Measurement of DiversityNature, 1949