Estimating the Degree of Saturation in Mutant Screens
Open Access
- 1 September 2004
- journal article
- research article
- Published by Oxford University Press (OUP) in Genetics
- Vol. 168 (1) , 489-502
- https://doi.org/10.1534/genetics.103.024430
Abstract
Large-scale screens for loss-of-function mutants have played a significant role in recent advances in developmental biology and other fields. In such mutant screens, it is desirable to estimate the degree of “saturation” of the screen (i.e., what fraction of the possible target genes has been identified). We applied Bayesian and maximum-likelihood methods for estimating the number of loci remaining undetected in large-scale screens and produced credibility intervals to assess the uncertainty of these estimates. Since different loci may mutate to alleles with detectable phenotypes at different rates, we also incorporated variation in the degree of mutability among genes, using either gamma-distributed mutation rates or multiple discrete mutation rate classes. We examined eight published data sets from large-scale mutant screens and found that credibility intervals are much broader than implied by previous assumptions about the degree of saturation of screens. The likelihood methods presented here are a significantly better fit to data from published experiments than estimates based on the Poisson distribution, which implicitly assumes a single mutation rate for all loci. The results are reasonably robust to different models of variation in the mutability of genes. We tested our methods against mutant allele data from a region of the Drosophila melanogaster genome for which there is an independent genomics-based estimate of the number of undetected loci and found that the number of such loci falls within the predicted credibility interval for our models. The methods we have developed may also be useful for estimating the degree of saturation in other types of genetic screens in addition to classical screens for simple loss-of-function mutants, including genetic modifier screens and screens for protein-protein interactions using the yeast two-hybrid method.Keywords
This publication has 22 references indexed in Scilit:
- The FlyBase database of the Drosophila genome projects and community literatureNucleic Acids Research, 2003
- Bayesian Inference for the Negative Binomial Distribution via Polynomial ExpansionsJournal of Computational and Graphical Statistics, 2002
- The Genome Sequence of Drosophila melanogasterScience, 2000
- Coevolving protein residues: maximum likelihood identification and relationship to structure 1 1Edited by G. Von HeijneJournal of Molecular Biology, 1999
- Genetic dissection of trichome cell development in ArabidopsisCell, 1994
- Mutations affecting body organization in the Arabidopsis embryoNature, 1991
- Mutations affecting the pattern of the larval cuticle inDrosophila melanogasterWilhelm Roux' Archiv für Entwicklungsmechanik der Organismen, 1984
- Mutations affecting segment number and polarity in DrosophilaNature, 1980
- The estimation of the number of mutationally silent loci in saturation-mapping experimentsGenetics Research, 1980
- Monte Carlo sampling methods using Markov chains and their applicationsBiometrika, 1970