Sequence-based estimation of minisatellite and microsatellite repeat variability
Open Access
- 31 October 2007
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 17 (12) , 1787-1796
- https://doi.org/10.1101/gr.6554007
Abstract
Variable tandem repeats are frequently used for genetic mapping, genotyping, and forensics studies. Moreover, variation in some repeats underlies rapidly evolving traits or certain diseases. However, mutation rates vary greatly from repeat to repeat, and as a consequence, not all tandem repeats are suitable genetic markers or interesting unstable genetic modules. We developed a model, “SERV,” that predicts the variability of a broad range of tandem repeats in a wide range of organisms. The nonlinear model uses three basic characteristics of the repeat (number of repeated units, unit length, and purity) to produce a numeric “VARscore” that correlates with repeat variability. SERV was experimentally validated using a large set of different artificial repeats located in the Saccharomyces cerevisiae URA3 gene. Further in silico analysis shows that SERV outperforms existing models and accurately predicts repeat variability in bacteria and eukaryotes, including plants and humans. Using SERV, we demonstrate significant enrichment of variable repeats within human genes involved in transcriptional regulation, chromatin remodeling, morphogenesis, and neurogenesis. Moreover, SERV allows identification of known and candidate genes involved in repeat-based diseases. In addition, we demonstrate the use of SERV for the selection and comparison of suitable variable repeats for genotyping and forensic purposes. Our analysis indicates that tandem repeats used for genotyping should have a VARscore between 1 and 3. SERV is publicly available from http://hulsweb1.cgr.harvard.edu/SERV/.
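The abstract names the three inputs to SERV (number of repeated units, unit length, and purity) but does not give the nonlinear formula that turns them into a VARscore. The Python sketch below therefore computes only those three inputs for a single repeat and applies the suggested genotyping window of 1 to 3 to a VARscore obtained elsewhere. It is an illustration under stated assumptions, not SERV's implementation: the names `RepeatFeatures`, `repeat_features`, and `suitable_for_genotyping` are hypothetical, purity is assumed here to be the fraction of positions matching a perfect repetition of the given unit, and the window bounds are assumed to be inclusive.

```python
from dataclasses import dataclass


@dataclass
class RepeatFeatures:
    unit_length: int   # length of the repeated unit in nucleotides
    num_units: float   # repeat length divided by unit length
    purity: float      # fraction of positions matching a perfect repeat (assumed definition)


def repeat_features(sequence: str, unit: str) -> RepeatFeatures:
    """Compute the three repeat characteristics named in the abstract.

    Assumption: purity is measured by position-wise comparison against a
    perfect tandem repetition of `unit`; SERV may define it differently.
    """
    if not sequence or not unit:
        raise ValueError("sequence and unit must be non-empty")
    seq = sequence.upper()
    unit = unit.upper()
    # Build a perfect repeat of the same length as the observed sequence.
    perfect = (unit * (len(seq) // len(unit) + 1))[: len(seq)]
    matches = sum(a == b for a, b in zip(seq, perfect))
    return RepeatFeatures(
        unit_length=len(unit),
        num_units=len(seq) / len(unit),
        purity=matches / len(seq),
    )


def suitable_for_genotyping(varscore: float, low: float = 1.0, high: float = 3.0) -> bool:
    """Check a VARscore against the 1 to 3 window suggested in the abstract.

    Assumption: both bounds are treated as inclusive.
    """
    return low <= varscore <= high


if __name__ == "__main__":
    # A CAG repeat with one substituted base (G -> T in the third unit).
    print(repeat_features("CAGCAGCATCAG", "CAG"))
    # The VARscore itself would come from SERV; 2.0 is a made-up value.
    print(suitable_for_genotyping(2.0))
```

Position-wise comparison keeps the sketch short but does not handle insertions or deletions inside the repeat; an alignment-based purity measure would be needed for those cases.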