Comparison of gene coverage of mouse oligonucleotide microarray platforms
Open Access
- 21 March 2006
- journal article
- research article
- Published by Springer Nature in BMC Genomics
- Vol. 7 (1) , 1-14
- https://doi.org/10.1186/1471-2164-7-58
Abstract
Background
The increasing use of DNA microarrays for genetical genomics studies generates a need for platforms with complete coverage of the genome. We have compared the effective gene coverage in the mouse genome of different commercial and noncommercial oligonucleotide microarray platforms by performing an in-house gene annotation of probes. We used only the probe information that is available from vendors and followed a process that any researcher could follow to find the gene targeted by a given probe. To make consistent comparisons between platforms, probes in each microarray were annotated with an Entrez Gene ID, and the chromosomal position of each gene was obtained from the UCSC Genome Browser Database. Gene coverage was estimated as the percentage of Entrez Genes with a unique position in the UCSC Genome Browser Database that are tested by a given microarray platform.
Results
A MySQL relational database was created to store the mapping information for 25,416 mouse genes and for the probes in five microarray platforms (gene coverage level in parentheses): Affymetrix430 2.0 (75.6%), ABI Genome Survey (81.24%), Agilent (79.33%), Codelink (78.09%), and Sentrix (90.47%); and four array-ready oligosets: Sigma (47.95%), Operon v.3 (69.89%), Operon v.4 (84.03%), and MEEBO (84.03%). The differences in coverage between platforms were highly conserved across chromosomes. Differences in the number of redundant and unspecific probes were also found among arrays. The database can be queried through a web interface to compare specific genomic regions. The software used to create, update and query the database is freely available as a toolbox named ArrayGene.
Conclusion
The software developed here allows researchers to create updated custom databases by using public or proprietary information on genes for any organism. ArrayGene allows easy comparisons of gene coverage between microarray platforms for any region of the genome. The comparison presented here reveals that the commercial microarray Sentrix, which is based on the MEEBO public oligoset, showed the best mouse genome coverage currently available. We also suggest the creation of guidelines to standardize the minimum set of information that vendors should provide to allow researchers to accurately evaluate the advantages and disadvantages of using a given platform.
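The coverage measure described in the abstract can be illustrated with a minimal Python sketch. It is not the ArrayGene implementation, which the abstract describes only as a MySQL-backed toolbox. The function names, the in-memory dictionaries, and the toy identifiers are all hypothetical. The sketch also assumes that a "redundant" probe is one whose target gene is targeted by at least one other probe, and that an "unspecific" probe is one annotated with more than one gene; the abstract does not define either term.

```python
from collections import Counter


def covered_genes(genes, probe_to_genes):
    """Return the reference genes targeted by at least one probe.

    genes: dict mapping gene ID -> chromosome, restricted to genes with a
           unique genomic position (the denominator of the coverage measure).
    probe_to_genes: dict mapping probe ID -> set of annotated gene IDs.
    """
    hits = set()
    for gene_ids in probe_to_genes.values():
        hits.update(gene_ids)
    # Probes annotated with genes outside the reference set do not count.
    return hits & set(genes)


def gene_coverage(genes, probe_to_genes):
    """Percentage of reference genes tested by the platform."""
    return 100.0 * len(covered_genes(genes, probe_to_genes)) / len(genes)


def coverage_by_chromosome(genes, probe_to_genes):
    """Percentage of reference genes tested, computed per chromosome."""
    covered = covered_genes(genes, probe_to_genes)
    total = Counter(genes.values())
    hit = Counter(genes[g] for g in covered)
    return {chrom: 100.0 * hit[chrom] / n for chrom, n in total.items()}


def count_redundant_and_unspecific(probe_to_genes):
    """Count probes under the assumed definitions stated in the lead-in."""
    probes_per_gene = Counter(
        g for gene_ids in probe_to_genes.values() for g in gene_ids
    )
    redundant = sum(
        1
        for gene_ids in probe_to_genes.values()
        if any(probes_per_gene[g] > 1 for g in gene_ids)
    )
    unspecific = sum(1 for gene_ids in probe_to_genes.values() if len(gene_ids) > 1)
    return redundant, unspecific


# Toy data (hypothetical identifiers, not taken from the study).
genes = {"g1": "chr1", "g2": "chr1", "g3": "chr2", "g4": "chrX"}
probes = {
    "p1": {"g1"},
    "p2": {"g1"},        # second probe for g1: p1 and p2 are redundant
    "p3": {"g2", "g3"},  # annotated with two genes: unspecific
    "p4": {"g9"},        # g9 has no unique position, so it is ignored
}

overall = gene_coverage(genes, probes)
per_chromosome = coverage_by_chromosome(genes, probes)
redundant, unspecific = count_redundant_and_unspecific(probes)
```

With the toy data, three of the four reference genes are targeted, so the overall coverage is 75%. Restricting the denominator to genes with a unique position mirrors the definition given in the Background; a real comparison would load the gene and probe tables from the database rather than from literals.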