Toward a universal microarray: prediction of gene expression through nearest-neighbor probe sequence identification
Open Access
- 11 July 2007
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 35 (15) , e99
- https://doi.org/10.1093/nar/gkm549
Abstract
A generic DNA microarray design applicable to any species would greatly benefit comparative genomics. We have addressed the feasibility of such a design by leveraging the great feature densities and relatively unbiased nature of genomic tiling microarrays. Specifically, we first divided each Homo sapiens RefSeq-derived gene's spliced nucleotide sequence into all of its possible contiguous 25 nt subsequences. For each of these 25 nt subsequences, we searched a recent human transcript mapping experiment's probe design for the 25 nt probe sequence having the fewest mismatches with the subsequence, but that did not match the subsequence exactly. Signal intensities measured with each gene's nearest-neighbor features were subsequently averaged to predict their gene expression levels in each of the experiment's thirty-three hybridizations. We examined the fidelity of this approach in terms of both sensitivity and specificity for detecting actively transcribed genes, for transcriptional consistency between exons of the same gene, and for reproducibility between tiling array designs. Taken together, our results provide proof-of-principle for probing nucleic acid targets with off-target, nearest-neighbor features.
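The procedure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names are hypothetical, mismatches are counted as a plain Hamming distance on a single strand, ties are broken by taking the first probe encountered, and the search is brute force. The abstract specifies none of these details, so each is an assumption made for illustration.

```python
from statistics import mean

K = 25  # subsequence and probe length stated in the abstract


def kmers(seq, k=K):
    """Return all contiguous k-nt subsequences of seq."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def mismatches(a, b):
    """Count mismatched positions (Hamming distance; an assumed metric)."""
    return sum(x != y for x, y in zip(a, b))


def nearest_neighbor(subseq, probes):
    """Return the probe with the fewest mismatches to subseq,
    excluding any probe that matches subseq exactly."""
    candidates = [p for p in probes
                  if len(p) == len(subseq) and p != subseq]
    if not candidates:
        return None
    # min() keeps the first candidate on ties (assumed tie-breaking rule).
    return min(candidates, key=lambda p: mismatches(subseq, p))


def predict_expression(transcript, intensities, k=K):
    """Average the signal of each subsequence's nearest-neighbor probe.

    intensities maps probe sequence -> measured signal for one hybridization.
    """
    probes = list(intensities)
    signals = []
    for sub in kmers(transcript, k):
        nn = nearest_neighbor(sub, probes)
        if nn is not None:
            signals.append(intensities[nn])
    return mean(signals) if signals else None


# Toy example with k=4 instead of 25 so the result can be checked by hand.
toy_intensities = {"ACGT": 100.0, "ACGA": 10.0, "CGTT": 20.0, "TTTT": 5.0}
toy_prediction = predict_expression("ACGTA", toy_intensities, k=4)
```

In the toy example the exact-match probe "ACGT" is skipped, so the two subsequences "ACGT" and "CGTA" are represented by "ACGA" and "CGTT" respectively. A genome-scale probe set would need an indexed search rather than this linear scan over all probes.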