Development and production of an oligonucleotide MuscleChip: use for validation of ambiguous ESTs
Open Access
- 1 January 2002
- journal article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 3 (1) , 33
- https://doi.org/10.1186/1471-2105-3-33
Abstract
We describe the development, validation, and use of a highly redundant 120,000 oligonucleotide microarray (MuscleChip) containing 4,601 probe sets representing 1,150 known genes expressed in muscle and 2,075 EST clusters from a non-normalized subtracted muscle EST sequencing project (28,074 EST sequences). This set included 369 novel EST clusters showing no match to previously characterized proteins in any database. Each probe set was designed to contain 20–32 25 mer oligonucleotides (10–16 paired perfect match and mismatch probe pairs per gene), with each probe evaluated for hybridization kinetics (Tm) and similarity to other sequences. The 120,000 oligonucleotides were synthesized by photolithography and light-activated chemistry on each microarray. Hybridization of human muscle cRNAs to this MuscleChip (33 samples) showed a correlation of 0.6 between the number of ESTs sequenced in each cluster and hybridization intensity. Out of 369 novel EST clusters not showing any similarity to previously characterized proteins, we focused on 250 EST clusters that were represented by robust probe sets on the MuscleChip fulfilling all stringent rules. 102 (41%) were found to be consistently "present" by analysis of hybridization to human muscle RNA, of which 40 ESTs (39%) could be genome anchored to potential transcription units in the human genome sequence. 19 ESTs of the 40 ESTs were furthermore computer-predicted as exons by one or more than three gene identification algorithms. Our analysis found 40 transcriptionally validated, genome-anchored novel EST clusters to be expressed in human muscle. As most of these ESTs were low copy clusters (duplex and triplex) in the original 28,000 EST project, the identification of these as significantly expressed is a robust validation of the transcript units that permits subsequent focus on the novel proteins encoded by these genes.Keywords
This publication has 13 references indexed in Scilit:
- The Human Genome Browser at UCSCGenome Research, 2002
- Sources of variability and effect of experimental approach on expression profiling data interpretationBMC Bioinformatics, 2002
- When the chips are downNature, 2001
- An optimized protocol for analysis of EST sequencesNucleic Acids Research, 2000
- Expression profiling using cDNA microarraysNature Genetics, 1999
- High density synthetic oligonucleotide arraysNature Genetics, 1999
- Making and reading microarraysNature Genetics, 1999
- Molecular interactions on microarraysNature Genetics, 1999
- Expression monitoring by hybridization to high-density oligonucleotide arraysNature Biotechnology, 1996
- Identification of 4370 expressed sequence tags from a 3'-end-specific cDNA library of human skeletal muscle by DNA sequencing and filter hybridization.Genome Research, 1996