Next-generation tag sequencing for cancer gene expression profiling
Open Access
- 18 June 2009
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 19 (10), 1825-1835
- https://doi.org/10.1101/gr.094482.109
Abstract
We describe a new method, Tag-seq, which employs ultra high-throughput sequencing of 21 base pair cDNA tags for sensitive and cost-effective gene expression profiling. We compared Tag-seq data to LongSAGE data and observed improved representation of several classes of rare transcripts, including transcription factors, antisense transcripts, and intronic sequences, the latter possibly representing novel exons or genes. We observed increases in the diversity, abundance, and dynamic range of such rare transcripts and took advantage of the greater dynamic range of expression to identify, in cancers and normal libraries, altered expression ratios of alternative transcript isoforms. The strand-specific information of Tag-seq reads further allowed us to detect altered expression ratios of sense and antisense (S-AS) transcripts between cancer and normal libraries. S-AS transcripts were enriched in known cancer genes, while transcript isoforms were enriched in miRNA targeting sites. We found that transcript abundance had a stronger GC-bias in LongSAGE than Tag-seq, such that AT-rich tags were less abundant than GC-rich tags in LongSAGE. Tag-seq also performed better in gene discovery, identifying >98% of genes detected by LongSAGE and profiling a distinct subset of the transcriptome characterized by AT-rich genes, which was expressed at levels below those detectable by LongSAGE. Overall, Tag-seq is sensitive to rare transcripts, has less sequence composition bias relative to LongSAGE, and allows differential expression analysis for a greater range of transcripts, including transcripts encoding important regulatory molecules.