Fusion transcripts and transcribed retrotransposed loci discovered through comprehensive transcriptome analysis using Paired-End diTags (PETs)
- 13 June 2007
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 17 (6) , 828-838
- https://doi.org/10.1101/gr.6018607
Abstract
Identification of unconventional functional features such as fusion transcripts is a challenging task in the effort to annotate all functional DNA elements in the human genome. Paired-End diTag (PET) analysis possesses a unique capability to accurately and efficiently characterize the two ends of DNA fragments, which may have either normal or unusual compositions. This unique nature of PET analysis makes it an ideal tool for uncovering unconventional features residing in the human genome. Using the PET approach for comprehensive transcriptome analysis, we were able to identify fusion transcripts derived from genome rearrangements and actively expressed retrotransposed pseudogenes, which would be difficult to capture by other means. Here, we demonstrate this unique capability through the analysis of 865,000 individual transcripts in two types of cancer cells. In addition to the characterization of a large number of differentially expressed alternative 5′ and 3′ transcript variants and novel transcriptional units, we identified 70 fusion transcript candidates in this study. One was validated as the product of a fusion gene between BCAS4 and BCAS3 resulting from an amplification followed by a translocation event between the two loci, chr20q13 and chr17q23. Through an examination of PETs that mapped to multiple genomic locations, we identified 4055 retrotransposed loci in the human genome, of which at least three were found to be transcriptionally active. The PET mapping strategy presented here promises to be a useful tool in annotating the human genome, especially aberrations in human cancer genomes.Keywords
This publication has 71 references indexed in Scilit:
- Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot projectNature, 2007
- Multiplex sequencing of paired-end ditags (MS-PET): a strategy for the ultra-high-throughput analysis of transcriptomes and genomesNucleic Acids Research, 2006
- Diversification of transcriptional modulation: Large-scale identification and characterization of putative alternative promoters of human genesGenome Research, 2005
- Expression patterns of the LPP–HMGA2 fusion transcript in pulmonary chondroid hamartomas with t(3;12)(q27∼28;q14∼15)Cancer Genetics and Cytogenetics, 2005
- Gene identification signature (GIS) analysis for transcriptome characterization and genome annotationNature Methods, 2005
- Cloning of BCAS3 (17q23) and BCAS4 (20q13) genes that undergo amplification, overexpression, and fusion in breast cancer†Genes, Chromosomes and Cancer, 2002
- BLAT—The BLAST-Like Alignment ToolGenome Research, 2002
- Molecular Fossils in the Human Genome: Identification and Analysis of the Pseudogenes in Chromosomes 21 and 22Genome Research, 2002
- Jumping translocations are common in solid tumor cell lines and result in recurrent fusions of whole chromosome armsGenes, Chromosomes and Cancer, 2001
- Transcription of a human dopamine D5 pseudogeneBiochemical and Biophysical Research Communications, 1991