The impact of SNPs on the interpretation of SAGE and MPSS experimental data
Open Access
- 1 January 2004
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 32 (20) , 6104-6110
- https://doi.org/10.1093/nar/gkh937
Abstract
Serial Analysis of Gene Expression (SAGE) and Massively Parallel Signature Sequencing (MPSS) are powerful techniques for gene expression analysis. A crucial step in analyzing SAGE and MPSS data is the assignment of experimentally obtained tags to a known transcript. However, tag to transcript assignment is not a straightforward process since alternative tags for a given transcript can also be experimentally obtained. Here, we have evaluated the impact of Single Nucleotide Polymorphisms (SNPs) on the generation of alternative SAGE and MPSS tags. This was achieved through the construction of a reference database of SNP-associated alternative tags, which has been integrated with SAGE Genie. A total of 2020 SNP-associated alternative tags were catalogued in our reference database and at least one SNP-associated alternative tag was observed for ∼8.6% of all known human genes. A significant fraction (61.9%) of these alternative tags matched a list of experimentally obtained tags, validating their existence. In addition, the origin of four out of five SNP-associated alternative MPSS tags was experimentally confirmed through the use of the GLGI-MPSS protocol (Generation of Long cDNA fragments for Gene Identification). The availability of our SNP-associated alternative tag database will certainly improve the interpretation of SAGE and MPSS experiments.Keywords
This publication has 18 references indexed in Scilit:
- Detection and evaluation of intron retention events in the human transcriptomeRNA, 2004
- Allele-specific gene expression uncoveredTrends in Genetics, 2004
- Computational Analysis of Gene Identification with SAGEJournal of Computational Biology, 2002
- A map of human genome sequence variation containing 1.42 million single nucleotide polymorphismsNature, 2001
- Serial analysis of gene expression: from gene discovery to target identificationDrug Discovery Today, 2000
- Gene expression analysis by massively parallel signature sequencing (MPSS) on microbead arraysNature Biotechnology, 2000
- New Goals for the U.S. Human Genome Project: 1998-2003Science, 1998
- Large-Scale Identification, Mapping, and Genotyping of Single-Nucleotide Polymorphisms in the Human GenomeScience, 1998
- The New Genomics: Global Views of BiologyScience, 1996
- Serial Analysis of Gene ExpressionScience, 1995