Sputnik: a database platform for comparative plant genomics
- 1 January 2003
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 31 (1) , 128-132
- https://doi.org/10.1093/nar/gkg075
Abstract
Two million plant ESTs, from 20 different plant species, and totalling more than one 1000 Mbp of DNA sequence, represents a formidable transcriptomic resource. Sputnik uses the potential of this sequence resource to fill some of the information gap in the un-sequenced plant genomes and to serve as the foundation for in silicio comparative plant genomics. The complexity of the individual EST collections has been reduced using optimised EST clustering techniques. Annotation of cluster sequences is performed by exploiting and transferring information from the comprehensive knowledgebase already produced for the completed model plant genome (Arabidopsis thaliana) and by performing additional state of-the-art sequence analyses relevant to today's plant biologist. Functional predictions, comparative analyses and associative annotations for 500 000 plant EST derived peptides make Sputnik (http://mips.gsf.de/proj/sputnik/) a valid platform for contemporary plant genomics.Keywords
This publication has 16 references indexed in Scilit:
- Rice Genomes: A Grainy View of Future Evolutionary ResearchCurrent Biology, 2002
- Assaying gene content in ArabidopsisProceedings of the National Academy of Sciences, 2002
- A Draft Sequence of the Rice Genome ( Oryza sativa L. ssp. japonica )Science, 2002
- A Draft Sequence of the Rice Genome ( Oryza sativa L. ssp. indica )Science, 2002
- The Protein Data Bank: unifying the archiveNucleic Acids Research, 2002
- The TIGR Gene Indices: analysis of gene transcript sequences in highly sampled eukaryotic speciesNucleic Acids Research, 2001
- Analysis of the genome sequence of the flowering plant Arabidopsis thalianaNature, 2000
- Now for the hard onesNature, 2000
- Complementary DNA Sequencing: Expressed Sequence Tags and Human Genome ProjectScience, 1991
- Simple sequences are ubiquitous repetitive components of eukaryotic genomesNucleic Acids Research, 1984