The TIGR Plant Repeat Databases: a collective resource for the identification of repetitive sequences in plants
- 1 January 2004
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 32 (90001), D360-D363
- https://doi.org/10.1093/nar/gkh099
Abstract
In a number of higher plants, a substantial portion of the genome is composed of repetitive sequences that can hinder genome annotation and sequencing efforts. To better understand the nature of repetitive sequences in plants and provide a resource for identifying such sequences, we constructed databases of repetitive sequences for 12 plant genera: Arabidopsis, Brassica, Glycine, Hordeum, Lotus, Lycopersicon, Medicago, Oryza, Solanum, Sorghum, Triticum and Zea (www.tigr.org/tdb/e2k1/plant.repeats/index.shtml). The repetitive sequences within each database have been coded into super-classes, classes and sub-classes based on sequence and structure similarity. These databases are available for sequence similarity searches as well as downloadable files either as entire databases or subsets of each database. To further the utility for comparative studies and to provide a resource for searching for repetitive sequences in other genera within these families, repetitive sequences have been combined into four databases to represent the Brassicaceae, Fabaceae, Gramineae and Solanaceae families. Collectively, these databases provide a resource for the identification, classification and analysis of repetitive sequences in plants.
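The abstract does not spell out which genera feed each of the four family-level databases. As a minimal sketch, assuming the standard taxonomic placement of each genus, the grouping can be illustrated as follows. The names `GENUS_TO_FAMILY`, `group_by_family` and `FAMILY_DATABASES` are hypothetical and used only for illustration; they are not part of the TIGR resource, and the mapping is inferred from general taxonomy rather than taken from the databases themselves.

```python
from collections import defaultdict

# Assumption: genus-to-family assignments follow standard plant taxonomy.
# The abstract lists the 12 genera and the 4 families but not the pairing.
GENUS_TO_FAMILY = {
    "Arabidopsis": "Brassicaceae",
    "Brassica": "Brassicaceae",
    "Glycine": "Fabaceae",
    "Lotus": "Fabaceae",
    "Medicago": "Fabaceae",
    "Hordeum": "Gramineae",
    "Oryza": "Gramineae",
    "Sorghum": "Gramineae",
    "Triticum": "Gramineae",
    "Zea": "Gramineae",
    "Lycopersicon": "Solanaceae",
    "Solanum": "Solanaceae",
}


def group_by_family(genus_to_family):
    """Invert a genus -> family mapping into family -> sorted list of genera."""
    families = defaultdict(list)
    for genus, family in genus_to_family.items():
        families[family].append(genus)
    return {family: sorted(genera) for family, genera in families.items()}


# Hypothetical view of which genus-level databases each family database combines.
FAMILY_DATABASES = group_by_family(GENUS_TO_FAMILY)

if __name__ == "__main__":
    for family in sorted(FAMILY_DATABASES):
        print(f"{family}: {', '.join(FAMILY_DATABASES[family])}")
```

Under these assumptions, a repeat from a genus outside the 12 listed (for example, another grass) would be searched against the family-level database rather than any single genus-level one.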