Databases and Information Integration for the Medicago truncatula Genome and Transcriptome
Open Access
- 1 May 2005
- journal article
- Published by Oxford University Press (OUP) in Plant Physiology
- Vol. 138 (1), 38-46
- https://doi.org/10.1104/pp.104.059204
Abstract
An international consortium is sequencing the euchromatic genespace of Medicago truncatula. Extensive bioinformatic and database resources support the marker-anchored bacterial artificial chromosome (BAC) sequencing strategy. Existing physical and genetic maps and deep BAC-end sequencing help to guide the sequencing effort, while EST databases provide essential resources for genome annotation as well as transcriptome characterization and microarray design. Finished BAC sequences are joined into overlapping sequence assemblies and undergo an automated annotation process that integrates ab initio predictions with EST, protein, and other recognizable features. Because of the sequencing project's international and collaborative nature, data production, storage, and visualization tools are broadly distributed. This paper describes databases and Web resources for the project, which provide support for physical and genetic maps, genome sequence assembly, gene prediction, and integration of EST data. A central project Web site at medicago.org/genome provides access to genome viewers and other resources project-wide, including an Ensembl implementation at medicago.org, physical map and marker resources at mtgenome.ucdavis.edu, and genome viewers at the University of Oklahoma (www.genome.ou.edu), the Institute for Genomic Research (www.tigr.org), and Munich Information for Protein Sequences Center (mips.gsf.de).