Using Chado to Store Genome Annotation Data
Open Access
- 1 December 2005
- journal article
- unit
- Published by Wiley in Current Protocols in Bioinformatics
- Vol. 12 (1) , 9.6.1-9.6.28
- https://doi.org/10.1002/0471250953.bi0906s12
Abstract
Chado is a relational database schema that can be used to manage a wide variety of biological information, including genome annotation, genetic, phenotypic, and expression data. Its flexibility comes from its use of “ontologies,” which are controlled vocabularies that describe data types and the relationships among them. By changing its ontologies, Chado can be customized to suit many different needs. Another aspect that gives Chado its flexibility is its use of a modular design, which means that users can choose to use only those features of Chado that are suitable for their needs. XORT is the main software tool used to move data in and out of Chado databases. XORT uses an XML-based file format for data import and export; this format is called ChadoXML, The protocols described in this chapter show how to use XORT and related software to import genome annotation data into Chado databases, and how to export data stored in Chado databases into different file formats for report and data mining purposes.Keywords
This publication has 6 references indexed in Scilit:
- An integrated computational pipeline and database to support whole-genome sequence annotationGenome Biology, 2002
- Apollo: a sequence annotation editorGenome Biology, 2002
- The Generic Genome Browser: A Building Block for a Model Organism System DatabaseGenome Research, 2002
- Genie—Gene Finding in Drosophila melanogasterGenome Research, 2000
- A Computer Program for Aligning a cDNA Sequence with a Genomic DNA SequenceGenome Research, 1998
- Prediction of complete gene structures in human genomic DNAJournal of Molecular Biology, 1997