Using Chado to Store Genome Annotation Data

Open Access

1 December 2005

journal article
unit
Published by Wiley in Current Protocols in Bioinformatics

Vol. 12 (1) , 9.6.1-9.6.28
https://doi.org/10.1002/0471250953.bi0906s12

Abstract

Chado is a relational database schema that can be used to manage a wide variety of biological information, including genome annotation, genetic, phenotypic, and expression data. Its flexibility comes from its use of “ontologies,” which are controlled vocabularies that describe data types and the relationships among them. By changing its ontologies, Chado can be customized to suit many different needs. Another aspect that gives Chado its flexibility is its use of a modular design, which means that users can choose to use only those features of Chado that are suitable for their needs. XORT is the main software tool used to move data in and out of Chado databases. XORT uses an XML-based file format for data import and export; this format is called ChadoXML. The protocols described in this chapter show how to use XORT and related software to import genome annotation data into Chado databases, and how to export data stored in Chado databases into different file formats for report and data mining purposes.
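
The idea that data types live in a controlled vocabulary rather than in the table definitions can be illustrated with a small sketch. The code below is not the Chado schema itself: it is a heavily simplified, assumed subset loosely modeled on Chado's cv, cvterm, and feature tables, run in an in-memory SQLite database, and the helper names (build_demo_db, features_of_type) and sample identifiers are hypothetical. It shows only how a feature's type can be a reference to a vocabulary term, so that supporting a new data type means adding a term, not altering a table.

```python
import sqlite3

# Assumption: a heavily simplified subset of Chado-style tables.
# The real schema has many more columns, tables, and constraints.
SCHEMA = """
CREATE TABLE cv (
    cv_id INTEGER PRIMARY KEY,
    name  TEXT UNIQUE NOT NULL
);
CREATE TABLE cvterm (
    cvterm_id INTEGER PRIMARY KEY,
    cv_id     INTEGER NOT NULL REFERENCES cv(cv_id),
    name      TEXT NOT NULL,
    UNIQUE (cv_id, name)
);
CREATE TABLE feature (
    feature_id INTEGER PRIMARY KEY,
    uniquename TEXT NOT NULL,
    type_id    INTEGER NOT NULL REFERENCES cvterm(cvterm_id)
);
"""


def build_demo_db():
    """Create an in-memory database with one vocabulary and a few features."""
    conn = sqlite3.connect(":memory:")
    conn.execute("PRAGMA foreign_keys = ON")  # enforce the type_id reference
    conn.executescript(SCHEMA)
    conn.execute("INSERT INTO cv (cv_id, name) VALUES (1, 'sequence')")
    conn.executemany(
        "INSERT INTO cvterm (cvterm_id, cv_id, name) VALUES (?, 1, ?)",
        [(1, "gene"), (2, "mRNA"), (3, "exon")],
    )
    # Hypothetical sample identifiers, for illustration only.
    conn.executemany(
        "INSERT INTO feature (uniquename, type_id) VALUES (?, ?)",
        [("gene0001", 1), ("gene0001-RA", 2), ("gene0001:1", 3), ("gene0001:2", 3)],
    )
    conn.commit()
    return conn


def features_of_type(conn, type_name):
    """Return the unique names of all features typed with the given term."""
    rows = conn.execute(
        "SELECT f.uniquename FROM feature AS f "
        "JOIN cvterm AS t ON t.cvterm_id = f.type_id "
        "WHERE t.name = ? ORDER BY f.uniquename",
        (type_name,),
    )
    return [row[0] for row in rows]
```

In this sketch, calling features_of_type(build_demo_db(), "exon") returns the two exon rows, and a feature whose type_id does not match any term is rejected by the foreign key. Keeping types in a vocabulary table is what lets the set of allowed types change without a schema change.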
