Colibri: a functional data base for the Escherichia coli genome.
- 1 September 1993
- journal article
- review article
- Vol. 57 (3) , 623-54
Abstract
Several data libraries have been created to organize all the data obtained worldwide about the Escherichia coli genome. Because the known data now amount to more than 40% of the whole genome sequence, it has become necessary to organize the data in such a way that appropriate procedures can associate knowledge produced by experiments about each gene to its position on the chromosome and its relation to other relevant genes, for example. In addition, global properties of genes, affected by the introduction of new entries, should be present as appropriate description fields. A data base, implemented on Macintosh by using the data base management system 4th Dimension, is described. It is constructed around a core constituted by known contigs of E. coli sequences and links data collected in general libraries (unmodified) to data associated with evolving knowledge (with modifiable fields). Biologically significant results obtained through the coupling of appropriate procedures (learning or statistical data analysis) are presented. The data base is available through a 4th Dimension runtime and through FTP on Internet. It has been regularly updated and will be systematically linked to other E. coli data bases (M. Kroger, R. Wahl, G. Schachtel, and P. Rice, Nucleic Acids Res. 20(Suppl.):2119-2144, 1992; K. E. Rudd, W. Miller, C. Werner, J. Ostell, C. Tolstoshev, and S. G. Satterfield, Nucleic Acids Res. 19:637-647, 1991) in the near future.This publication has 25 references indexed in Scilit:
- Analysis of the Escherichia coli Genome: DNA Sequence of the Region from 84.5 to 86.5 MinutesScience, 1992
- Mapping of sequenced genes (700 kbp) in the restriction map of the Escherichia coli chromosomeMolecular Microbiology, 1990
- Alignment of Escherichia coli K12 DNA sequences to a genomic restriction mapNucleic Acids Research, 1990
- The distribution of restriction enzyme sites inEscherichia coliNucleic Acids Research, 1990
- Completion of the detailed restriction map of theE.coligenome by the isolation of overlapping cosmid clonesNucleic Acids Research, 1989
- Randomly picked cosmid clones overlap thepyrB andoriC gap in the physical map of theE.colichromosomeNucleic Acids Research, 1988
- The physical map of the whole E. coli chromosome: Application of a new strategy for rapid analysis and sorting of a large genomic libraryCell, 1987
- Analysis of the Codon Bias inE. coliSequencesJournal of Biomolecular Structure and Dynamics, 1984
- Rapid similarity searches of nucleic acid and protein data banks.Proceedings of the National Academy of Sciences, 1983
- Inversions between ribosomal RNA genes of Escherichia coli.Proceedings of the National Academy of Sciences, 1981