Functional analysis of gapped microbial genomes: Amino acid metabolism of Thiobacillus ferrooxidans
Open Access
- 28 March 2000
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 97 (7) , 3509-3514
- https://doi.org/10.1073/pnas.97.7.3509
Abstract
A gapped genome sequence of the biomining bacterium Thiobacillus ferrooxidans strain ATCC23270 was assembled from sheared DNA fragments (3.2-times coverage) into 1,912 contigs. A total of 2,712 potential genes (ORFs) were identified in 2.6 Mbp (megabase pairs) of Thiobacillus genomic sequence. Of these genes, 2,159 could be assigned functions by using the WIT-Pro/EMP genome analysis system, most with a high degree of certainty. Nine hundred of the genes have been assigned roles in metabolic pathways, producing an overview of cellular biosynthesis, bioenergetics, and catabolism. Sequence similarities, relative gene positions on the chromosome, and metabolic reconstruction (placement of gene products in metabolic pathways) were all used to aid gene assignments and for development of a functional overview. Amino acid biosynthesis was chosen to demonstrate the analytical capabilities of this approach. Only 10 expected enzymatic activities, of the nearly 150 involved in the biosynthesis of all 20 amino acids, are currently unassigned in the Thiobacillus genome. This result compares favorably with 10 missing genes for amino acid biosynthesis in the complete Escherichiacoli genome. Gapped genome analysis can therefore give a decent picture of the central metabolism of a microorganism, equivalent to that of a complete sequence, at significantly lower cost.Keywords
This publication has 28 references indexed in Scilit:
- Reasons why ‘Leptospirillum’-like species rather than Thiobacillus ferrooxidans are the dominant iron-oxidizing bacteria in many commercial processes for the biooxidation of pyrite and related oresMicrobiology, 1999
- Shotgun Sequencing of the Human GenomeScience, 1998
- A Genomic Perspective on Protein FamiliesScience, 1997
- The Complete Genome Sequence of Escherichia coli K-12Science, 1997
- Complete Genome Sequence of the Methanogenic Archaeon, Methanococcus jannaschii Science, 1996
- Bacterium genome sequenceNature, 1996
- Sequence Analysis of the Genome of the Unicellular Cyanobacterium Synechocystis sp. Strain PCC6803. II. Sequence Determination of the Entire Genome and Assignment of Potential Protein-coding Regions (Supplement)DNA Research, 1996
- Sequence Analysis of the Genome of the Unicellular Cyanobacterium Synechocystis sp. Strain PCC6803. II. Sequence Determination of the Entire Genome and Assignment of Potential Protein-coding RegionsDNA Research, 1996
- Mining with MicrobesBio/Technology, 1995
- Whole-Genome Random Sequencing and Assembly of Haemophilus influenzae RdScience, 1995