A Human-Curated Annotation of the Candida albicans Genome
Top Cited Papers
Open Access
- 17 June 2005
- journal article
- review article
- Published by Public Library of Science (PLoS) in PLoS Genetics
- Vol. 1 (1) , e1-57
- https://doi.org/10.1371/journal.pgen.0010001
Abstract
Recent sequencing and assembly of the genome for the fungal pathogen Candida albicans used simple automated procedures for the identification of putative genes. We have reviewed the entire assembly, both by hand and with additional bioinformatic resources, to accurately map and describe 6,354 genes and to identify 246 genes whose original database entries contained sequencing errors (or possibly mutations) that affect their reading frame. Comparison with other fungal genomes permitted the identification of numerous fungus-specific genes that might be targeted for antifungal therapy. We also observed that, compared to other fungi, the protein-coding sequences in the C. albicans genome are especially rich in short sequence repeats. Finally, our improved annotation permitted a detailed analysis of several multigene families, and comparative genomic studies showed that C. albicans has a far greater catabolic range, encoding respiratory Complex 1, several novel oxidoreductases and ketone body degrading enzymes, malonyl-CoA and enoyl-CoA carriers, several novel amino acid degrading enzymes, a variety of secreted catabolic lipases and proteases, and numerous transporters to assimilate the resulting nutrients. The results of these efforts will ensure that the Candida research community has uniform and comprehensive genomic information for medical research as well as for future diagnostic and therapeutic applications. Candida albicans is a commonly encountered fungal pathogen usually responsible for superficial infections (thrush and vaginitis). However, an estimated 30% of severe fungal infections, most due to Candida, result in death. Those who are most at risk include individuals taking immune-suppressive drugs following organ transplantation, people with HIV infection, premature infants, and cancer patients undergoing chemotherapy. Current therapies for this pathogen are made more difficult by the significant secondary effects of anti-fungal drugs that target proteins that are also found in the human host. Recent sequencing and assembly of the genome for the fungal pathogen C. albicans used simple automated procedures for the identification of putative genes. Here, we report a detailed annotation of the 6,354 genes that are present in the genome sequence of this organism, essentially writing the dictionary of the C. albicans genome. Comparison with other fungal genomes permitted the identification of numerous fungus-specific genes that are absent from the human genome and whose products might be targeted for antifungal therapy. The results of these efforts will thus ensure that the Candida research community has uniform and comprehensive genomic information for medical research, for the development of functional genomic tools as well as for future diagnostic and therapeutic applications.Keywords
This publication has 147 references indexed in Scilit:
- Genome evolution in yeastsNature, 2004
- The Pfam protein families databaseNucleic Acids Research, 2004
- Evolution of a Combinatorial Transcriptional CircuitCell, 2003
- Sequencing and comparison of yeast species to identify genes and regulatory elementsNature, 2003
- The genome sequence of the filamentous fungus Neurospora crassaNature, 2003
- Initial sequencing and comparative analysis of the mouse genomeNature, 2002
- The genome sequence of Schizosaccharomyces pombeNature, 2002
- Initial sequencing and analysis of the human genomeNature, 2001
- The Genome Sequence of Drosophila melanogasterScience, 2000
- Molecular cloning and characterization of a novel gene of Candida albicans, CDR1, conferring multiple resistance to drugs and antifungalsCurrent Genetics, 1995