Why Are There Still Over 1000 Uncharacterized Yeast Genes?
- 1 May 2007
- journal article
- review article
- Published by Oxford University Press (OUP) in Genetics
- Vol. 176 (1) , 7-14
- https://doi.org/10.1534/genetics.107.074468
Abstract
The yeast genetics community has embraced genomic biology, and there is a general understanding that obtaining a full encyclopedia of functions of the ∼6000 genes is a worthwhile goal. The yeast literature comprises over 40,000 research papers, and the number of yeast researchers exceeds the number of genes. There are mutated and tagged alleles for virtually every gene, and hundreds of high-throughput data sets and computational analyses have been described. Why, then, are there >1000 genes still listed as uncharacterized on the Saccharomyces Genome Database, 10 years after sequencing the genome of this powerful model organism? Examination of the currently uncharacterized gene set suggests that while some are small or newly discovered, the vast majority were evident from the initial genome sequence. Most are present in multiple genomics data sets, which may provide clues to function. In addition, roughly half contain recognizable protein domains, and many of these suggest specific metabolic activities. Notably, the uncharacterized gene set is highly enriched for genes whose only homologs are in other fungi. Achieving a full catalog of yeast gene functions may require a greater focus on the life of yeast outside the laboratory.Keywords
This publication has 58 references indexed in Scilit:
- Histone H3-K56 Acetylation Is Catalyzed by Histone Chaperone-Dependent ComplexesMolecular Cell, 2007
- A large-scale full-length cDNA analysis to explore the budding yeast transcriptomeProceedings of the National Academy of Sciences, 2006
- Saccharomyces cerevisiae S288C genome annotation: a working hypothesisYeast, 2006
- Global landscape of protein complexes in the yeast Saccharomyces cerevisiaeNature, 2006
- Proteome survey reveals modularity of the yeast cell machineryNature, 2006
- BioGRID: a general repository for interaction datasetsNucleic Acids Research, 2006
- Global analysis of protein localization in budding yeastNature, 2003
- Sequencing and comparison of yeast species to identify genes and regulatory elementsNature, 2003
- Functional profiling of the Saccharomyces cerevisiae genomeNature, 2002
- Comparative assessment of large-scale data sets of protein–protein interactionsNature, 2002