Origin and properties of non-coding ORFs in the yeast genome
Open Access
- 1 September 1999
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 27 (17) , 3503-3509
- https://doi.org/10.1093/nar/27.17.3503
Abstract
In a recent paper we have estimated the total number of protein coding open reading frames (ORFs) in the Saccharomyces cerevisiae genome, based on their properties, at about 4800. This number is much smaller than the 5800–6000 which is widely accepted. In this paper we analyse differences between the set of ORFs with known phenotypes annotated in the Munich Information Centre for Protein Sequences (MIPS) database and ORFs for which the probability of coding, counted by us, is very low. We have found that many of the latter ORFs have properties of anti-sense sequences of coding ORFs, which suggests that they could have been generated by duplication of coding sequences. Since coding sequences generate ORFs inside themselves, with especially high frequency in the antisense sequences, we have looked for homology between known proteins and hypothetical polypeptides generated by ORFs under consideration in all the six phases. For many ORFs we have found paralogues and orthologues in phases different than the phase which had been assumed in the MIPS database as coding.Keywords
This publication has 17 references indexed in Scilit:
- The Base Contents of A, C, G or U for the Three Codon Positions and the Total Coding Sequences Show Positive CorrelationJournal of Biomolecular Structure and Dynamics, 1998
- Life with 6000 GenesScience, 1996
- Bioinformatics and the discovery of gene functionTrends in Genetics, 1996
- The yeast genome project: what did we learn?Trends in Genetics, 1996
- Generation of overlapping open reading framesTrends in Genetics, 1996
- Long-range correlation properties of coding and noncoding DNA sequences: GenBank analysisPhysical Review E, 1995
- Complete DNA sequence of yeast chromosome XINature, 1994
- Evolution of long-range fractal correlations and 1/fnoise in DNA base sequencesPhysical Review Letters, 1992
- Improved tools for biological sequence comparison.Proceedings of the National Academy of Sciences, 1988
- The codon adaptation index-a measure of directional synonymous codon usage bias, and its potential applicationsNucleic Acids Research, 1987