Using purine skews to predict genes in AT-rich poxviruses
Open Access
- 18 February 2005
- journal article
- research article
- Published by Springer Nature in BMC Genomics
- Vol. 6 (1) , 22
- https://doi.org/10.1186/1471-2164-6-22
Abstract
Background: Clusters or runs of purines on the mRNA synonymous strand have been found in many different organisms including orthopoxviruses. The purine bias that is exhibited by these clusters can be observed using a purine skew and in the case of poxviruses, these skews can be used to help determine the coding strand of a particular segment of the genome. Combined with previous findings that minor ORFs have lower than average aspartate and glutamate composition and higher than average serine composition, purine content can be used to predict the likelihood of a poxvirus ORF being a "real gene". Results: Using purine skews and a "quality" measure designed to incorporate previous findings about minor ORFs, we have found that in our training case (vaccinia virus strain Copenhagen), 59 of 65 minor (small and unlikely to be a real genes) ORFs were correctly classified as being minor. Of the 201 major (large and likely to be real genes) vaccinia ORFs, 192 were correctly classified as being major. Performing a similar analysis with the entomopoxvirus amsacta moorei (AMEV), it was found that 4 major ORFs were incorrectly classified as minor and 9 minor ORFs were incorrectly classified as major. The purine abundance observed for major ORFs in vaccinia virus was found to stem primarily from the first codon position with both the second and third codon positions containing roughly equal amounts of purines and pyrimidines. Conclusion: Purine skews and a "quality" measure can be used to predict functional ORFs and purine skews in particular can be used to determine which of two overlapping ORFs is most likely to be the real gene if neither of the two ORFs has orthologs in other poxviruses.Keywords
This publication has 17 references indexed in Scilit:
- Poxvirus Orthologous Clusters: toward Defining the Minimum Essential Poxvirus GenomeJournal of Virology, 2003
- Complete pathway for protein disulfide bond formation encoded by poxvirusesProceedings of the National Academy of Sciences, 2002
- Complete Genomic Sequence of the Amsacta moorei Entomopoxvirus: Analysis and Comparison with Other PoxvirusesVirology, 2000
- Deviations from Chargaff's Second Parity Rule Correlate with Direction of TranscriptionJournal of Theoretical Biology, 1999
- A simple vectorial representation of DNA sequences for the detection of replication origins in bacteriaBiochimie, 1996
- The complete DNA sequence of vaccinia virusVirology, 1990
- A backtranslation method based on codon usage strategyNucleic Acids Research, 1988
- Seapration of B. subtilis DNA into complementary strands. II. Template functions and composition as determined by transcription with RNA polymerase.Proceedings of the National Academy of Sciences, 1968
- Separation of B. subtilis DNA into complementary strands, I. Biological properties.Proceedings of the National Academy of Sciences, 1968
- Pyrimidine Clusters on the Transcribing Strand of DNA and Their Possible Role in the Initiation of RNA SynthesisCold Spring Harbor Symposia on Quantitative Biology, 1966