Reevaluation of human cytomegalovirus coding potential
- 30 October 2003
- journal article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 100 (23) , 13585-13590
- https://doi.org/10.1073/pnas.1735466100
Abstract
The Bio-Dictionary-based Gene Finder was used to reassess the coding potential of the AD169 laboratory strain of human cytomegalovirus and sequences in the Toledo strain that are missing in the laboratory strain of the virus. The gene-finder algorithm assesses the potential of an ORF to encode a protein based on matches to a database of amino acid patterns derived from a large collection of proteins. The algorithm was used to score all human cytomegalovirus ORFs with the potential to encode polypeptides >/=50 aa in length. As a further test for functionality, the genomes of the chimpanzee, rhesus, and murine cytomegaloviruses were searched for orthologues of the predicted human cytomegalovirus ORFs. The analysis indicates that 37 previously annotated ORFs ought to be discarded, and at least nine previously unrecognized ORFs with relatively strong coding potential should be added. Thus, the human cytomegalovirus genome appears to contain approximately 192 unique ORFs with the potential to encode a protein. Support for several of the predictions of our in silico analysis was obtained by sequencing several domains within a clinical isolate of human cytomegalovirus.Keywords
This publication has 59 references indexed in Scilit:
- In Silico Pattern-Based Analysis of the Human Cytomegalovirus GenomeJournal of Virology, 2003
- The Draft Genome of Ciona intestinalis : Insights into Chordate and Vertebrate OriginsScience, 2002
- The Human Cytomegalovirus US10 Gene Product Delays Trafficking of Major Histocompatibility Complex Class I MoleculesJournal of Virology, 2002
- Human Cytomegalovirus US7, US8, US9, and US10 Are Cytoplasmic Glycoproteins, Not Found at Cell Surfaces, and US9 Does Not Mediate Cell-to-Cell SpreadJournal of Virology, 2002
- Open Reading Frame UL26 of Human Cytomegalovirus Encodes a Novel Tegument Protein That Contains a Strong Transcriptional Activation DomainJournal of Virology, 2002
- Identification of Glycoprotein gpTRL10 as a Structural Component of Human CytomegalovirusJournal of Virology, 2002
- Abundant Early Expression of gpUL4 from a Human Cytomegalovirus Mutant Lacking a Repressive Upstream Open Reading FrameJournal of Virology, 2001
- In silico structural and functional analysis of the human cytomegalovirus (HHV5) genome 1 1Edited by F. CohenJournal of Molecular Biology, 2001
- Identification of a Novel Transcriptional Repressor Encoded by Human CytomegalovirusJournal of Virology, 2001
- Human cytomegalovirus open reading frame UL11 encodes a highly polymorphic protein expressed on the infected cell surfaceArchiv für die gesamte Virusforschung, 1997