Horizontal gene transfer and nucleotide compositional anomaly in large DNA viruses
Open Access
- 10 December 2007
- journal article
- Published by Springer Nature in BMC Genomics
- Vol. 8 (1) , 456
- https://doi.org/10.1186/1471-2164-8-456
Abstract
DNA viruses have a wide range of genome sizes (5 kb up to 1.2 Mb, compared to 0.16 Mb to 1.5 Mb for obligate parasitic bacteria) that do not correlate with their virulence or the taxonomic distribution of their hosts. The reasons for such large variation are unclear. According to the traditional view of viruses as gifted "gene pickpockets", large viral genome sizes could originate from numerous gene acquisitions from their hosts. We investigated this hypothesis by studying 67 large DNA viruses with genome sizes larger than 150 kb, including the recently characterized giant mimivirus. Given that horizontally transferred DNA often have anomalous nucleotide compositions differing from the rest of the genome, we conducted a detailed analysis of the inter- and intra-genome compositional properties of these viruses. We then interpreted their compositional heterogeneity in terms of possible causes, including strand asymmetry, gene function/expression, and horizontal transfer.Keywords
This publication has 80 references indexed in Scilit:
- Unique genes in giant viruses: Regular substitution pattern and anomalously short sizeGenome Research, 2007
- Distinctive features of large complex virus genomes and proteomesProceedings of the National Academy of Sciences, 2007
- Mimivirus Giant Particles Incorporate a Large Fraction of Anonymous and Unique Gene ProductsJournal of Virology, 2006
- Locus-Specific Gene Expression Pattern Suggests a Unique Propagation Strategy for a Giant Algal VirusJournal of Virology, 2006
- Viruses in the seaNature, 2005
- NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteinsNucleic Acids Research, 2004
- Biased biological functions of horizontally transferred genes in prokaryotic genomesNature Genetics, 2004
- The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003Nucleic Acids Research, 2003
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994