The Evolutionary Analysis of Emerging Low Frequency HIV-1 CXCR4 Using Variants through Time—An Ultra-Deep Approach
Open Access
- 16 December 2010
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Computational Biology
- Vol. 6 (12) , e1001022
- https://doi.org/10.1371/journal.pcbi.1001022
Abstract
Large-scale parallel pyrosequencing produces unprecedented quantities of sequence data. However, when generated from viral populations current mapping software is inadequate for dealing with the high levels of variation present, resulting in the potential for biased data loss. In order to apply the 454 Life Sciences' pyrosequencing system to the study of viral populations, we have developed software for the processing of highly variable sequence data. Here we demonstrate our software by analyzing two temporally sampled HIV-1 intra-patient datasets from a clinical study of maraviroc. This drug binds the CCR5 coreceptor, thus preventing HIV-1 infection of the cell. The objective is to determine viral tropism (CCR5 versus CXCR4 usage) and track the evolution of minority CXCR4-using variants that may limit the response to a maraviroc-containing treatment regimen. Five time points (two prior to treatment) were available from each patient. We first quantify the effects of divergence on initial read k-mer mapping and demonstrate the importance of utilizing population-specific template sequences in relation to the analysis of next-generation sequence data. Then, in conjunction with coreceptor prediction algorithms that infer HIV tropism, our software was used to quantify the viral population structure pre- and post-treatment. In both cases, low frequency CXCR4-using variants (2.5–15%) were detected prior to treatment. Following phylogenetic inference, these variants were observed to exist as distinct lineages that were maintained through time. Our analysis, thus confirms the role of pre-existing CXCR4-using virus in the emergence of maraviroc-insensitive HIV. The software will have utility for the study of intra-host viral diversity and evolution of other fast evolving viruses, and is available from http://www.bioinf.manchester.ac.uk/segminator/. Due to high data volumes, error rates, and short sequence lengths, new sequencing technologies present a new challenge for computational biology. In addition, high-depth (or ultra-deep) datasets, for example from pathogens, contain exceptionally large amounts of variation over short genomes or genomic regions. Here we present software for the processing and downstream analysis of such short-read viral sequence data. We apply the software to the analysis of two HIV-1 infected individuals who did not respond optimally to the drug maraviroc. For each patient, pyrosequence data was available for five time points. In both cases we detect distinct clusters of low-frequency drug-insensitive variants that were present prior to maraviroc treatment and effectively unmasked by the removal of the drug-sensitive HIV.Keywords
This publication has 44 references indexed in Scilit:
- Detection of low-frequency pretherapy chemokine (CXC motif) receptor 4 (CXCR4)-using HIV-1 with ultra-deep pyrosequencingAIDS, 2009
- Next-generation DNA sequencingNature Biotechnology, 2008
- Bioinformatics challenges of new sequencing technologyPublished by Elsevier ,2008
- Characterization of mutation spectra with ultra-deep pyrosequencing: Application to HIV-1 drug resistanceGenome Research, 2007
- Structural Basis for Coreceptor Selectivity by The HIV Type 1 V3 LoopAIDS Research and Human Retroviruses, 2007
- Molecular switch for alternative conformations of the HIV-1 V3 region: Implications for phenotype conversionProceedings of the National Academy of Sciences, 2006
- MUSCLE: multiple sequence alignment with high accuracy and high throughputNucleic Acids Research, 2004
- BLAT—The BLAST-Like Alignment ToolGenome Research, 2002
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Viral dynamics in human immunodeficiency virus type 1 infectionNature, 1995