Maintaining the integrity of human immunodeficiency virus sequence databases
- 1 August 1996
- journal article
- Published by American Society for Microbiology in Journal of Virology
- Vol. 70 (8) , 5720-30
- https://doi.org/10.1128/jvi.70.8.5720-5730.1996
Abstract
Human immunodeficiency virus type 1 (HIV-1) sequences are accumulating in the literature at a rapid pace. For this ever-expanding resource to be maximally useful, it is critical that researchers strive to maintain a high level of quality assurance, both in experimental design and conduct and in analyses. Here we present detailed analyses of problematic sets of HIV-1 sequences in the database that include sequence anomalies suggestive of mislabeling or sample contamination problems. These data are examined in the context of currently available HIV-1 sequence information to provide an example of how to identify potentially flawed data. Indicators of potential problems with sequences are (i) sequences that are nearly identical that are supposed to be derived from unlinked individuals and that are markedly distinct from other sequences from the putative source or (ii) sequences that are nearly identical to those of laboratory strains. We provide an outline of methods that researchers can use to perform preliminary laboratory and computational analyses that could help identify problematic data and thus help ensure the integrity of sequence databases.Keywords
This publication has 34 references indexed in Scilit:
- Protecting HIV databasesNature, 1995
- V3 sequences in primary HIV-I infectionAIDS, 1995
- The genetic data environment an expandable GUI for multiple sequence analysisBioinformatics, 1994
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- The origin of HIV-1 isolate HTLV-IIIBNature, 1993
- PHYLOGENIES FROM MOLECULAR SEQUENCES: INFERENCE AND RELIABILITYAnnual Review of Genetics, 1988
- Multiple aligned sequence editor (MASE)Trends in Biochemical Sciences, 1988
- The phylogenetic history of immunodeficiency virusesNature, 1988
- Origins of HTLV-4Nature, 1987
- A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequencesJournal of Molecular Evolution, 1980