SoDA2: a Hidden Markov Model approach for identification of immunoglobulin rearrangements
Open Access
- 9 February 2010
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 26 (7) , 867-872
- https://doi.org/10.1093/bioinformatics/btq056
Abstract
Motivation: The inference of pre-mutation immunoglobulin (Ig) rearrangements is essential in the study of the antibody repertoires produced in response to infection, in B-cell neoplasms and in autoimmune disease. Often, there are several rearrangements that are nearly equivalent as candidates for a given Ig gene, but have different consequences in an analysis. Our aim in this article is to develop a probabilistic model of the rearrangement process and a Bayesian method for estimating posterior probabilities for the comparison of multiple plausible rearrangements.
Results: We have developed SoDA2, which is based on a Hidden Markov Model and used to compute the posterior probabilities of candidate rearrangements and to find those with the highest values among them. We validated the software on a set of simulated data, a set of clonally related sequences, and a group of randomly selected Ig heavy chains from Genbank. In most tests, SoDA2 performed better than other available software for the task. Furthermore, the output format has been redesigned, in part, to facilitate comparison of multiple solutions.
Availability: SoDA2 is available online at https://hippocrates.duhs.duke.edu/soda. Simulated sequences are available upon request.
Contact: kepler@duke.edu
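The abstract describes comparing several plausible rearrangements by their posterior probabilities. As a rough illustration of that idea only, and not of SoDA2's actual model or code, the sketch below turns per-candidate log-likelihoods into normalized posteriors with Bayes' rule. The candidate labels, the log-likelihood values and the uniform prior are all made-up assumptions; in an HMM-based tool the likelihoods would come from the model itself.

```python
import math

def posteriors_from_log_likelihoods(log_likelihoods, log_priors=None):
    """Return P(candidate | sequence) for each candidate.

    log_likelihoods: dict mapping a candidate label to log P(sequence | candidate).
    log_priors: optional dict of log P(candidate); uniform if omitted (assumption).
    """
    if log_priors is None:
        uniform = -math.log(len(log_likelihoods))
        log_priors = {name: uniform for name in log_likelihoods}

    # Unnormalized log posterior: log prior + log likelihood.
    log_joint = {name: log_likelihoods[name] + log_priors[name]
                 for name in log_likelihoods}

    # Subtract the maximum before exponentiating (log-sum-exp trick)
    # so that very negative log scores do not underflow to zero.
    peak = max(log_joint.values())
    weights = {name: math.exp(value - peak) for name, value in log_joint.items()}
    total = sum(weights.values())
    return {name: weight / total for name, weight in weights.items()}

# Hypothetical candidates with made-up log-likelihoods (illustration only).
example_log_likelihoods = {
    "candidate_A": -100.0,
    "candidate_B": -101.0,
    "candidate_C": -105.0,
}

posteriors = posteriors_from_log_likelihoods(example_log_likelihoods)
for name, probability in sorted(posteriors.items(), key=lambda item: -item[1]):
    print(f"{name}: {probability:.4f}")
```

Candidates whose log-likelihoods differ by about one unit both keep appreciable posterior mass here. That is the "nearly equivalent" situation the abstract mentions, where reporting several solutions is more informative than reporting only the top one.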
This publication has 28 references indexed in Scilit:
- SoDA: implementation of a 3D alignment algorithm for inference of antigen receptor recombinations. Bioinformatics, 2005
- A new decoding algorithm for hidden Markov models improves the prediction of the topology of all-beta membrane proteins. BMC Bioinformatics, 2005
- Identification of common molecular subsequences. Published by Elsevier, 2004
- IMGT/JunctionAnalysis: the first tool for the analysis of the immunoglobulin and T cell receptor complex V–J and V–D–J JUNCTIONs. Bioinformatics, 2004
- Yet Another Numbering Scheme for Immunoglobulin Variable Domains: An Automatic Modeling and Analysis Tool. Journal of Molecular Biology, 2001
- Basic Local Alignment Search Tool. Journal of Molecular Biology, 1990
- A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 1989
- Somatic generation of antibody diversity. Nature, 1983
- Synthesis of compositionally unique DNA by terminal deoxynucleotidyl transferase. Biochemical and Biophysical Research Communications, 1983