Ab-origin: an enhanced tool to identify the sourcing gene segments in germline for rearranged antibodies
Open Access
- 12 December 2008
- journal article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 9 (S12), S20
- https://doi.org/10.1186/1471-2105-9-s12-s20
Abstract
Background: In the adaptive immune system, the variable regions of immunoglobulins (IG) are encoded by random recombination of variable (V), diversity (D), and joining (J) gene segments in the germline. Partitioning functional antibody sequences into their source germline gene segments is vital not only for understanding antibody maturation but also for advancing the engineering of therapeutic antibodies. To date, several tools have been developed to perform such "trace-back" calculations. Yet the predictive ability and processing capacity of these tools vary significantly across data sets. Moreover, none of them gives a confidence measure for immunoglobulin heavy diversity (IGHD) identification. With the rapid growth of immunological data, fast, efficient and enhanced tools are needed.
Results: Here, a program named Ab-origin is presented. It performs batch queries against germline databases, drawing on empirical knowledge, an optimized scoring scheme and appropriate parameters. Special effort has been made to improve identification accuracy for the short and volatile IGHD region. In particular, a threshold score corresponding to a given sensitivity and specificity is provided to indicate the confidence level of the IGHD identification.
Conclusion: When evaluated on different sets of both simulated and experimental data, Ab-origin outperformed all five other popular tools in prediction accuracy. The batch query feature and the confidence indication for IGHD identification should offer additional help to users. The program is freely available at http://mpsq.biosino.org/ab-origin/supplementary.html.