Theoretical distribution of truncation lengths in incremental truncation libraries
- 17 March 2003
- journal article
- research article
- Published by Wiley in Biotechnology & Bioengineering
- Vol. 82 (5) , 564-577
- https://doi.org/10.1002/bit.10604
Abstract
Incremental truncation is a method for constructing libraries of every one base pair truncation of a segment of DNA. Incremental truncation libraries can be created using a time-dependent nuclease method or through the incorporation of alpha-phosphothioate dNTPs by PCR or by primer extension (THIO(pcr) truncation and THIO(extension) truncation, respectively). Libraries created by the fusion of two truncation libraries, known as ITCHY libraries, can be created using the above methods or by the incremental truncation-like method SHIPREC. Knowing and being able to tailor the distribution of truncations in incremental truncation, ITCHY and SHIPREC libraries would be beneficial for their use in protein engineering and other applications. However, the experimental determination of the distributions would require extensive, cost-prohibitive, DNA sequencing to obtain statistically relevant data. Instead, a theoretical prediction of the distributions was developed. Time-dependent incremental truncation libraries had the most uniform distribution of truncation lengths, but were biased against longer truncations. Essentially uniform distribution over the desired truncation range (from zero to N(max) base pairs) required that truncations be prepared up to at least 1.2-1.5 N(max). THIO(pcr) and THIO(extension) truncation libraries had a very nonuniform distribution of truncation lengths with a bias against longer truncations. Such nonuniformity could be significantly diminished by decreasing the incorporation rate of alphaS-dNTPs but at the expense of having a large fraction of the DNA truncated beyond the desired range or completely degraded. ITCHY libraries created using time-dependent truncation had the most uniform distribution of possible fusions and had the highest fraction of the library being parental-length fusions. However, the distribution of parental-length fusions was biased against fusions near the beginning/ends of genes unless the truncation libraries are prepared with a uniform distribution up to N(max). In contrast, SHIPREC libraries and THIO(pcr) ITCHY libraries, by the very nature of the nonuniform distributions of the truncated DNA, are ensured of having a uniform distribution of fusion points in parental-length fusions. This comes at the expense of having a smaller fraction of the library being parental-length fusions; however, this limitation can be overcome by performing size selection on the library.Keywords
This publication has 12 references indexed in Scilit:
- Creating multiple-crossover DNA libraries independent of sequence identityProceedings of the National Academy of Sciences, 2001
- Libraries of hybrid proteins from distantly related sequencesNature Biotechnology, 2001
- Rapid generation of incremental truncation libraries for protein engineering using alpha-phosphothioate nucleotidesNucleic Acids Research, 2001
- Combinatorial and computational challenges for biocatalyst designNature, 2001
- Construction of hybrid gene libraries involving the circular permutation of DNABiotechnology Letters, 2001
- Finding Cinderella's slipper—proteins that fitNature Biotechnology, 1999
- Combinatorial protein engineering by incremental truncationProceedings of the National Academy of Sciences, 1999
- DNA shuffling by random fragmentation and reassembly: in vitro recombination for molecular evolution.Proceedings of the National Academy of Sciences, 1994
- On the Activities of Escherichia coli Exonuclease IIIAnalytical Biochemistry, 1993
- Approximations for Digital ComputersPublished by Walter de Gruyter GmbH ,1955