A Computer-Based Method of Selecting Clones for a Full-Length cDNA Project: Simultaneous Collection of Negligibly Redundant and Variant cDNAs
Open Access
- 18 June 2002
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 12 (7) , 1127-1134
- https://doi.org/10.1101/gr.75202
Abstract
We describe a computer-based method that selects representative clones for full-length sequencing in a full-length cDNA project. Our method classifies end sequences using two kinds of criteria, grouping, and clustering. Grouping places together variant cDNAs, family genes, and cDNAs with sequencing errors. Clustering separates those cDNA clones into distinct clusters. The full-length sequences of the clones selected by grouping are determined preferentially, and then the sequences selected by clustering are determined. Grouping reduced the number of rice cDNA clones for full-length sequencing to 21% and mouse cDNA clones to 25%. Rice full-length sequences selected by grouping showed a 1.07-fold redundancy. Mouse full-length sequences showed a 1.04-fold redundancy, which can be reduced by ∼30% from the selection using our previous method. To estimate the coverage of unique genes, we used FANTOM (Functional Annotation of RIKEN Mouse cDNA Clones) clusters (the RIKEN Genome Exploration Research Group 2001). Grouping covered almost all unique genes (93% of FANTOM clusters), and clustering covered all genes. Therefore, our method is useful for the selection of appropriate representative clones for full-length sequencing, thereby greatly reducing the cost, labor, and time necessary for this process. [The programs used in this paper are available online at http://genome.gsc.riken.go.jp/software/2C.]Keywords
This publication has 29 references indexed in Scilit:
- FANTOM DB: database of Functional Annotation of RIKEN Mouse cDNA ClonesNucleic Acids Research, 2002
- Computer-Based Methods for the Mouse Full-Length cDNA Encyclopedia: Real-Time Sequence Clustering for Construction of a Nonredundant cDNA LibraryGenome Research, 2001
- [2] High-efficiency full-length cDNA cloningPublished by Elsevier ,1999
- High Efficiency Selection of Full-length cDNA by Improved Biotinylated Cap TrapperDNA Research, 1997
- High-Efficiency Full-Length cDNA Cloning by Biotinylated CAP TrapperGenomics, 1996
- ESTablishing a human transcript mapNature Genetics, 1995
- 3′-End cleavage and polyadenylation of mRNA precursorsBiochimica et Biophysica Acta (BBA) - Gene Structure and Expression, 1995
- TIGR Assembler: A New Tool for Assembling Large Shotgun Sequencing ProjectsGenome Science and Technology, 1995
- THE BIOCHEMISTRY OF 3′-END CLEAVAGE AND POLYADENYLATION OF MESSENGER RNA PRECURSORSAnnual Review of Biochemistry, 1992
- Improved tools for biological sequence comparison.Proceedings of the National Academy of Sciences, 1988