Fast analysis of DNA and protein sequence on Apple IIe: restriction sites search, alignment of short sequence and dot matrix analysis
- 1 January 1986
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 14 (1) , 583-590
- https://doi.org/10.1093/nar/14.1.583
Abstract
A fast restriction sites search algorithm using a quadruplet look-ahead feature has been written in 6502 assembly language code. The search time, tested on the sequence of pBR322, is 4.1 s/kilobase using a restriction site library including 112 specificities corresponding to a total site length of over 700 bases. The search for a short sequence (less than 36 bases) within a longer one (up to 9999 bases) with a given number of mismatches or gaps allowed has also been written in assembly language. Typical run time for the search of a 12 base sequence with 1, 2 or 3 gaps allowed are 6.2, 9.4 or 13.6 s/kilobase, respectively. The dot matrix analysis needs 7.5 minutes per square kilobase when using a stringency of 15 matched bases out of 25. A 7/21 matrix of two 500 amino acid proteins is obtained in 3 minutes. These three routines are included in DPSA, a general package of programs allowing manipulation and analysis of DNA and protein sequences.Keywords
This publication has 9 references indexed in Scilit:
- Cloning, nucleotide sequence, and expression of one of two genes coding for yeast elongation factor 1 alpha.Journal of Biological Chemistry, 1985
- Restriction and modification enzymes and their recognition sequencesNucleic Acids Research, 1985
- The diagonal-traverse homology search algorithm for locating similarities between two sequencesNucleic Acids Research, 1984
- A DNA sequence analysis package for the IBM personal computerNucleic Acids Research, 1984
- A comprehensive sequence analysis program for the IBM personal computerNucleic Acids Research, 1984
- Fast optimal alignmentNucleic Acids Research, 1984
- Rapid similarity searches of nucleic acid and protein data banks.Proceedings of the National Academy of Sciences, 1983
- Cloned human and mouse kappa immunoglobulin constant and J region genes conserve homology in functional segmentsCell, 1980
- A general method applicable to the search for similarities in the amino acid sequence of two proteinsJournal of Molecular Biology, 1970