Utilization of sequence libraries on a 16-bit mini computer with particular reference to high speed searching
- 1 January 1984
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 12 (1Part1), 409-416
- https://doi.org/10.1093/nar/12.1part1.409
Abstract
An interactive, menu-driven system of programmes written in Fortran and designed to utilize the three main nucleotide sequence libraries and one amino acid sequence library was developed to run on a small 16-bit mini computer with limited main memory and mass storage. The software uses a minimum of system function calls and should be transportable with minimal rewriting to micro computers. Software has also been written to create secondary data bases containing the nucleotide triplet values (4^3 = 64 classes) derived from the sequence libraries. Using this secondary set, a given sequence and its reversed complement, once reduced to their trinucleotide values, can be compared to all sequences present in the libraries in about forty minutes on a PDP 11/10 mini computer using the correlation statistic. Because the statistic in this case may not be assumed to be normally distributed, we have termed it a quasi correlation coefficient (Qr).
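The search described in the abstract can be illustrated with a minimal sketch. This is not the authors' Fortran implementation, and the abstract does not say how the trinucleotide values are computed. The sketch assumes that each sequence is reduced to counts of overlapping trinucleotides in the 64 classes and that Qr is an ordinary Pearson-style correlation between two such count vectors. All function names (`triplet_counts`, `quasi_correlation`, `search`) are hypothetical.

```python
from itertools import product
from math import sqrt

# The 4^3 = 64 trinucleotide classes in a fixed order (assumed encoding).
TRIPLETS = ["".join(p) for p in product("ACGT", repeat=3)]
INDEX = {t: i for i, t in enumerate(TRIPLETS)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reversed complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def triplet_counts(seq):
    """Reduce a sequence to 64 counts of overlapping trinucleotides.

    Assumption: overlapping windows; windows containing symbols other
    than A, C, G, T are skipped.
    """
    counts = [0] * 64
    seq = seq.upper()
    for i in range(len(seq) - 2):
        idx = INDEX.get(seq[i:i + 3])
        if idx is not None:
            counts[idx] += 1
    return counts


def quasi_correlation(x, y):
    """Pearson-style correlation of two count vectors (stand-in for Qr).

    Returns 0.0 when either vector has zero variance.
    """
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / sqrt(sxx * syy)


def search(query, library):
    """Compare a query and its reversed complement to every library entry.

    library maps an entry name to its sequence. Returns a list of
    (name, qr_forward, qr_reverse) tuples, best match first.
    """
    forward = triplet_counts(query)
    reverse = triplet_counts(reverse_complement(query))
    hits = []
    for name, seq in library.items():
        ref = triplet_counts(seq)  # precomputed in a secondary data base in practice
        hits.append((name,
                     quasi_correlation(forward, ref),
                     quasi_correlation(reverse, ref)))
    hits.sort(key=lambda h: max(h[1], h[2]), reverse=True)
    return hits
```

In this sketch the library vectors are recomputed on every call. Storing them once, as the secondary data bases in the abstract do, would leave only one pass over 64 values per library entry for each query strand, which is presumably what makes a full search feasible on a small machine.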