A software tool for finding locally optimal alignments in protein and nucleic acid sequences
- 1 March 1988
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 4 (1) , 35-40
- https://doi.org/10.1093/bioinformatics/4.1.35
Abstract
We describe software for aligning protein or nucleic acid sequences based on the concept of match density. This method is especially useful for locating regions of short similarity between two longer sequences which may be largely dissimilar (e.g. locating active site regions in distantly related proteins). Our software is able to identify biologically interesting similarities between two sub-regions because it allows the user to control the matching parameters and the manner in which local alignments are selected for display. Furthermore, the collection and ranking of alignments for display uses a novel, highly efficient algorithm. We illustrate these features with several examples. In addition, we show that this tool can be used to find a new conserved sequence in several viral DNA polymerases, which, we suggest, occurs at a functionally important enzymatic site.This publication has 8 references indexed in Scilit:
- A sensitive procedure to compare amino acid sequencesJournal of Molecular Biology, 1987
- Related functional domains in virus DNA polymerases.The EMBO Journal, 1987
- The Complete DNA Sequence of Varicella-Zoster VirusJournal of General Virology, 1986
- Homology between DNA polymerases of poxviruses, herpesviruses, and adenoviruses: nucleotide sequence of the vaccinia virus DNA polymerase gene.Proceedings of the National Academy of Sciences, 1986
- Primary structural relationships may reflect similar DNA replication strategiesVirology, 1986
- Sequence and mapping analyses of the herpes simplex virus DNA polymerase gene predict a C-terminal substrate binding domain.Proceedings of the National Academy of Sciences, 1985
- DNA sequence and expression of the B95-8 Epstein—Barr virus genomeNature, 1984
- Pattern recognition in genetic sequences by mismatch densityBulletin of Mathematical Biology, 1984