Searching databases of conserved sequence regions by aligning protein multiple-alignments [published erratum appears in Nucleic Acids Res 1996 Nov 1;24(21):4372]
- 1 October 1996
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 24 (19) , 3836-3845
- https://doi.org/10.1093/nar/24.19.3836
Abstract
A general searching method for comparing multiple sequence alignments was developed to detect sequence relationships between conserved protein regions. Multiple alignments are treated as sequences of amino acid distributions and aligned by comparing pairs of such distributions. Four different comparison measures were tested and the Pearson correlation coefficient chosen. The method is sensitive, detecting weak sequence relationships between protein families. Relationships are detected beyond the range of conventional sequence database searches, illustrating the potential usefulness of the method. The previously undetected relation between flavoprotein subunits of two oxidoreductase families points to the potential active site in one of the families. The similarity between the bacterial RecA, DnaA and Rad51 protein families reveals a region in DnaA and Rad51 proteins likely to bind and unstack single-stranded DNA. Helix--turn--helix DNA binding domains from diverse proteins are readily detected and shown to be similar to each other. Glycosylasparaginase and gamma-glutamyltransferase enzymes are found to be similar in their proteolytic cleavage sites. The method has been fully implemented on the World Wide Web at URL: http://blocks.fhcrc.org/blocks-bin/LAMAvsearch.Keywords
This publication has 67 references indexed in Scilit:
- Amino acid substitution matrices from an information theoretic perspectivePublished by Elsevier ,2005
- Position-based sequence weightsPublished by Elsevier ,2004
- Homologous DNA Pairing Promoted by a 20-Amino Acid Peptide Derived from RecAScience, 1996
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- Detecting Patterns in Protein SequencesJournal of Molecular Biology, 1994
- Basic local alignment search toolJournal of Molecular Biology, 1990
- Sequence logos: a new way to display consensus sequencesNucleic Acids Research, 1990
- Determining residue-base interactions between AraC protein and araI DNAJournal of Molecular Biology, 1989
- Systematic method for the detection of potential λ Cro-like DNA-binding regions in proteinsJournal of Molecular Biology, 1987
- Prediction of the occurrence of the ADP-binding βαβ-fold in proteins, using an amino acid sequence fingerprintJournal of Molecular Biology, 1986