Detection of protein similarities using nucleotide sequence databases
- 11 July 1988
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 16 (13) , 6191-6204
- https://doi.org/10.1093/nar/16.13.6191
Abstract
A simple procedure is described for finding similarities between proteins using nucleotide sequence databases. The approach is illustrated by several examples of previously unknown correspondences with important biological implications: Drosophila elongation factor Tu is shown to be encoded by two genes that are differently expressed during development; a cluster of three Drosophila genes likely encode maltases; a flesh-fly fat body protein resembles the hypothesized Drosophila alcohol dehydrogenase ancestral protein; an unknown protein encoded at the multifunctional E. coli hisT locus resembles aspartate β-semialdehyde dehydrogenase; and the E. coli tryR protein is related to nitrogen regulatory proteins. These and other matches were discovered using a personal computer of the type available in most laboratories collecting DNA sequence data. As relatively few sequences were sampled to find these matches, it is likely that much of the existing data has not been adequately examined.Keywords
This publication has 56 references indexed in Scilit:
- At least two genes reside within a large intron of the dunce gene of DrosophilaNature, 1987
- Cloning and nucleotide sequence of the gene coding for enzymatically active fragments of the Bacillus polymyxa beta-amylaseJournal of Bacteriology, 1987
- Nucleotide sequence of the asd gene of Streptococcus mutans. Identification of the promoter region and evidence for attenuator-like sequences preceding the structural gene.Journal of Biological Chemistry, 1987
- The primary structure and the functional domains of an elongation factor-1 alpha from Mucor racemosus.Journal of Biological Chemistry, 1986
- PseqIP: A nonredundant and exhaustive protein sequence data bank generated from 4 major existing collectionsProteins-Structure Function and Bioinformatics, 1986
- Gene within a gene: Nested Drosophila genes encode unrelated proteins on opposite DNA strandsCell, 1986
- Structural Analysis of a Developmentally Regulated 25-kDa Protein Gene of Sarcophaga peregrina1The Journal of Biochemistry, 1985
- Nucleotide sequence of a functional cDNA for human thymidylate synthaseNucleic Acids Research, 1985
- Nucleotide sequence and transcription of the phenylalanine and tyrosine operons of Escherichia coli K12Journal of Molecular Biology, 1984
- Nucleotide sequence of Saccharomyces cerevisiae genes TRP2 and TRP3 encoding bifunctional anthranilate synthase: indole-3-glycerol phosphate synthase.Journal of Biological Chemistry, 1984