A method to locate protein coding sequences in DNA of prokaryotic systems
Open Access
- 1 January 1985
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 13 (1) , 185-194
- https://doi.org/10.1093/nar/13.1.185
Abstract
CDNA sequence data from E.coli phages, for which complete genome sequences are known, have been analysed. From this analysis thirteen triplets have been identified as markers to distinguish protein-coding frames from fortuitous open reading frames. The region of −18 to + 18 nucleotides around ATG/GTG, has been analysed and used to identify initiator codons from internal ATG/GTG. With the aid of criteria defined above a method has been developed to locate protein coding sequences by a combination of ‘gene search by signal’ and ‘gene search by content’ approaches. Application of this method to prokaryotic systems including those which were not part of our data base indicates that it is quite accurate and general in nature.Keywords
This publication has 14 references indexed in Scilit:
- DNA binding spectrum of the carcinogen N-acetoxy-N-2-acetylaminofluorene significantly differs from the mutation spectrumJournal of Molecular Biology, 1984
- Computer methods to locate signals in nucleic acid sequencesNucleic Acids Research, 1984
- A Markov analysis of DNA sequencesJournal of Theoretical Biology, 1983
- Recognition of protein coding regions in DNA sequencesNucleic Acids Research, 1982
- Codon preference and its use in identifying protein coding regions in long DNA sequencesNucleic Acids Research, 1982
- Translational Initiation in ProkaryotesAnnual Review of Microbiology, 1981
- Method to determine the reading frame of a protein from the purine/pyrimidine genome sequence and its possible evolutionary justification.Proceedings of the National Academy of Sciences, 1981
- The ribosome binding sites recognized by E. coli ribosomes have regions with signal character in both the leader and protein coding segmentsNucleic Acids Research, 1980
- Is UAA or UGA part of the recognition signal for ribosomal initiation?Nucleic Acids Research, 1979
- The 3′-Terminal Sequence of Escherichia coli 16S Ribosomal RNA: Complementarity to Nonsense Triplets and Ribosome Binding SitesProceedings of the National Academy of Sciences, 1974