SGP-1: Prediction and Validation of Homologous Genes Based on Sequence Alignments
Open Access
- 1 September 2001
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 11 (9) , 1574-1583
- https://doi.org/10.1101/gr.177401
Abstract
Conventional methods of gene prediction rely on the recognition of DNA-sequence signals, the coding potential or the comparison of a genomic sequence with a cDNA, EST, or protein database. Reasons for limited accuracy in many circumstances are species-specific training and the incompleteness of reference databases. Lately, comparative genome analysis has attracted increasing attention. Several analysis tools that are based on human/mouse comparisons are already available. Here, we present a program for the prediction of protein-coding genes, termed SGP-1 (Syntenic Gene Prediction), which is based on the similarity of homologous genomic sequences. In contrast to most existing tools, the accuracy of SGP-1 depends little on species-specific properties such as codon usage or the nucleotide distribution. SGP-1 may therefore be applied to nonstandard model organisms in vertebrates as well as in plants, without the need for extensive parameter training. In addition to predicting genes in large-scale genomic sequences, the program may be useful to validate gene structure annotations from databases. To this end, SGP-1 output also contains comparisons between predicted and annotated gene structures in HTML format. The program can be accessed via a Web server at http://soft.ice.mpg.de/sgp-1. The source code, written in ANSI C, is available on request from the authors.Keywords
This publication has 29 references indexed in Scilit:
- Extensive Duplication and Reshuffling in the Arabidopsis GenomePlant Cell, 2000
- The Complete Sequence of 340 kb of DNA around the Rice Adh1–Adh2 Region Reveals Interrupted Colinearity with Maize Chromosome 4Plant Cell, 2000
- Prediction of complete gene structures in human genomic DNAJournal of Molecular Biology, 1997
- Evaluation of Gene Structure Prediction ProgramsGenomics, 1996
- DNA sequence evolution: the sounds of silencePhilosophical Transactions Of The Royal Society B-Biological Sciences, 1995
- A time-efficient, linear-space local similarity algorithmAdvances in Applied Mathematics, 1991
- Basic Local Alignment Search ToolJournal of Molecular Biology, 1990
- Basic local alignment search toolJournal of Molecular Biology, 1990
- Rates of DNA Sequence Evolution Differ Between Taxonomic GroupsScience, 1986
- A general method applicable to the search for similarities in the amino acid sequence of two proteinsJournal of Molecular Biology, 1970