Functional associations of proteins in entire genomes by means of exhaustive detection of gene fusions
Open Access
- 28 August 2001
- journal article
- research article
- Published by Springer Nature in Genome Biology
- Vol. 2 (9) , 1-7
- https://doi.org/10.1186/gb-2001-2-9-research0034
Abstract
It has recently been shown that the detection of gene fusion events across genomes can be used for predicting functional associations of proteins, including physical interaction or complex formation. To obtain such predictions we have made an exhaustive search for gene fusion events within 24 available completely sequenced genomes. Each genome was used as a query against the remaining 23 complete genomes to detect gene fusion events. Using an improved, fully automatic protocol, a total of 7,224 single-domain proteins that are components of gene fusions in other genomes were detected, many of which were identified for the first time. The total number of predicted pairwise functional associations is 39,730 for all genomes. Component pairs were identified by virtue of their similarity to 2,365 multidomain composite proteins. We also show for the first time that gene fusion is a complex evolutionary process with a number of contributory factors, including paralogy, genome size and phylogenetic distance. On average, 9% of genes in a given genome appear to code for single-domain, component proteins predicted to be functionally associated. These proteins are detected by an additional 4% of genes that code for fused, composite proteins. These results provide an exhaustive set of functionally associated genes and also delineate the power of fusion analysis for the prediction of protein interactions.Keywords
This publication has 24 references indexed in Scilit:
- CAST: an iterative algorithm for the complexity analysis of sequence tractsBioinformatics, 2000
- Novel Selenoproteins Identified in Silico andin Vivo by Using a Conserved RNA Structural MotifJournal of Biological Chemistry, 1999
- Genomes OnLine Database (GOLD 1.0): a monitor of complete and ongoing genome projects world-wideBioinformatics, 1999
- Detecting Protein Function and Protein-Protein Interactions from Genome SequencesScience, 1999
- The Transcriptional Program of Sporulation in Budding YeastScience, 1998
- Exploring the Metabolic and Genetic Control of Gene Expression on a Genomic ScaleScience, 1997
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- The RNA-binding Site of Bacteriophage Qβ Coat ProteinPublished by Elsevier ,1996
- Review: The Cct eukaryotic chaperonin subunits of Saccharomyces cerevisiae and other yeastsYeast, 1996
- Requirement for the Carboxyl-terminal Domain of Saccharomyces cerevisiae Carbamoyl-phosphate SynthetaseJournal of Biological Chemistry, 1996