Application of InterPro for the functional classification of the proteins of fish origin in SWISS-PROT and TrEMBL
- 1 June 2001
- journal article
- research article
- Published by Springer Nature in Journal of Biosciences
- Vol. 26 (2), 277-284
- https://doi.org/10.1007/bf02703652
Abstract
InterPro (http://www.ebi.ac.uk/interpro/) is an integrated documentation resource for protein families, domains and sites, developed initially as a means of rationalizing the complementary efforts of the PROSITE, PRINTS, Pfam and ProDom database projects. It is a useful resource that aids the functional classification of proteins. Almost 90% of the actinopterygii protein sequences from SWISS-PROT and TrEMBL can be classified using InterPro. Over 30% of the actinopterygii protein sequences currently in SWISS-PROT and TrEMBL are of mitochondrial origin, the majority of which belong to the cytochrome b/b6 family. InterPro also gives insights into the domain composition of the classified proteins and has applications in the functional classification of newly determined sequences lacking biochemical characterization, and in comparative genome analysis. A comparison of the actinopterygii protein sequences against the sequences of other eukaryotes confirms the high representation of eukaryotic protein kinase in the organisms studied. The comparisons also show that, based on InterPro families, the trans-species evolution of MHC class I and II molecules in mammals and teleost fish can be recognized.