iProClass: an integrated database of protein family, function and structure information
Open Access
- 1 January 2003
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 31 (1) , 390-392
- https://doi.org/10.1093/nar/gkg044
Abstract
The i ProClass database provides comprehensive, value-added descriptions of proteins and serves as a framework for data integration in a distributed networking environment. The protein information in i ProClass includes family relationships as well as structural and functional classifications and features. The current version consists of about 830 000 non-redundant PIR-PSD, SWISS-PROT, and TrEMBL proteins organized with more than 36 000 PIR superfamilies, 145 000 families, 4000 domains, 1300 motifs and 550 000 FASTA similarity clusters. It provides rich links to over 50 database of protein sequences, families, functions and pathways, protein–protein interactions, post-translational modifications, protein expressions, structures and structural classifications, genes and genomes, ontologies, literature and taxonomy. Protein and superfamily summary reports present extensive annotation information and include membership statistics and graphical display of domains and motifs. i ProClass employs an open and modular architecture for interoperability and scalability. It is implemented in the Oracle object-relational database system and is updated biweekly. The database is freely accessible from the web site at http://pir.georgetown.edu/iproclass/ and searchable by sequence or text string. The data integration in i ProClass supports exploration of protein relationships. Such knowledge is fundamental to the understanding of protein evolution, structure and function and crucial to functional genomic and proteomic research.Keywords
This publication has 8 references indexed in Scilit:
- The Protein Information Resource: an integrated public resource of functional annotation of proteinsNucleic Acids Research, 2002
- The Pfam Protein Families DatabaseNucleic Acids Research, 2002
- The PROSITE database, its status in 2002Nucleic Acids Research, 2002
- iProClass: an integrated, comprehensive and annotated protein classification databaseNucleic Acids Research, 2001
- The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000Nucleic Acids Research, 2000
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Superfamily classification in PIR-international protein sequence databasePublished by Elsevier ,1996
- Improved tools for biological sequence comparison.Proceedings of the National Academy of Sciences, 1988