Pfam: multiple sequence alignments and HMM-profiles of protein domains
- 1 January 1998
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 26 (1), 320-322
- https://doi.org/10.1093/nar/26.1.320
Abstract
Pfam contains multiple alignments and hidden Markov model based profiles (HMM-profiles) of complete protein domains. The definition of domain boundaries, family members and alignment is done semi-automatically based on expert knowledge, sequence similarity, other protein family databases and the ability of HMM-profiles to correctly identify and align the members. Release 2.0 of Pfam contains 527 manually verified families which are available for browsing and on-line searching via the World Wide Web in the UK at http://www.sanger.ac.uk/Pfam/ and in the US at http://genome.wustl.edu/Pfam/. Pfam 2.0 matches one or more domains in 50% of Swissprot-34 sequences, and 25% of a large sample of predicted proteins from the Caenorhabditis elegans genome.
This publication has 10 references indexed in Scilit:
- Hidden Markov models. Published by Elsevier, 2002
- The SWISS-PROT protein sequence data bank and its supplement TrEMBL in 1998. Nucleic Acids Research, 1998
- Superior performance in protein homology detection with the Blocks Database servers. Nucleic Acids Research, 1998
- Pfam: A comprehensive database of protein domain families based on seed alignments. Proteins: Structure, Function, and Bioinformatics, 1997
- The PROSITE database, its status in 1997. Nucleic Acids Research, 1997
- Novel developments with the PRINTS protein fingerprint database. Nucleic Acids Research, 1997
- Connecting protein family resources using the proWeb network. Trends in Biochemical Sciences, 1996
- A Workbench for large-scale sequence homology analysis. Bioinformatics, 1994
- Modular arrangement of proteins as inferred from analysis of homology. Protein Science, 1994
- Hidden Markov Models in Computational Biology. Journal of Molecular Biology, 1994