AutoFACT: An Automatic Functional Annotation and Classification Tool
Open Access
- 16 June 2005
- journal article
- research article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 6 (1), 151
- https://doi.org/10.1186/1471-2105-6-151
Abstract
Background: Assignment of function to new molecular sequence data is an essential step in genomics projects. The usual process involves similarity searches of a given sequence against one or more databases, an arduous process for large datasets.

Results: We present AutoFACT, a fully automated and customizable annotation tool that assigns biologically informative functions to a sequence. Key features of this tool are that it (1) analyzes nucleotide and protein sequence data; (2) determines the most informative functional description by combining multiple BLAST reports from several user-selected databases; (3) assigns putative metabolic pathways, functional classes, enzyme classes, GeneOntology terms and locus names; and (4) generates output in HTML, text and GFF formats for the user's convenience. We have compared AutoFACT to four well-established annotation pipelines. The error rate of functional annotation is estimated to be between 1% and 2%. Comparison of AutoFACT to the traditional top-BLAST-hit annotation method shows that our procedure increases the number of functionally informative annotations by approximately 50%.

Conclusion: AutoFACT will serve as a useful annotation tool for smaller sequencing groups lacking dedicated bioinformatics staff. It is implemented in Perl and runs on Linux/UNIX platforms. AutoFACT is available at http://megasun.bch.umontreal.ca/Software/AutoFACT.htm.
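The difference between top-BLAST-hit annotation and combining reports from several databases can be illustrated with a minimal sketch. This is not AutoFACT's actual algorithm or code (the tool itself is written in Perl). The `Hit` record, the list of uninformative terms, the E-value cutoff, and the database names below are all assumptions made for illustration only.

```python
from dataclasses import dataclass
from typing import List, Optional

# Hypothetical terms treated as uninformative; not AutoFACT's actual list.
UNINFORMATIVE_TERMS = ("hypothetical", "unknown", "unnamed", "uncharacterized")


@dataclass
class Hit:
    database: str      # name of the database the hit came from (placeholder names)
    description: str   # free-text description line of the matched sequence
    evalue: float      # BLAST E-value; smaller means a stronger match


def is_informative(description: str) -> bool:
    """Return True if the description contains none of the uninformative terms."""
    text = description.lower()
    return not any(term in text for term in UNINFORMATIVE_TERMS)


def top_hit_annotation(hits: List[Hit]) -> Optional[str]:
    """Baseline: take the description of the single best hit, informative or not."""
    if not hits:
        return None
    return min(hits, key=lambda h: h.evalue).description


def combined_annotation(
    hits: List[Hit],
    database_order: List[str],
    max_evalue: float = 1e-5,  # assumed cutoff, chosen only for this example
) -> Optional[str]:
    """Walk the databases in the user's priority order and return the first
    informative description among hits that pass the E-value cutoff."""
    for database in database_order:
        candidates = sorted(
            (h for h in hits if h.database == database and h.evalue <= max_evalue),
            key=lambda h: h.evalue,
        )
        for hit in candidates:
            if is_informative(hit.description):
                return hit.description
    return None


# Made-up example data for one query sequence.
example_hits = [
    Hit("nr", "hypothetical protein XP_0001", 1e-80),
    Hit("uniref90", "Aminopeptidase B", 1e-60),
    Hit("kegg", "leukotriene A4 hydrolase", 1e-20),
]

print(top_hit_annotation(example_hits))
print(combined_annotation(example_hits, ["uniref90", "kegg", "nr"]))
```

In this made-up example the strongest hit overall is a "hypothetical protein", so the top-hit baseline yields an uninformative label, whereas the combined approach returns "Aminopeptidase B" from the first database in the chosen priority order. The priority order is the main design choice in this sketch: reordering `database_order` changes which informative description wins.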