EGassembler: online bioinformatics service for large-scale processing, clustering and assembling ESTs and genomic DNA fragments
Open Access
- 1 July 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 34 (Web Server) , W459-W462
- https://doi.org/10.1093/nar/gkl066
Abstract
Expressed sequence tag (EST) sequencing has proven to be an economically feasible alternative for gene discovery in species lacking a draft genome sequence. Ongoing large-scale EST sequencing projects feel the need for bioinformatics tools to facilitate uniform EST handling. This brings about a renewed importance for a universal tool for processing and functional annotation of large sets of ESTs. EGassembler ( http://egassembler.hgc.jp/ ) is a web server, which provides an automated as well as a user-customized analysis tool for cleaning, repeat masking, vector trimming, organelle masking, clustering and assembling of ESTs and genomic fragments. The web server is publicly available and provides the community a unique all-in-one online application web service for large-scale ESTs and genomic DNA clustering and assembling. Running on a Sun Fire 15K supercomputer, a significantly large volume of data can be processed in a short period of time. The results can be used to functionally annotate genes, to facilitate splice alignment analysis, to link the transcripts to genetic and physical maps, design microarray chips, to perform transcriptome analysis and to map to KEGG metabolic pathways. The service provides an excellent bioinformatics tool to research groups in wet-lab as well as an all-in-one-tool for sequence handling to bioinformatics researchers.Keywords
This publication has 14 references indexed in Scilit:
- The TIGR Gene Indices: clustering and assembling EST and known genes and integration with eukaryotic genomesNucleic Acids Research, 2004
- The TIGR Plant Repeat Databases: a collective resource for the identification of repetitive sequences in plantsNucleic Acids Research, 2004
- Database resources of the National Center for BiotechnologyNucleic Acids Research, 2003
- STACK: Sequence Tag Alignment and Consensus KnowledgebaseNucleic Acids Research, 2001
- Repbase Update: a database and an electronic journal of repetitive elementsTrends in Genetics, 2000
- Shotgun sequencing of the human transcriptome with ORF expressed sequence tagsProceedings of the National Academy of Sciences, 2000
- CAP3: A DNA Sequence Assembly ProgramGenome Research, 1999
- Base-Calling of Automated Sequencer Traces UsingPhred. I. Accuracy AssessmentGenome Research, 1998
- Generation and analysis of 280,000 human expressed sequence tags.Genome Research, 1996
- Complementary DNA Sequencing: Expressed Sequence Tags and Human Genome ProjectScience, 1991