Analysis and management of data from high-throughput expressed sequence tag projects
- 31 December 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. i, 585-594 vol.1
- https://doi.org/10.1109/hicss.1993.270682
Abstract
The authors have developed an integrated software system for analyzing, managing, and distributing data from high-throughput, steady-state expressed sequence tag (EST) projects. The system employs existing public and commercial software where available. Custom software has been developed and integrated as needed. The system was designed to facilitate sequence analysis on remote servers, complex queries of the data, and interactive browsing by nonexpert users. The design of the system was driven by the requirements of providing functionality with a short development time. The analysis procedures and database structures used in this system are not specific to a laboratory and could be used by an EST project or other project directed toward sequencing and mapping short sequence tags, including genomic sequence-tagged sites or tags generated by random amplification of polymorphic DNA mapping.<>Keywords
This publication has 16 references indexed in Scilit:
- Btab—a Blast output parserBioinformatics, 1992
- Caenorhabditis elegans expressed sequence tags identify gene families and potential disease gene homologuesNature Genetics, 1992
- gm: a tool for exploratory analysis of DNA sequence dataPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1992
- Locating protein-coding regions in human DNA sequences by a multiple sensor-neural network approach.Proceedings of the National Academy of Sciences, 1991
- Searching protein sequence libraries: Comparison of the sensitivity and selectivity of the Smith-Waterman and FASTA algorithmsGenomics, 1991
- Isolation of a large number of novel mammalian genes by a differential cDNA library sreening strategyNucleic Acids Research, 1991
- Basic Local Alignment Search ToolJournal of Molecular Biology, 1990
- Improved tools for biological sequence comparison.Proceedings of the National Academy of Sciences, 1988
- Primary structure of rat cardiac beta-adrenergic and muscarinic cholinergic receptors obtained by automated DNA sequence analysis: further evidence for a multigene family.Proceedings of the National Academy of Sciences, 1987
- A general method applicable to the search for similarities in the amino acid sequence of two proteinsJournal of Molecular Biology, 1970