SWORDS: A statistical tool for analysing large DNA sequences
- 1 February 2002
- journal article
- review article
- Published by Springer Nature in Journal of Biosciences
- Vol. 27 (1) , 1-6
- https://doi.org/10.1007/bf02703678
Abstract
In this article, we present some simple yet effective statistical techniques for analysing and comparing large DNA sequences. These techniques are based on frequency distributions of DNA words in a large sequence, and have been packaged into a software called SWORDS. Using sequences available in public domain databases housed in the Internet, we demonstrate how SWORDS can be conveniently used by molecular biologists and geneticists to unmask biologically important features hidden in large sequences and assess their statistical significance.Keywords
This publication has 37 references indexed in Scilit:
- Large compound Poisson approximations for occurrences of multiple wordsPublished by Institute of Mathematical Statistics ,1999
- PHYLOGENETIC ANALYSIS IN MOLECULAR EVOLUTIONARY GENETICSAnnual Review of Genetics, 1996
- Quartet Puzzling: A Quartet Maximum-Likelihood Method for Reconstructing Tree TopologiesMolecular Biology and Evolution, 1996
- Over- and Underrepresentation of Short DNA Words in Herpesvirus GenomesJournal of Computational Biology, 1996
- Estimation of Confidence in Phylogeny: The Complete-and-Partial Bootstrap TechniqueMolecular Phylogenetics and Evolution, 1995
- Exceptional Motifs in Different Markov Chain Models for a Statistical Analysis of DNA SequencesJournal of Computational Biology, 1995
- COMPUTATIONAL DNA SEQUENCE ANALYSISAnnual Review of Microbiology, 1994
- PHYLOGENIES FROM MOLECULAR SEQUENCES: INFERENCE AND RELIABILITYAnnual Review of Genetics, 1988
- Statistical Inference of PhylogeniesJournal of the Royal Statistical Society. Series A (General), 1983
- Some indications for inverse DNA duplicationJournal of Theoretical Biology, 1982