Significance analysis of lexical bias in microarray data
Open Access
- 3 April 2003
- journal article
- research article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 4 (1) , 12
- https://doi.org/10.1186/1471-2105-4-12
Abstract
Genes that are determined to be significantly differentially regulated in microarray analyses often appear to have functional commonalities, such as being components of the same biochemical pathway. This results in certain words being under- or overrepresented in the list of genes. Distinguishing between biologically meaningful trends and artifacts of annotation and analysis procedures is of the utmost importance, as only true biological trends are of interest for further experimentation. A number of sophisticated methods for identification of significant lexical trends are currently available, but these methods are generally too cumbersome for practical use by most microarray users. We have developed a tool, LACK, for calculating the statistical significance of apparent lexical bias in microarray datasets. The frequency of a user-specified list of search terms in a list of genes which are differentially regulated is assessed for statistical significance by comparison to randomly generated datasets. The simplicity of the input files and user interface targets the average microarray user who wishes to have a statistical measure of apparent lexical trends in analyzed datasets without the need for bioinformatics skills. The software is available as Perl source or a Windows executable. We have used LACK in our laboratory to generate biological hypotheses based on our microarray data. We demonstrate the program's utility using an example in which we confirm significant upregulation of SPI-2 pathogenicity island of Salmonella enterica serovar Typhimurium by the cation chelator dipyridyl.Keywords
This publication has 16 references indexed in Scilit:
- Practical Approaches to Analyzing Results of Microarray ExperimentsAmerican Journal of Respiratory Cell and Molecular Biology, 2002
- Whole-genome expression analysis: challenges beyond clusteringCurrent Opinion in Structural Biology, 2001
- Computational analysis of microarray dataNature Reviews Genetics, 2001
- Significance analysis of microarrays applied to the ionizing radiation responseProceedings of the National Academy of Sciences, 2001
- Mining functional information associated with expression arraysFunctional & Integrative Genomics, 2001
- Use of keyword hierarchies to interpret gene expression patternsBioinformatics, 2001
- Nitrogen regulatory protein C-controlled genes of Escherichia coli : Scavenging as a defense against nitrogen limitationProceedings of the National Academy of Sciences, 2000
- BIOPROSPECTOR: DISCOVERING CONSERVED DNA MOTIFS IN UPSTREAM REGULATORY REGIONS OF CO-EXPRESSED GENESPacific Symposium on Biocomputing, 2000
- Systematic determination of genetic network architectureNature Genetics, 1999
- Cluster analysis and display of genome-wide expression patternsProceedings of the National Academy of Sciences, 1998