A Statistical Model for Investigating Binding Probabilities of DNA Nucleotide Sequences Using Microarrays
- 1 December 2002
- journal article
- Published by Oxford University Press (OUP) in Biometrics
- Vol. 58 (4) , 981-988
- https://doi.org/10.1111/j.0006-341x.2002.00981.x
Abstract
Summary. There is considerable scientific interest in knowing the probability that a site‐specific transcription factor will bind to a given DNA sequence. Microarray methods provide an effective means for assessing the binding affinities of a large number of DNA sequences as demonstrated by Bulyk et al. (2001, Proceedings of the National Academy of Sciences, USA98, 7158–7163) in their study of the DNA‐binding specificities of Zif268 zinc fingers using microarray technology. In a follow‐up investigation, Bulyk, Johnson, and Church (2002, Nucleic Acid Research30, 1255–1261) studied the interdependence of nucleotides on the binding affinities of transcription proteins. Our article is motivated by this pair of studies. We present a general statistical methodology for analyzing microarray intensity measurements reflecting DNA‐protein interactions. The log probability of a protein binding to a DNA sequence on an array is modeled using a linear ANOVA model. This model is convenient because it employs familiar statistical concepts and procedures and also because it is effective for investigating the probability structure of the binding mechanism.Keywords
This publication has 9 references indexed in Scilit:
- Nucleotides of transcription factor binding sites exert interdependent effects on the binding affinities of transcription factorsNucleic Acids Research, 2002
- Non-independence of Mnt repressor-operator interaction determined by a new quantitative multiple fluorescence relative affinity (QuMFRA) assayNucleic Acids Research, 2001
- Exploring the DNA-binding specificities of zinc fingers with DNA microarraysProceedings of the National Academy of Sciences, 2001
- Importance of replication in microarray gene expression studies: Statistical methods and evidence from repetitive cDNA hybridizationsProceedings of the National Academy of Sciences, 2000
- Oligonucleotide frequency matrices addressed to recognizing functional DNA sites.Bioinformatics, 1999
- Matlnd and Matlnspector: new fast and versatile tools for detection of consensus matches in nucleotide sequence dataNucleic Acids Research, 1995
- Toward a code for the interactions of zinc fingers with DNA: selection of randomized fingers displayed on phage.Proceedings of the National Academy of Sciences, 1994
- Generalized Linear ModelsPublished by Springer Nature ,1989
- Quantitative analysis of the relationship between nucleotide sequence and functional activityNucleic Acids Research, 1986