Mixture Models as a Method to Find Present and Divergent Genes in Comparative Genomic Hybridization Studies on Bacteria
- 29 March 2007
- journal article
- research article
- Published by Wiley in Biometrical Journal
- Vol. 49 (2) , 242-258
- https://doi.org/10.1002/bimj.200510286
Abstract
Comparative genomic hybridization (CGH) using microarrays is performed on bacteria in order to test for genomic diversity within various bacterial species. The microarrays used for CGH are based on the genome of a fully sequenced bacterium strain, denoted reference strain. Labelled DNA fragments from a sample strain of interest and from the reference strain are hybridized to the array. Based on the obtained ratio intensities and the total intensities of the signals, each gene is classified as either present (one copy or multiple copies) or divergent (zero copies).In this paper mixture models with different number of components are tted on different combinations of variables and compared with each other. The study shows that mixture models fitted on both the ratio intensities and the total intensities including the replicates for each gene improve, compared to previously published methods, the results for several of the data sets tested. Some summaries of the data sets are proposed as a guide for the choice of model and the choice of number of components.The models are applied on data from CGH experiments with the bacteriaStaphylococcus aureusandStreptococcus pneumoniae.Keywords
This publication has 28 references indexed in Scilit:
- Prediction of Missing Values in Microarray and Use of Mixed Models to Evaluate the PredictorsStatistical Applications in Genetics and Molecular Biology, 2005
- Statistical methods for detecting genomic alterations through array-based comparative genomic hybridization (CGH)Frontiers in Bioscience-Landmark, 2004
- Alterations of chromosomal copy number during progression of diffuse‐type gastric carcinomas: metaphase‐ and array‐based comparative genomic hybridization analyses of multiple samples from individual tumoursThe Journal of Pathology, 2003
- Use of Mixture Models in a Microarray-Based Screening Procedure for Detecting Differentially Represented Yeast MutantsStatistical Applications in Genetics and Molecular Biology, 2003
- Microarray analysis reveals a major direct role of DNA copy number alteration in the transcriptional program of human breast tumorsProceedings of the National Academy of Sciences, 2002
- Comparison of Genetic Divergence and Fitness between Two Subclones of Helicobacter pyloriInfection and Immunity, 2001
- Importance of replication in microarray gene expression studies: Statistical methods and evidence from repetitive cDNA hybridizationsProceedings of the National Academy of Sciences, 2000
- Evaluation of alternative spectral feature extraction methods of textural images for multivariate modellingJournal of Chemometrics, 1998
- Measuring the Accuracy of Diagnostic SystemsScience, 1988
- Robust Locally Weighted Regression and Smoothing ScatterplotsJournal of the American Statistical Association, 1979