A process for analysis of microarray comparative genomics hybridisation studies for bacterial genomes
Open Access
- 29 January 2008
- journal article
- research article
- Published by Springer Nature in BMC Genomics
- Vol. 9 (1) , 53
- https://doi.org/10.1186/1471-2164-9-53
Abstract
Background: Microarray based comparative genomic hybridisation (CGH) experiments have been used to study numerous biological problems including understanding genome plasticity in pathogenic bacteria. Typically such experiments produce large data sets that are difficult for biologists to handle. Although there are some programmes available for interpretation of bacterial transcriptomics data and CGH microarray data for looking at genetic stability in oncogenes, there are none specifically to understand the mosaic nature of bacterial genomes. Consequently a bottle neck still persists in accurate processing and mathematical analysis of these data. To address this shortfall we have produced a simple and robust CGH microarray data analysis process that may be automated in the future to understand bacterial genomic diversity. Results: The process involves five steps: cleaning, normalisation, estimating gene presence and absence or divergence, validation, and analysis of data from test against three reference strains simultaneously. Each stage of the process is described and we have compared a number of methods available for characterising bacterial genomic diversity, for calculating the cut-off between gene presence and absence or divergence, and shown that a simple dynamic approach using a kernel density estimator performed better than both established, as well as a more sophisticated mixture modelling technique. We have also shown that current methods commonly used for CGH microarray analysis in tumour and cancer cell lines are not appropriate for analysing our data. Conclusion: After carrying out the analysis and validation for three sequenced Escherichia coli strains, CGH microarray data from 19 E. coli O157 pathogenic test strains were used to demonstrate the benefits of applying this simple and robust process to CGH microarray studies using bacterial genomes.Keywords
This publication has 45 references indexed in Scilit:
- Genetic Diversity among Escherichia coli O157:H7 Isolates and Identification of Genes Linked to Human InfectionsInfection and Immunity, 2008
- Subtyping Method for Escherichia coli Shiga Toxin (Verocytotoxin) 2 Variants and Correlations to Clinical ManifestationsJournal of Clinical Microbiology, 2007
- Comparative Genomics of Canadian Epidemic Lineages of Methicillin-Resistant Staphylococcus aureusJournal of Clinical Microbiology, 2007
- Exploiting noise in array CGH data to improve detection of DNA copy number changeNucleic Acids Research, 2007
- NCBI GEO: mining tens of millions of expression profiles--database and tools updateNucleic Acids Research, 2006
- Identification of Core and Variable Components of the Salmonella enterica Subspecies I Genome by MicroarrayInfection and Immunity, 2005
- Mechanisms of, and Barriers to, Horizontal Gene Transfer between BacteriaNature Reviews Microbiology, 2005
- Array comparative genomic hybridization and its applications in cancerNature Genetics, 2005
- Genome rearrangement by replication-directed translocationNature Genetics, 2000
- Evolution by acquisition: the case for horizontal gene transfersTrends in Biochemical Sciences, 1992