Statistical mixture modeling for cell subtype identification in flow cytometry
Open Access
- 21 May 2008
- journal article
- research article
- Published by Wiley in Cytometry Part A
- Vol. 73A (8) , 693-701
- https://doi.org/10.1002/cyto.a.20583
Abstract
Statistical mixture modeling provides an opportunity for automated identification and resolution of cell subtypes in flow cytometric data. The configuration of cells as represented by multiple markers simultaneously can be modeled arbitrarily well as a mixture of Gaussian distributions in the dimension of the number of markers. Cellular subtypes may be related to one or multiple components of such mixtures, and fitted mixture models can be evaluated in the full set of markers as an alternative, or adjunct, to traditional subjective gating methods that rely on choosing one or two dimensions. Four‐color flow data from human blood cells labeled with FITC‐conjugated anti‐CD3, PE‐conjugated anti‐CD8, PE‐Cy5‐conjugated anti‐CD4, and APC‐conjugated anti‐CD19 Abs were acquired on a FACSCalibur. Cells from four murine cell lines, JAWS II, RAW 264.7, CTLL‐2, and A20, were also stained with FITC‐conjugated anti‐CD11c, PE‐conjugated anti‐CD11b, PE‐Cy5‐conjugated anti‐CD8a, and PE‐Cy7‐conjugated anti‐CD45R/B220 Abs, respectively, and single‐color flow data were collected on an LSRII. The data were fitted with a mixture of multivariate Gaussians using standard Bayesian statistical approaches and Markov chain Monte Carlo computations. Statistical mixture models were able to identify and purify major cell subsets in human peripheral blood, using an automated process that can be generalized to an arbitrary number of markers. Validation against both traditional expert gating and synthetic mixtures of murine cell lines with known mixing proportions was also performed. This article describes studies of statistical mixture modeling of flow cytometric data and demonstrates their utility in examples with four‐color flow data from human peripheral blood samples and synthetic mixtures of murine cell lines. © 2008 International Society for Advancement of Cytometry
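As a rough illustration of how a fitted mixture can be evaluated in the full set of markers, the sketch below computes, for a single event, the posterior probability of membership in each mixture component. It is not the authors' implementation: it assumes the mixture parameters have already been estimated (the abstract describes Bayesian fitting by Markov chain Monte Carlo, which is not shown here), it simplifies each component to a diagonal covariance, and the function names and all numeric values are hypothetical.

```python
import math


def log_gaussian_diag(x, mean, var):
    """Log density of a Gaussian with diagonal covariance at point x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def responsibilities(x, weights, means, variances):
    """Posterior probability that event x belongs to each mixture component."""
    log_terms = [
        math.log(w) + log_gaussian_diag(x, m, v)
        for w, m, v in zip(weights, means, variances)
    ]
    # Subtract the maximum before exponentiating for numerical stability.
    top = max(log_terms)
    unnorm = [math.exp(t - top) for t in log_terms]
    total = sum(unnorm)
    return [u / total for u in unnorm]


# Hypothetical two-component mixture over four markers (illustrative values only).
weights = [0.6, 0.4]
means = [[2.0, 0.5, 3.0, 0.2], [0.3, 2.5, 0.4, 3.1]]
variances = [[0.2, 0.1, 0.3, 0.1], [0.1, 0.3, 0.1, 0.2]]

# One event measured on the same four markers.
event = [1.9, 0.6, 2.8, 0.3]
probs = responsibilities(event, weights, means, variances)

# Assign the event to the component with the highest posterior probability.
assigned = max(range(len(probs)), key=probs.__getitem__)
print(assigned, [round(p, 3) for p in probs])
```

Because the assignment uses all four markers at once, it does not depend on choosing one or two dimensions, as a manual gate would. A cell subtype that spans several components could be handled by summing the probabilities of those components.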