Automated gating of flow cytometry data via robust model‐based clustering
Open Access
- 28 February 2008
- journal article
- conference paper
- Published by Wiley in Cytometry Part A
- Vol. 73A (4) , 321-332
- https://doi.org/10.1002/cyto.a.20531
Abstract
The capability of flow cytometry to offer rapid quantification of multidimensional characteristics for millions of cells has made this technology indispensable for health research, medical diagnosis, and treatment. However, the lack of statistical and bioinformatics tools to parallel recent high‐throughput technological advancements has hindered this technology from reaching its full potential. We propose a flexible statistical model‐based clustering approach for identifying cell populations in flow cytometry data based on t‐mixture models with a Box–Cox transformation. This approach generalizes the popular Gaussian mixture models to account for outliers and allow for nonelliptical clusters. We describe an Expectation‐Maximization (EM) algorithm to simultaneously handle parameter estimation and transformation selection. Using two publicly available datasets, we demonstrate that our proposed methodology provides enough flexibility and robustness to mimic manual gating results performed by an expert researcher. In addition, we present results from a simulation study, which show that this new clustering framework gives better results in terms of robustness to model misspecification and estimation of the number of clusters, compared to the popular mixture models. The proposed clustering methodology is well adapted to automated analysis of flow cytometry data. It tends to give more reproducible results, and helps reduce the significant subjectivity and human time cost encountered in manual gating analysis. © 2008 International Society for Analytical Cytology
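The two ingredients named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes the textbook Box–Cox form for positive data (the paper's exact variant may differ) and the standard E-step weight for a multivariate t component, (ν + p) / (ν + d²), where d² is the squared Mahalanobis distance. The function names `box_cox` and `t_weights` are hypothetical.

```python
import numpy as np


def box_cox(y, lam):
    """Textbook Box-Cox transform for positive data (assumed form).

    lam == 0 gives the log transform; otherwise (y**lam - 1) / lam.
    """
    y = np.asarray(y, dtype=float)
    if lam == 0:
        return np.log(y)
    return (y**lam - 1.0) / lam


def t_weights(x, mu, sigma, nu):
    """Standard E-step weights for one multivariate t component.

    Returns (nu + p) / (nu + d2) per row of x, where d2 is the squared
    Mahalanobis distance to mu under covariance sigma. Points far from
    the component centre get small weights, which is what limits the
    influence of outliers on the parameter updates.
    """
    x = np.atleast_2d(np.asarray(x, dtype=float))
    p = x.shape[1]
    diff = x - np.asarray(mu, dtype=float)
    d2 = np.einsum("ij,jk,ik->i", diff, np.linalg.inv(sigma), diff)
    return (nu + p) / (nu + d2)


# Hypothetical usage: one point at the centre, one far away.
points = np.array([[0.0, 0.0], [10.0, 10.0]])
weights = t_weights(points, mu=np.zeros(2), sigma=np.eye(2), nu=4.0)
```

In a Gaussian mixture every point would enter the mean and covariance updates with weight 1; here the distant point is down-weighted, which is the mechanism behind the robustness the abstract describes.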