Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features
Open Access
- 16 August 2016
- journal article
- research article
- Published by Springer Nature in Nature Communications
- Vol. 7 (1), 12474
- https://doi.org/10.1038/ncomms12474
Abstract
Lung cancer is the most prevalent cancer worldwide, and histopathological assessment is indispensable for its diagnosis. However, human evaluation of pathology slides cannot accurately predict patients’ prognoses. In this study, we obtain 2,186 haematoxylin and eosin stained histopathology whole-slide images of lung adenocarcinoma and squamous cell carcinoma patients from The Cancer Genome Atlas (TCGA), and 294 additional images from Stanford Tissue Microarray (TMA) Database. We extract 9,879 quantitative image features and use regularized machine-learning methods to select the top features and to distinguish shorter-term survivors from longer-term survivors with stage I adenocarcinoma (P<0.003) or squamous cell carcinoma (P=0.023) in the TCGA data set. We validate the survival prediction framework with the TMA cohort (P<0.036 for both tumour types). Our results suggest that automatically derived image features can predict the prognosis of lung cancer patients and thereby contribute to precision oncology. Our methods are extensible to histopathology images of other organs.
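
The abstract mentions regularized machine-learning methods that select top features and separate shorter-term from longer-term survivors, but it does not say which methods were used. As a rough illustration of the general idea only, the sketch below fits an L1-penalised logistic regression by proximal gradient descent on a small synthetic cohort. Everything here is an assumption made for illustration: the function names (`soft_threshold`, `fit_l1_logistic`, `make_synthetic_cohort`), the penalty and step values, and the synthetic data. It is not the authors' pipeline and uses no TCGA or TMA data.

```python
import math
import random


def soft_threshold(value, threshold):
    """Shrink value toward zero; anything within the threshold becomes exactly 0."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_l1_logistic(X, y, penalty=0.1, step=0.5, iterations=500):
    """Proximal gradient descent for L1-penalised logistic regression (no intercept)."""
    n, p = len(X), len(X[0])
    weights = [0.0] * p
    for _ in range(iterations):
        # Gradient of the mean logistic loss with respect to each weight.
        grad = [0.0] * p
        for row, label in zip(X, y):
            residual = sigmoid(sum(w * x for w, x in zip(weights, row))) - label
            for j in range(p):
                grad[j] += residual * row[j] / n
        # Gradient step followed by the L1 proximal (soft-thresholding) step.
        weights = [soft_threshold(w - step * g, step * penalty)
                   for w, g in zip(weights, grad)]
    return weights


def make_synthetic_cohort(n_samples=200, n_features=6, seed=0):
    """Hypothetical data: only features 0 and 1 influence the binary label."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n_samples):
        row = [rng.gauss(0, 1) for _ in range(n_features)]
        score = row[0] - row[1] + 0.3 * rng.gauss(0, 1)
        X.append(row)
        y.append(1 if score > 0 else 0)  # 1 = "longer-term survivor" (synthetic)
    return X, y


if __name__ == "__main__":
    X, y = make_synthetic_cohort()
    weights = fit_l1_logistic(X, y)
    selected = [j for j, w in enumerate(weights) if w != 0.0]
    print("weights:", [round(w, 3) for w in weights])
    print("selected feature indices:", selected)
```

The L1 penalty is what performs the selection: weights whose gradient signal stays below the penalty are set exactly to zero, so only a subset of the input features remains in the fitted model. With thousands of image features, as in the study, a library implementation would be the practical choice.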