Modeling major lung resection outcomes using classification trees and multiple imputation techniques
Open Access
- 1 November 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in European Journal of Cardio-Thoracic Surgery
- Vol. 34 (5) , 1085-1089
- https://doi.org/10.1016/j.ejcts.2008.07.037
Abstract
Objective: Modeling of operative risks associated with major lung resection is potentially inaccurate and inefficient because of incomplete observations for predictor variables (covariates). Missing values do not usually occur randomly, potentially introducing an important source of bias in modeling. Deletion of cases with missing data also results in loss of precision. The current study analyzes incomplete variables as potential predictors of outcomes after major lung resection using imputation techniques. Methods: We analyzed major lung resection patients treated from 1980 to 2006 for predictors of pulmonary, cardiovascular, and overall complications, as well as mortality. Predictive variables were initially determined using classification and regression tree (CART) methods. Imputation models were developed and variables with missing values were multiply imputed. We fit a logistic regression model for each outcome using CART variables and any covariates that were of interest clinically. Results: Of 1046 resected patients, serum albumin and diffusing capacity (DLCO%) had a large number of missing values (32% and 13% missing, respectively). Models included 10 covariates for pulmonary complications (p ≪ 0.05 for DLCO% and forced expiratory volume in the first second [FEV1%]), 12 covariates for cardiovascular complications (p ≪ 0.05 for FEV1%, extent of resection, year of operation, and age), 15 covariates for overall complications (p ≪ 0.05 for DLCO%, performance status, serum albumin, and FEV1/FVC ratio), and 12 covariates for death (p ≪ 0.05 for DLCO%, extent of resection, and operation year). Conclusions: We identified serum albumin as a previously under-reported and strong predictor of overall complications. Serum albumin was marginally significantly related to pulmonary and cardiovascular outcomes after major lung surgery. Use of imputation techniques for modeling surgical risks has potential value in identifying important predictive variables that may ordinarily be eliminated from analysis or not identified as predictors because of incomplete observations in clinical databases.Keywords
This publication has 19 references indexed in Scilit:
- SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivationNature Genetics, 2008
- Changes in patient presentation and outcomes for major lung resection over three decades☆European Journal of Cardio-Thoracic Surgery, 2008
- Inflammation and outcome after general thoracic surgeryEuropean Journal of Cardio-Thoracic Surgery, 2007
- A comparison of imputation techniques for handling missing predictor values in a risk model with a binary outcomeStatistical Methods in Medical Research, 2007
- Measured FEV1 in the first postoperative day, and not ppoFEV1, is the best predictor of cardio-respiratory morbidity after lung resection☆European Journal of Cardio-Thoracic Surgery, 2007
- Review: A gentle introduction to imputation of missing valuesPublished by Elsevier ,2006
- Carbon monoxide lung diffusion capacity improves risk stratification in patients without airflow limitation: evidence for systematic measurement before lung resectionEuropean Journal of Cardio-Thoracic Surgery, 2006
- The European Thoracic Surgery Database project: modelling the risk of in-hospital death following lung resectionEuropean Journal of Cardio-Thoracic Surgery, 2005
- Weight loss and low body cell mass in males with lung cancer: relationship with systemic inflammation, acute-phase response, resting energy expenditure, and catabolic and anabolic hormonesClinical Science, 1999
- Optimizing selection of patients for major lung resectionThe Journal of Thoracic and Cardiovascular Surgery, 1995