Predictive Modeling of Total Healthcare Costs Using Pharmacy Claims Data
- 1 November 2005
- journal article
- research article
- Published by Wolters Kluwer Health in Medical Care
- Vol. 43 (11) , 1065-1072
- https://doi.org/10.1097/01.mlr.0000182408.54390.00
Abstract
Objective: We sought to evaluate several statistical modeling approaches in predicting prospective total annual health costs (medical plus pharmacy) of health plan participants using Pharmacy Health Dimensions (PHD), a pharmacy claims-based risk index. Methods: We undertook a 2-year (baseline year/follow-up year) longitudinal analysis of integrated medical and pharmacy claims. Included were plan participants younger than 65 years of age with continuous medical and pharmacy coverage (n = 344,832). PHD drug categories, age, gender, and pharmacy costs were derived across the baseline year. Annual total health costs were calculated for each plan participant in follow-up year. Models examined included ordinary least squares (OLS) regression, log-transformed OLS regression with smearing estimator, and 3 two-part models using OLS regression, log-OLS regression with smearing estimator, and generalized linear modeling (GLM), respectively. A 10% random sample was withheld for model validation, which was assessed via adjusted r2, mean absolute prediction error, specificity, and positive predictive value. Results: Most PHD drug categories were significant independent predictors of total costs. Among models tested, the OLS model had the lowest mean absolute prediction error and highest adjusted r2. The log-OLS and 2-part log-OLS models did not predict costs accurately as the result of issues of log-scale heteroscedasticity. The 2-part model using GLM had lower adjusted r2 but similar performance in other assessment measures compared with the OLS or 2-part OLS models. Conclusion: The PHD system derived solely from pharmacy claims data can be used to predict future total health costs. Using PHD with a simple OLS model may provide similar predictive accuracy in comparison to more advanced econometric models.Keywords
This publication has 21 references indexed in Scilit:
- Risk Adjustment Using Automated Ambulatory Pharmacy DataMedical Care, 2003
- Chronic Disease Score as a Predictor of HospitalizationEpidemiology, 2002
- The Medicaid Rx ModelMedical Care, 2001
- Development and Estimation of a Pediatric Chronic Disease Score Using Automated Pharmacy DataMedical Care, 1999
- Pharmacy Costs GroupsMedical Care, 1999
- Development of a Chronic Disease Indicator Score Using a Veterans Affairs Medical Center Medication DatabaseJournal of Clinical Epidemiology, 1999
- A Chronic Disease Score with Empirically Derived WeightsMedical Care, 1995
- Replicating the chronic disease score (CDS) from automated pharmacy dataJournal of Clinical Epidemiology, 1994
- The Role of Insurance Claims Databases in Drug Therapy Outcomes ResearchPharmacoEconomics, 1993
- A chronic disease score from automated pharmacy dataJournal of Clinical Epidemiology, 1992