Privacy-Maintaining Propensity Score-Based Pooling of Multiple Databases Applied to a Study of Biologics
- 1 June 2010
- journal article
- research article
- Published by Wolters Kluwer Health in Medical Care
- Vol. 48 (6) , S83-S89
- https://doi.org/10.1097/mlr.0b013e3181d59541
Abstract
Introduction: A large study on the safety of biologics required pooling of data from multiple data sources, but while extensive confounder adjustment was necessary, private, individual-level covariate information could not be shared. Objectives: To describe the methods of pooling data that investigators considered, and to detail the strengths and limitations of the chosen method: a propensity score (PS)-based approach that allowed for full multivariate adjustment without compromising patient privacy. Research Design: The project had a central data coordinating center responsible for collection and analysis of data. Private data could not be transmitted to the data coordinating center. Investigators assessed 4 methods for pooled analyses: full covariate sharing, cell-aggregated sharing, meta-analysis, and the PS-based method. We evaluated each method for protection of private information, analytic integrity and flexibility, and ability to meet the study's operational and statistical needs. Results: Analysis of 4 example datasets yielded substantially similar estimates if data were pooled with a PS versus individual covariates (0%–3% difference in point estimates). Several practical challenges arose. (1) PSs are best suited for dichotomous exposures but 6 or more exposure categories were desired; we chose a series of exposure contrasts with a common referent group. (2) Subgroup analyses had to be specified a priori. (3) Time-varying exposures and confounders required appropriate analytic handling including re-estimation of PSs. (4) Detection of heterogeneity among centers was necessary. Conclusions: The PS-based pooling method offered strong protection of patient privacy and a reasonable balance between analytic integrity and flexibility of study execution. We would recommend its use in other studies that require pooling of databases, multivariate adjustment, and privacy protection.Keywords
This publication has 20 references indexed in Scilit:
- Privacy-Maintaining Propensity Score-Based Pooling of Multiple Databases Applied to a Study of BiologicsMedical Care, 2010
- Multivariate-adjusted pharmacoepidemiologic analyses of confidential information pooled from multiple health care utilization databasesPharmacoepidemiology and Drug Safety, 2010
- Cardiovascular Outcomes and Mortality in Patients Using Clopidogrel With Proton Pump Inhibitors After Percutaneous Coronary Intervention or Acute Coronary SyndromeCirculation, 2009
- Confounder summary scores when comparing the effects of multiple drug exposuresPharmacoepidemiology and Drug Safety, 2009
- High-dimensional Propensity Score Adjustment in Studies of Treatment Effects Using Health Care Claims DataEpidemiology, 2009
- A population-based study of the drug interaction between proton pump inhibitors and clopidogrelCMAJ : Canadian Medical Association Journal, 2009
- American College of Rheumatology 2008 recommendations for the use of nonbiologic and biologic disease‐modifying antirheumatic drugs in rheumatoid arthritisArthritis Care & Research, 2008
- Indications for Propensity Scores and Review of their Use in PharmacoepidemiologyBasic & Clinical Pharmacology & Toxicology, 2006
- Meta-analysis in clinical trialsControlled Clinical Trials, 1986
- The central role of the propensity score in observational studies for causal effectsBiometrika, 1983