Data Mining in Pharmacovigilance
- 1 January 2005
- journal article
- Published by Springer Nature in Drug Safety
- Vol. 28 (10) , 835-842
- https://doi.org/10.2165/00002018-200528100-00001
Abstract
Data mining is receiving considerable attention as a tool for pharmacovigilance and is generating many perspectives on its uses. This paper presents four concepts that have appeared in various professional venues and represent potential sources of misunderstanding and/or entail extended discussions: (i) data mining algorithms are unvalidated; (ii) data mining algorithms allow data miners to objectively screen spontaneous report data; (iii) mathematically more complex Bayesian algorithms are superior to frequentist algorithms; and (iv) data mining algorithms are not just for hypothesis generation. Key points for a balanced perspective are that: (i) validation exercises have been done but lack a gold standard for comparison and are complicated by numerous nuances and pitfalls in the deployment of data mining algorithms. Their performance is likely to be highly situation dependent; (ii) the subjective nature of data mining is often underappreciated; (iii) simpler data mining models can be supplemented with ‘clinical shrinkage’, preserving sensitivity; and (iv) applications of data mining beyond hypothesis generation are risky, given the limitations of the data. These extended applications tend to ‘creep’, not pounce, into the public domain, leading to potential overconfidence in their results. Most importantly, in the enthusiasm generated by the promise of data mining tools, users must keep in mind the limitations of the data and the importance of clinical judgment and context, regardless of statistical arithmetic. In conclusion, we agree that contemporary data mining algorithms are promising additions to the pharmacovigilance toolkit, but the level of verification required should be commensurate with the nature and extent of the claimed applications.Keywords
This publication has 19 references indexed in Scilit:
- A challenge to the data minersPharmacoepidemiology and Drug Safety, 2004
- Expecting the Unexpected — Drug Safety, Pharmacovigilance, and the Prepared MindNew England Journal of Medicine, 2004
- Drug‐induced pancreatitis: lessons in data miningBritish Journal of Clinical Pharmacology, 2004
- Safety Related Drug-Labelling ChangesDrug Safety, 2004
- Disproportionality analysis using empirical Bayes data mining: a tool for the evaluation of drug interactions in the post‐marketing settingPharmacoepidemiology and Drug Safety, 2003
- Application of Quantitative Signal Detection in the Dutch Spontaneous Reporting System for Adverse Drug ReactionsDrug Safety, 2003
- Quantitative Methods in PharmacovigilanceDrug Safety, 2003
- A comparison of measures of disproportionality for signal detection in spontaneous reporting systems for adverse drug reactionsPharmacoepidemiology and Drug Safety, 2002
- Use of proportional reporting ratios (PRRs) for signal generation from spontaneous adverse drug reaction reportsPharmacoepidemiology and Drug Safety, 2001
- A Retrospective Evaluation of a Data Mining Approach to Aid Finding New Adverse Drug Reaction Signals in the WHO International DatabaseDrug Safety, 2000