Influence of model‐building strategies on the results of a case‐control study
- 30 July 1993
- journal article
- Published by Wiley in Statistics in Medicine
- Vol. 12 (14) , 1325-1338
- https://doi.org/10.1002/sim.4780121405
Abstract
We evaluate the analysis of a case‐control study in which many variables were investigated simultaneously. The purpose of the study was to explore some rather unspecific hypotheses about potential risk factors for adult brain tumour. Our aim is to show that in the analysis of case‐control studies many decisions are necessary which are usually not published in detail. As in most studies these decisions are made during analysis and are data dependent. We demonstrate that the data allow sensible alternative decisions which influence the final results. A sensitivity analysis of several aspects of the analysis such as different measurement scales, variable selection, handling of missing values and interactions was performed, and demonstrated variation in the results based on the strategy for analysis. We conclude that details of the final analysis should be decided in the planning phase of a case‐control study, and that more details of model‐building strategies must be published. Results from a study where the analysis is highly data dependent must be interpreted with caution and validation of the results with new studies is essential.Keywords
This publication has 26 references indexed in Scilit:
- A bootstrap resampling procedure for model building: Application to the cox regression modelStatistics in Medicine, 1992
- Survival analysis 1982–1991: The second decade of the proportional hazards regression modelStatistics in Medicine, 1991
- Efficient evaluation of treatment effects in the presence of missing covariate valuesStatistics in Medicine, 1990
- Measures of explained variation for survival dataStatistics in Medicine, 1990
- Effects of mismodelling and mismeasuring explanatory variables on tests of their association with a response variableStatistics in Medicine, 1988
- Power, sample size and smallest detectable effect determination for multivariate studiesStatistics in Medicine, 1985
- The results of logistic analyses when the variables are highly correlated: An empirical example using diet and CHD incidenceJournal of Chronic Diseases, 1984
- Further aspects of data analysisControlled Clinical Trials, 1983