A review of methods for misclassified categorical data in epidemiology
- 1 September 1989
- journal article
- review article
- Published by Wiley in Statistics in Medicine
- Vol. 8 (9) , 1095-1106
- https://doi.org/10.1002/sim.4780080908
Abstract
Misclassification introduces errors in categorical variables. This paper presents a review of methods for misclassified categorical data in epidemiology. Different sampling schemes for a 2 × 2 × 2 table and methods of analyses will be discussed first. A misclassification matrix is defined, and the usual misclassification models will be shown to be a subclass of log‐linear models. Well‐known results on a 2 × 2 table with misclassification and recent results on a 2 × 2 × 2 table are then reviewed. Finally two methods of adjusting for misclassification will be given. The first method assumes a known misclassification matrix, and the second method uses subsampling to estimate the misclassification matrix. The analysis is based on a recursive system of log‐linear models: first determine a misclassification model, then select a model for the correctly classified variables. The methods are illustrated by data from traffic safety research on the effectiveness of seatbelt use in reducing injuries.
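To make the idea of adjusting with a known misclassification matrix concrete, here is a minimal sketch of the simple matrix-inversion correction for one binary exposure variable. It is not the log-linear procedure the paper describes, only an illustration of the underlying relation "observed counts = M × true counts". The function names, the sensitivity and specificity values, and the counts are all hypothetical and chosen for illustration.

```python
# Illustrative sketch only: matrix-inversion correction for a binary variable
# with a KNOWN misclassification matrix. All names and numbers are assumptions,
# not taken from the paper.

def misclassification_matrix(sensitivity, specificity):
    """Return M with M[i][j] = P(observed = i | true = j).

    Index 0 = exposed, index 1 = unexposed.
    """
    return [
        [sensitivity, 1.0 - specificity],
        [1.0 - sensitivity, specificity],
    ]


def invert_2x2(m):
    """Invert a 2 x 2 matrix, refusing a (near-)singular one."""
    (a, b), (c, d) = m
    det = a * d - b * c
    if abs(det) < 1e-12:
        raise ValueError("misclassification matrix is singular")
    return [[d / det, -b / det], [-c / det, a / det]]


def correct_counts(observed, m):
    """Estimate true counts by solving observed = M @ true."""
    inv = invert_2x2(m)
    return [
        inv[0][0] * observed[0] + inv[0][1] * observed[1],
        inv[1][0] * observed[0] + inv[1][1] * observed[1],
    ]


def odds_ratio(cases, controls):
    """Odds ratio from [exposed, unexposed] counts in cases and controls."""
    return (cases[0] * controls[1]) / (cases[1] * controls[0])


# Hypothetical observed counts, [exposed, unexposed], with the same
# (nondifferential) misclassification assumed in cases and controls.
M = misclassification_matrix(sensitivity=0.9, specificity=0.8)
observed_cases = [62.0, 38.0]
observed_controls = [41.0, 59.0]

corrected_cases = correct_counts(observed_cases, M)
corrected_controls = correct_counts(observed_controls, M)

print("observed OR: ", odds_ratio(observed_cases, observed_controls))
print("corrected OR:", odds_ratio(corrected_cases, corrected_controls))
```

With these hypothetical inputs the observed odds ratio is closer to 1 than the corrected one, which matches the familiar result that nondifferential misclassification of a binary exposure attenuates the association in a 2 × 2 table. In practice the inversion can yield negative corrected counts when the assumed matrix is inconsistent with the data, which is one reason a model-based approach may be preferred.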