Confidence intervals for multinomial logistic regression in sparse data
- 17 February 2006
- journal article
- research article
- Published by Wiley in Statistics in Medicine
- Vol. 26 (4) , 903-918
- https://doi.org/10.1002/sim.2518
Abstract
Logistic regression is one of the most widely used regression models in practice, but alternatives to conventional maximum likelihood estimation methods may be more appropriate for small or sparse samples. Modification of the logistic regression score function to remove first‐order bias is equivalent to penalizing the likelihood by the Jeffreys prior, and yields penalized maximum likelihood estimates (PLEs) that always exist, even in samples in which maximum likelihood estimates (MLEs) are infinite. PLEs are an attractive alternative in small‐to‐moderate‐sized samples, and are preferred to exact conditional MLEs when there are continuous covariates. We present methods to construct confidence intervals (CI) in the penalized multinomial logistic regression model, and compare CI coverage and length for the PLE‐based methods to that of conventional MLE‐based methods in trinomial logistic regressions with both binary and continuous covariates. Based on simulation studies in sparse data sets, we recommend profile CIs over asymptotic Wald‐type intervals for the PLEs in all cases. Furthermore, when finite sample bias and data separation are likely to occur, we prefer PLE profile CIs over MLE methods. Copyright © 2006 John Wiley & Sons, Ltd.Keywords
This publication has 18 references indexed in Scilit:
- A solution to the problem of separation in logistic regressionStatistics in Medicine, 2002
- Small-sample bias and corrections for conditional maximum-likelihood odds-ratio estimatorsBiostatistics, 2000
- Bias reduction of maximum likelihood estimatesBiometrika, 1993
- On the computation of likelihood ratio and score test based confidence intervals in generalized linear modelsStatistics in Medicine, 1992
- On Bayesian Analysis of Generalized Linear Models Using Jeffreys's PriorJournal of the American Statistical Association, 1991
- On Bayesian Analysis of Generalized Linear Models Using Jeffreys's PriorJournal of the American Statistical Association, 1991
- Multiple Imputation of Industry and Occupation Codes in Census Public-use Samples Using Bayesian Logistic RegressionJournal of the American Statistical Association, 1991
- Judging Inference Adequacy in Logistic RegressionJournal of the American Statistical Association, 1986
- Calculation of Polychotomous Logistic Regression Parameters Using Individualized RegressionsBiometrika, 1984
- On the existence of maximum likelihood estimates in logistic regression modelsBiometrika, 1984