Sample Size and Robustness of Inferences from Logistic Regression in the Presence of Nonlinearity and Multicollinearity

The logistic regression models has been widely used in the social and natural sciences and results from studies using this model can have significant impact. Thus, confidence in the reliability of inferences drawn from these models is essential. The robustness of such inferences is dependent on sample size. The purpose of this study is to examine the impact of sample size on the mean estimated bias and efficiency of parameter estimation and inference for the logistic regression model. A number of simulations are conducted examining the impact of sample size, nonlinear predictors, and multicollinearity on substantive inferences (e.g. odds ratios, marginal effects) and goodness of fit (e.g. pseudo-R2, predictability) of logistic regression models. Findings suggest that sample size can affect parameter estimates and inferences in the presence of multicollinearity and nonlinear predictor functions, but marginal effects estimates are relatively robust to sample size.

Issue Date:
Publication Type:
Conference Paper/ Presentation
DOI and Other Identifiers:
Record Identifier:
PURL Identifier:
Total Pages:
Series Statement:
Selected Paper

 Record created 2017-04-01, last modified 2019-08-26

Download fulltext

Rate this document:

Rate this document:
(Not yet reviewed)