10. Modelling binary outcome data
10.1 Introduction
10.2 Problems with standard regression models
10.2.1 The r-x relationship may well not be linear
10.2.2 Predicted values of the risk may be outside the valid range
10.2.3 The error distribution is not normal
10.3 Logistic regression
10.4 Interpretation of logistic regression coefficients
10.4.1 Binary risk factors
10.4.2 Quantitative risk factors
10.4.3 Categorical risk factors
10.4.4 Ordinal risk factors
10.4.5 Floating absolute risks
10.5 Generic data
10.6 Multiple logistic regression models
10.7 Tests of hypotheses
10.7.1 Goodness of fit for grouped data
10.7.2 Goodness of fit for generic data
10.7.3 Effect of a risk factor
10.7.4 Tests for linearity and nonlinearity
10.7.5 Tests based upon estimates and their standard errors
10.7.6 Problems with missing values
10.8 Confounding
10.9 Interaction
10.9.1 Between two categorical variables
10.9.2 Between a quantitative and a categorical variable
10.9.3 Between two quantitative variables
10.10 Model checking
10.10.1 Residuals
10.10.2 Influential observations
10.11 Regression dilution
10.11.1 Correcting for regression dilution
10.12 Case–control studies
10.12.1 Unmatched studies
10.12.2 Matched studies
10.13 Outcomes with several ordered levels
10.13.1 The proportional odds assumption
10.13.2 The proportional odds model
10.14 Longitudinal data
10.15 Complex sampling designs
Exercises