Click to enlarge See the back cover 
Applied Survey Data Analysis 

$69.50 each 




Comment from the Stata technical groupApplied Survey Data Analysis is an intermediatelevel, exampledriven treatment of current methods for complex survey data. It will appeal to researchers of all disciplines who work with survey data and have basic knowledge of applied statistical methodology for standard (nonsurvey) data. The authors begin with some history and by discussing some widely used survey datasets, such as the National Health and Nutrition Examination Survey (NHANES). They then follow with the basic concepts of survey data: sampling plans, weights, clustering, prestratification and poststratification, design effects, and multistage samples. Discussion then turns to the types of variance estimators: Taylor linearization, jackknife, bootstrap, and balanced and repeated replication. The middle sections of the text provide indepth coverage of the types of analyses that can be performed with survey data, including means and proportions, correlations, tables, linear regression, regression with limited dependent variables (including logit and Poisson), and survival analysis (including Cox regression). Two final chapters are devoted to advanced topics, such as multiple imputation, Bayesian analysis, and multilevel models. The appendix provides overviews of popular statistical software, including Stata. 

Table of contentsView table of contents >> 8. Logistic Regression and Generalized Linear Models for Binary Survey Variables
8.1 Introduction
8.2 Generalized Linear Models for Binary Survey Responses
8.2.1 The Logistic Regression Model
8.3 Building the Logistic Regression Model: Stage 1, Model Specification8.2.2 The Probit Regression Model 8.2.3 The Complementary Log–Log Model 8.4 Building the Logistic Regression Model: Stage 2, Estimation of Model Parameters and Standard Errors 8.5 Building the Logistic Regression Model: Stage 3, Evaluation of the Fitted Model
8.5.1 Wald Tests of Model Parameters
8.6 Building the Logistic Regression Model: Stage 4, Interpretation and Inference8.5.2 Goodness of Fit and Logistic Regression Diagnostics 8.7 Analysis Application
8.7.1 Stage 1: Model Specification
8.8 Comparing the Logistic, Probit, and Complementary Log–Log GLMs for Binary Dependent Variables8.7.2 Stage 2: Model Estimation 8.7.3 Stage 3: Model Evaluation 8.7.4 Stage 4: Model Interpretation/Inference 8.9 Exercises 9. Generalized Linear Models for Multinomial, Ordinal, and Count Variables
9.1 Introduction
9.2 Analyzing Survey Data Using Multinomial Logit Regression Models
9.2.1 The Multinomial Logit Regression Model
9.3 Logistic Regression Models for Ordinal Survey Data9.2.2 Multinomial Logit Regression Model: Specification Stage 9.2.3 Multinomial Logit Regression Model: Estimation Stage 9.2.4 Multinomial Logit Regression Model: Evaluation Stage 9.2.5 Multinomial Logit Regression Model: Interpretation Stage 9.2.6 Example: Fitting a Multinomial Logit Regression Model to Complex Sample Survey Data
9.3.1 Cumulative Logit Regression Model
9.4 Regression Models for Count Outcomes9.3.2 Cumulative Logit Regression Model: Specification Stage 9.3.3 Cumulative Logit Regression Model: Estimation Stage 9.3.4 Cumulative Logit Regression Model: Evaluation Stage 9.3.5 Cumulative Logit Regression Model: Interpretation Stage 9.3.6 Example: Fitting a Cumulative Logit Regression Model to Complex Sample Survey Data
9.4.1 Survey Count Variables and Regression Modeling Alternatives
9.5 Exercises9.4.2 Generalized Linear Models for Count Variables
9.4.2.1 The Poisson Regression Model
9.4.3 Regression Models for Count Data: Specification Stage9.4.2.2 The Negative Binomial Regression Model 9.4.2.3 TwoPart Models: ZeroInflated Poisson and Negative Binomial Regression Models 9.4.4 Regression Models for Count Data: Estimation Stage 9.4.5 Regression Models for Count Data: Evaluation Stage 9.4.6 Regression Models for Count Data: Interpretation Stage 9.4.7 Example: Fitting Poisson and Negative Binomial Regression Models to Complex Sample Survey Data 10. Survival Analysis of Event History Survey Data
10.1 Introduction
10.2 Basic Theory of Survival Analysis
10.2.1 Survey Measurement of Event History Data
10.3 (Nonparametric) Kaplan–Meier Estimation of the Survivor Function10.2.2 Data for Event History Models 10.2.3 Important Notation and Definitions 10.2.4 Models for Survival Analysis
10.3.1 K–M Model Specification and Estimation
10.4 Cox Proportional Hazards Model10.3.2 K–M Estimator—Evaluation and Interpretation 10.3.3 K–M Survival Analysis Example
10.4.1 Cox Proportional Hazards Model: Specification
10.5 Discrete Time Survival Models10.4.2 Cox Proportional Hazards Model: Estimation Stage 10.4.3 Cox Proportional Hazards Model: Evaluation and Diagnostics 10.4.4 Cox Proportional Hazards Model: Interpretation and Presentation of Results 10.4.5 Example: Fitting a Cox Proportional Hazards Model to Complex Sample Survey Data
10.5.1 The Discrete Time Logistic Model
10.6 Exercises10.5.2 Data Preparation for Discrete Time Survival Models 10.5.3 Discrete Time Models: Estimation Stage 10.5.4 Discrete Time Models: Evaluation and Interpretation 10.5.5 Fitting a Discrete Time Model to Complex Sample Survey Data 11. Multiple Imputation: Methods and Applications for Survey Analysts
11.1 Introduction
11.2 Important Missing Data Concepts
11.2.1 Sources and Patterns of ItemMissing Data in Surveys
11.3 An Introduction to Imputation and the Multiple Imputation Method11.2.2 ItemMissing Data Mechanisms 11.2.3 Implications of ItemMissing Data for Survey Data Analysis 11.2.4 Review of Strategies to Address ItemMissing Data in Surveys
11.3.1 A Brief History of Imputation Procedures
11.4 Models for Multiply Imputing Missing Data11.3.2 Why the Multiple Imputation Method? 11.3.3 Overview of Multiple Imputation and MI Phases
11.4.1 Choosing the Variables to Include in the Imputation Model
11.5 Creating the Imputations11.4.2 Distributional Assumptions for the Imputation Model
11.5.1 Transforming the Imputation Problem to Monotonic Missing Data
11.6 Estimation and Inference for Multiply Imputed Data11.5.2 Specifying an Explicit Multivariate Model and Applying Exact Bayesian Posterior Simulation Methods 11.5.3 Sequential Regression or “Chained Regressions”
11.6.1 Estimators for Population Parameters and Associated Variance Estimators
11.7 Applications to Survey Data11.6.2 Model Evaluation and Inference
11.7.1 Problem Definition
11.8 Exercises11.7.2 The Imputation Model for the NHANES Blood Pressure Example 11.7.3 Imputation of the ItemMissing Data 11.7.4 Multiple Imputation Estimation and Inference
11.7.4.1 Multiple Imputation Analysis 1: Estimation of Mean Diastolic Blood Pressure
11.7.4.2 Multiple Imputation Analysis 2: Estimation of the Linear Regression Model for Diastolic Blood Pressure 12. Advanced Topics in the Analysis of Survey Data
12.1 Introduction
12.2 Bayesian Analysis of Complex Sample Survey Data 12.3 Generalized Linear Mixed Models (GLMMs) in Survey Data Analysis
12.3.1 Overview of Generalized Linear Mixed Models
12.4 Fitting Structural Equation Models to Complex Sample Survey Data12.3.2 Generalized Linear Mixed Models and Complex Sample Survey Data 12.3.3 GLMM Approaches to Analyzing Longitudinal Survey Data 12.3.4 Example: Longitudinal Analysis of the HRS Data 12.3.5 Directions for Future Research 12.5 Small Area Estimation and Complex Sample Survey Data 12.6 Nonparametric Methods for Complex Sample Survey Data Appendix A: Software Overview
A.1 Introduction
A.1.1 Historical Perspective
A.2 Overview of Stata® Version 10+A.1.2 Software for Sampling Error Estimation A.3 Overview of SAS® Version 9.2
A.3.1 The SAS SURVEY Procedures
A.4 Overview of SUDAAN® Version 9.0
A.4.1 The SUDAAN Procedures
A.5. Overview of SPSS®
A.5.1 The SPSS Complex Samples Commands
A.6 Overview of Additional Software
A.6.1 WesVar®
A.7 SummaryA.6.2 IVEware (Imputation and Variance Estimation Software) A.6.3 Mplus A.6.4 The R survey Package References
Index
