Stata Bookstore: Multiple Imputation and Its Application

Home / Bookstore / Title index / Biostatistics and epidemiology / Multiple Imputation and Its Application

Multiple Imputation and Its Application

As an Amazon Associate, StataCorp earns a small referral credit from qualifying purchases made from affiliate links on our site.

Amazon Associate affiliate link

What are VitalSource eBooks?
Your access code will be emailed upon purchase.

eBook not available for this title

Authors:	James R. Carpenter and Michael G. Kenward
Publisher:	Wiley
Copyright:	2013
ISBN-13:	978-0-470-74052-1
Pages:	345; hardcover

Authors:	James R. Carpenter and Michael G. Kenward
Publisher:	Wiley
Copyright:	2013
ISBN-13:
Pages:	345; eBook
Price:	$0.00

Authors:	James R. Carpenter and Michael G. Kenward
Publisher:	Wiley
Copyright:	2013
ISBN-13:
Pages:	345; Kindle
Price:	$

Comment from the Stata technical group

Multiple Imputation and its Application, by James R. Carpenter and Michael G. Kenward, provides an excellent review of multiple imputation (MI) from basic to advanced concepts. MI is a statistical method for analyzing incomplete data. The flexibility of the MI procedure has prompted its use in a wide variety of applications. This book describes the rationale for MI and its underlying assumptions in a broad range of statistical settings, and demonstrates the use of this procedure for handling missing data in complex data structures.

The text provides a good mixture of theory and practice. Throughout the book, the concepts are illustrated with real data examples.

The book is divided into three parts: foundations, MI for cross-sectional data, and advanced topics. The first part reviews the basic concepts of missing data, such as types of missing data and missing-data assumptions, and of multiple imputation, such as the MI procedure and its justification. The second part describes the use of MI for handling missing values in cross-sectional data, including the imputation of different types of data (continuous, binary, ordinal, etc.), and for handling nonlinearities and interactions during imputation. The third part discusses the advanced use of MI for dealing with missing data in complex data structures such as survival data and multilevel data. Other important advanced topics are covered, including the handling of survey weights during imputation, sensitivity analysis, and robust MI.

View table of contents >>

Preface

Data acknowledgements

Acknowledgements

Glossary

PART 1 FOUNDATIONS

1 Introduction

1.1 Reasons for missing data
1.2 Examples
1.3 Patterns of missing data

1.3.1 Consequences of missing data

1.4 Inferential framework and notation

1.4.1 Missing Completely At Random (MCAR)
1.4.2 Missing At Random (MAR)
1.4.3 Missing Not At Random (MNAR)
1.4.4 Ignorability

1.5 Using observed data to inform assumptions about the missingness mechanism
1.6 Implications of missing data mechanisms for regression analyses

1.6.1 Partially observed response
1.6.2 Missing covariates
1.6.3 Missing covariates and response
1.6.4 Subtle issues I: The odds ratio
1.6.5 Implication for linear regression
1.6.6 Subtle issues II: Subsample ignorability
1.6.7 Summary: When restricting to complete records is valid

1.7 Summary

2 The multiple imputation procedure and its justification

2.1 Introduction
2.2 Intuitive outline of the MI procedure
2.3 The generic MI procedure
2.4 Bayesian justification of MI
2.5 Frequentist inference

2.5.1 Large number of imputations
2.5.2 Small number of imputations

2.6 Choosing the number of imputations
2.7 Some simple examples
2.8 MI in more general settings

2.8.1 Survey sample settings

2.9 Constructing congenial imputation models
2.10 Practical considerations for choosing imputation models
2.11 Discussion

PART II MULTIPLE IMPUTATION FOR CROSS SECTIONAL DATA

3 Multiple imputation of quantitative data

3.1 Regression imputation with a monotone missingness pattern

3.1.1 MAR mechanisms consistent with a monotone pattern
3.1.2 Justification

3.2 Joint modelling

3.2.1 Fitting the imputation model

3.3 Full conditional specification

3.3.1 Justification

3.4 Full conditional specification versus joint modelling
3.5 Software for multivariate normal imputation
3.6 Discussion

4 Multiple imputation of binary and ordinal data

4.1 Sequential imputation with monotone missingness pattern
4.2 Joint modelling with the multivariate normal distribution
4.3 Modelling binary data using latent normal variables

4.3.1 Latent normal model for ordinal data

4.4 General location model
4.5 Full conditional specification

4.5.1 Justification

4.6 Issues with over-fitting
4.7 Pros and cons of the various approaches
4.8 Software
4.9 Discussion

5 Multiple imputation of unordered categorical data

5.1 Monotone missing data
5.2 Multivariate normal imputation for categorical data
5.3 Maximum indicant model

5.3.1 Continuous and categorical variable
5.3.2 Imputing missing data
5.3.3 More than one categorical variable

5.4 General location model
5.5 FCS with categorical data
5.6 Perfect prediction issues with categorical data
5.7 Software
5.8 Discussion

6 Nonlinear relationships

6.1 Passive imputation
6.2 No missing data in nonlinear relationships
6.3 Missing data in nonlinear relationships

6.3.1 Predictive Mean Matching (PMM)
6.3.2 Just Another Variable (JAV)
6.3.3 Joint modelling approach
6.3.4 Extension to more general models and missing data patterns
6.3.5 Metropolis-Hastings sampling
6.3.6 Rejection sampling
6.3.7 FCS approach

6.4 Discussion

7 Interactions

7.1 Interaction variables fully observed
7.2 Interactions of categorical variables
7.3 General nonlinear relationships
7.4 Software
7.5 Discussion

PART III ADVANCED TOPICS

8 Survival data, skips and large datasets

8.1 Time-to-event data

8.1.1 Imputing missing covariate values
8.1.2 Survival data as categorical
8.1.3 Imputing censored survival times

8.2 Nonparametric, or ‘hot deck’ imputation

8.2.1 Nonparametric imputation for survival data

8.3 Multiple imputation for skips
8.4 Two-stage MI
8.5 Large datasets

8.5.1 Large datasets and joint modelling
8.5.2 Shrinkage by constraining parameters
8.5.3 Comparison of the two approaches

8.6 Multiple imputation and record linkage
8.7 Measurement error
8.8 Multiple imputation for aggregated scores
8.9 Discussion

9 Multilevel multiple imputation

9.1 Multilevel imputation model
9.2 MCMC algorithm for imputation model
9.3 Imputing level-2 covariates using FCS
9.4 Individual patient meta-analysis

9.4.1 When to apply Rubin’s rules

9.5 Extensions

9.5.1 Random level-1 covariance matrices
9.5.2 Model fit

9.6 Discussion

10 Sensitivity analysis: MI unleashed

10.1 Review of MNAR modelling
10.2 Framing sensitivity analysis
10.3 Pattern mixture modelling with MI

10.3.1 Missing covariates
10.3.2 Application to survival analysis

10.4 Pattern mixture approach with longitudinal data via MI

10.4.1 Change in slope post-deviation

10.5 Piecing together post-deviation distributions from other trial arms
10.6 Approximating a selection model by importance weighting

10.6.1 Algorithm for approximate sensitivity analysis by re-weighting

10.7 Discussion

11 Including survey weights

11.1 Using model based predictions
11.2 Bias in the MI variance estimator

11.2.1 MI with weights
11.2.2 Estimation in domains

11.3 A multilevel approach
11.4 Further developments
11.5 Discussion

12 Robust multiple imputation

12.1 Introduction
12.2 Theoretical background

12.2.1 Simple estimating equations
12.2.2 The Probability Of Missingness (POM) model
12.2.3 Augmented inverse probability weighted estimating equation

12.3 Robust multiple imputation

12.3.1 Univariate MAR missing data
12.3.2 Longitudinal MAR missing data

12.4 Simulation studies

12.4.1 Univariate MAR missing data
12.4.2 Longitudinal monotone MAR missing data
12.4.3 Longitudinal nonmonotone MAR missing data
12.4.4 Nonlongitudinal nonmonotone MAR missing data
12.4.5 Results and discussion

12.5 The RECORD study
12.6 Discussion

Appendix A Markov Chain Monte Carlo

Appendix B Probability distributions

B.1 Posterior for the multivariate normal distribution

Bibliography

Index of Authors

Index of Examples

Index

Multiple Imputation and Its Application

Comment from the Stata technical group

Table of contents

We use cookies

Privacy policy

Required cookies

Advertising and performance cookies

Stata/MP4 Annual License (download)

Multiple Imputation and Its Application

Comment from the Stata technical group

Table of contents

We use cookies

Privacy policy

Required cookies

Advertising and performance cookies