
Statistics Using Stata: An Integrative Approach, Second Edition |
||||||||||||||||||||||||||||||||||||
![]() Click to enlarge |
![]()
As an Amazon Associate, StataCorp earns a small referral credit from
qualifying purchases made from affiliate links on our site.
VitalSource eBook details
VitalSource eBooks are read using the Bookshelf How to access your eBook1) Visit Bookshelf online to sign in or create an account.
2) Once logged in, click redeem in the upper right corner. Enter your eBook code. Your eBook code will be in your order confirmation email under the eBook's title. 3) The eBook will be added to your library. You may then download Bookshelf on other devices and sync your library to view the eBook.
Bookshelf is available on the following:
Bookshelf allows you to have 2 computers and 2 mobile devices activated at any given time.
I was amazed at the VitalSource way of presenting the books....
Everything looks perfectly typeset, but yet you can "flip"
through the book in the same way you would "flip" through a very
long web page in your web browser. And best of all, whenever I
have my tablet with me, my books are just a swipe away.
— Michael Mitchell
Senior statistician at the USC Children's Data Network, author of four Stata Press books, and former UCLA statistical consultant who envisioned and designed the UCLA Statistical Consulting Resources website. Return policy for eBooksStata Press eBooks are nonreturnable and nonrefundable. ×eBook not available for this title |
|
||||||||||||||||||||||||||||||||||
Comment from the Stata technical groupStatistics Using Stata: An Integrative Approach, Second Edition, by Sharon Lawner Weinberg and Sarah Knapp Abramowitz, is an excellent introduction to applied statistics and its implementation in Stata. The authors cover essential topics from exploratory data analysis to multiple regression, interweaving statistical concepts and their application in Stata. Their repeated use of real data throughout the book clearly connects the statistical concepts to real-world applications. Designed for teaching graduate and undergraduate students from the behavioral, social, and health sciences, this text is accompanied by additional resources online such as Powerpoint slides and Stata do-files. Each chapter concludes with exercises and a review of Stata code used in the examples, allowing readers to test their knowledge and refer back to Stata commands. The authors guide the reader from basic statistical concepts to more advanced material, tying concepts together to emphasize the overarching ideas. They begin with descriptive statistics, discussing the different variable types and the corresponding graphs and statistics used to examine their distribution and relationship with other variables. Then, they discuss the law of large numbers, theoretical probability distributions, and sampling, preparing the reader to dive into inferential statistics. The authors then present ANOVA, simple and multiple regression, and nonparametric methods. They carefully explain what the values represent in context of the data and how the methods relate to one another, allowing readers to really grasp the meaning behind the analyses. Weinberg and Abramowitz are just as careful when teaching the reader how to implement statistical methods in Stata. First, they introduce the reader to Stata's interface and the general syntax of Stata’s commands. Then, they explain the importance of do-files for reproducing one’s work and encourage the reader to work alongside the text with the do-files provided at the companion website. Readers can then use these do-files as a starting point when performing analyses on their own data. The authors have updated the second edition based on the most recent version of Stata, version 16. Building on the concept of power discussed in the first edition, they use the power suite to demonstrate how to perform power analysis for the inferential methods discussed. Additionally, an entirely new chapter is devoted to understanding and visualizing statistical interactions, providing much needed clarity on a topic that has confused many students in the past. And, reflecting on the importance of creating publication-quality reports that can be easily reproduced, the authors have added a new chapter demonstrating how to embed Stata results in Excel files. |
||||||||||||||||||||||||||||||||||||
Table of contentsView table of contents >> Preface
New to the Second Edition
Guiding Principles Underlying Our Approach Overview of Content Coverage and Intended Audience Acknowledgments
1 INTRODUCTION
The Role of Statistical Software in Data Analysis
Statistics: Descriptive and Inferential Variables and Constants The Measurement of Variables
Nominal Level
Discrete and Continuous VariablesOrdinal Level Interval Level Ratio Level Choosing a Scale of Measurement Setting a Context with Real Data Exercises 2 EXAMINING UNIVARIATE DISTRIBUTIONS
Counting the Occurrence of Data Values
When Variables are Measured at the Nominal Level
Frequency and Percent Distribution Tables
When Variables are Measured at the Ordinal, Interval, or Ratio LevelBar Charts Pie Charts
Frequency and Percent Distribution Tables
Describing the Shape of a DistributionStem-and-Leaf Displays Histograms Line Graphs Accumulating Data
Cumulative Percent Distributions
Summary of Graphical SelectionOgive Curves Percentile Ranks Percentiles Five-Number Summaries and Boxplots Modifying the Appearance of Graphs Summary of Stata Commands Exercises 3 MEASURES OF LOCATION, SPREAD, AND SKEWNESS
Characterizing the Location of a Distribution
The Mode
Characterizing the Spread of a DistributionThe Median The Arithmetic Mean
Interpreting the Mean of a Dichotomous Variable
Comparing the Mode, Median, and MeanThe Weighted Mean
The Range and Interquartile Range
Characterizing the Skewness of a DistributionThe Variance The Standard Deviation Selecting Measures of Location and Spread Applying What We Have Learned Summary of Stata Commands
Helpful Hints When Using Stata
Exercises
Online Resources
The Stata Command Stata Tips 4 RE–EXPRESSING VARIABLES
Linear and Nonlinear Transformations
Linear Transformations: Addition, Subtraction, Multiplication, and Division
The Effect on the Shape of a Distribution
Nonlinear Transformations: Square Roots and LogarithmsThe Effect on Summary Statistics of a Distribution Common Linear Transformations Standard Scores z-Scores
Using z-Scores to Detect Outliers
Using z-Scores to Compare Scores in Different Distributions Relating z-Scores to Percentile Ranks Nonlinear Transformations: Ranking Variables Other Transformations: Recoding and Combining Variables
Recoding Variables
Data Management Fundamentals: The Do-FileCombining Variables Summary of Stata Commands Exercises 5 EXPLORING RELATIONSHIPS BETWEEN TWO VARIABLES
When Both Variables are at Least Interval-Leveled
Scatterplots
When at Least One Variable Is Ordinal and the Other Is at Least Ordinal: The
Spearman Rank Correlation CoefficientThe Pearson Product–Moment Correlation Coefficient
Interpreting the Pearson Correlation Coefficient
Judging the Strength of the Linear Relationship
The Correlation Scale Itself Is Ordinal Correlation Does Not Imply Causation The Effect of Linear Transformations Restriction of Range The Shape of the Underlying Distributions The Reliability of the Data When at Least One Variable Is Dichotomous: Other Special Cases of the Pearson Correlation Coefficient
The Point Biserial Correlation Coefficient: The Case of One at Least
Interval and One Dichotomous Variable
Other Visual Displays of Bivariate RelationshipsThe Phi Coefficient: The Case of Two Dichotomous Variables Selection of Appropriate Statistic or Graph to Summarize a Relationship Summary of Stata Commands Exercises 6 SIMPLE LINEAR REGRESSION
The “Best-Fitting” Linear Equation
The Accuracy of Prediction Using the Linear Regression Model The Standardized Regression Equation R As a Measure of the Overall Fit of the Linear Regression Model Simple Linear Regression When the Independent Variable Is Dichotomous Using r and R As Measures of Effect Size Emphasizing the Importance of the Scatterplot Summary of Stata Commands Exercises 7 PROBABILITY FUNDAMENTALS
The Discrete Case
The Complement Rule of Probability The Additive Rules of Probability
First Additive Rule of Probability
The Multiplicative Rule of ProbabilitySecond Additive Rule of Probability The Relationship between Independence and Mutual Exclusivity Conditional Probability The Law of Total Probability Bayes' Theorem The Law of Large Numbers Exercises 8 THEORETICAL PROBABILITY MODELS
The Binomial Probability Model and Distribution
The Applicability of the Binomial Probability Model
The Normal Probability Model and DistributionUsing the Normal Distribution to Approximate the Binomial Distribution Summary of Stata Commands Exercises 9 THE ROLE OF SAMPLING IN INFERENTIAL STATISTICS
Samples and Populations
Random Samples
Obtaining a Simple Random Sample
Sampling with and without ReplacementSampling Distributions Describing the Sampling Distribution of Means Empirically Describing the Sampling Distribution of Means Theoretically
Central Limit Theorem
Estimators and BIASSummary of Stata Commands Exercises 10 INFERENCES INVOLVING THE MEAN OF A SINGLE POPULATION
WHEN σ IS KNOWN
Estimating the Population Mean, μ, When the Population Standard Deviation,
σ, Is Known
Interval Estimation Relating the Length of a Confidence Interval, the Level of Confidence, and the Sample Size Hypothesis Testing The Relationship between Hypothesis Testing and Interval Estimation Effect Size Type II Error and the Concept of Power
Increasing the Level of Significance, α
Closing RemarksIncreasing the Effect Size, δ Decreasing the Standard Error of the Mean, σ𝓍̅ Summary of Stata Commands Exercises 11 INFERENCES INVOLVING THE MEAN WHEN σ IS NOT
KNOWN: ONE- AND TWO-SAMPLE DESIGNS
Single Sample Designs When the Parameter of Interest Is the Mean and σ
Is Not Known
The t- Distribution
Two-Sample Designs When the Parameter of Interest Is μ, and σ Is
Not KnownDegrees of Freedom for the One-Sample t-Test Violating the Assumption of a Normally Distributed Parent Population in the One-Sample t-Test Confidence Intervals for the One-Sample t-Test Hypothesis Tests: The One-Sample t-Test Effect Size for the One-Sample t-Test
Independent (or Unrelated) and Dependent (or Related) Samples
The BootstrapIndependent Samples t-Test and Confidence Interval The Assumptions of the Independent Samples t-Test Effect Size for the Independent Samples t-Test Paired Samples t-Test and Confidence Interval The Assumptions of the Paired Samples t-Test Effect Size for the Paired Samples t-Test Conducting Power Analyses for t-Tests on Means Summary Summary of Stata Commands Exercises 12 RESEARCH DESIGN: INTRODUCTION AND OVERVIEW
Questions and their Link to Descriptive, Relational, and Causal Research
Studies
The Need for a Good Measure of Our Construct: Weight
The Gold Standard of Causal Studies: The True Experiment and Random AssignmentThe Descriptive Study From Descriptive to Relational Studies From Relational to Causal Studies Comparing Two Kidney Stone Treatments Using a Non-Randomized Controlled Study Including Blocking in a Research Design Underscoring the Importance of Having a True Control Group Using Randomization Analytic Methods for Bolstering Claims of Causality from Observational Data Quasi-Experimental Designs Threats to the Internal Validity of a Quasi-Experimental Design Threats to the External Validity of a Quasi-Experimental Design Threats to the Validity of a Study: Some Clarifications and Caveats Threats to the Validity of a Study: Some Examples Exercises 13 ONE-WAY ANALYSIS OF VARIANCE
The Disadvantage of Multiple t-Tests
The One-Way Analysis of Variance
A Graphical Illustration of the Role of Variance in Tests on Means
Measuring the Effect SizeANOVA As an Extension of the Independent Samples t-Test Developing an Index of Separation for the Analysis of Variance Carrying Out the ANOVA Computation
The Between Group Variance (MSB)
The Assumptions of the One-Way ANOVAThe Within Group Variance (MSW) Testing the Equality of Population Means: The F-Ratio How to Read the Tables and Use Stata Functions for the F-Distribution ANOVA Summary Table Post-Hoc Multiple Comparison Tests The Bonferroni Adjustment: Testing Planned Comparisons The Bonferroni Tests on Multiple Measures Conducting Power Analyses for One-Way ANOVA Summary of Stata Commands Exercises 14 TWO-WAY ANALYSIS OF VARIANCE
The Two-Factor Design
The Concept of Interaction The Hypotheses That are Tested by a Two-Way Analysis of Variance Assumptions of the Two-Way Analysis of Variance Balanced versus Unbalanced Factorial Designs Partitioning the Total Sum of Squares Using the F-Ratio to Test the Effects in Two-Way ANOVA Carrying Out the Two-Way ANOVA Computation by Hand Decomposing Score Deviations about the Grand Mean Modeling Each Score as a Sum of Component Parts Explaining the Interaction As a Joint (or Multiplicative) Effect Measuring Effect Size Fixed versus Random Factors Post-hoc Multiple Comparison Tests
Simple Effects and Pairwise Comparisons
Summary of Steps to Be Taken in a Two-Way ANOVA ProcedureConducting Power Analyses for Two-Way ANOVA Summary of Stata Commands Exercises 15 CORRELATION AND SIMPLE REGRESSION AS INFERENTIAL
TECHNIQUES
The Bivariate Normal Distribution
Testing whether the Population Pearson Product-Moment Correlation Equals Zero Using a Confidence Interval to Estimate the Size of the Population Correlation Coefficient, ρ Revisiting Simple Linear Regression for Prediction
Estimating the Population Standard Error of Prediction, σΥ|Χ
Exploring the Goodness of Fit of the Regression Equation: Using Regression
DiagnosticsTesting the b-Weight for Statistical Significance Explaining Simple Regression Using an Analysis of Variance Framework Measuring the Fit of the Overall Regression Equation: Using R and R2 Relating R2 to σ2Υ|Χ Testing R2 for Statistical Significance Estimating the True Population R2: The Adjusted R2
Residual Plots: Evaluating the Assumptions Underlying Regression
Using the Prediction Model to Predict Ice Cream SalesDetecting Influential Observations: Discrepancy and Leverage Using Stata to Obtain Leverage Using Stata to Obtain Discrepancy Using Stata to Obtain Influence Using Diagnostics to Evaluate the Ice Cream Sales Example Simple Regression When the Predictor is Dichotomous Conducting Power Analyses for Correlation and Simple Regression Summary of Stata Commands Exercises 16 AN INTRODUCTION TO MULTIPLE REGRESSION
The Basic Equation with Two Predictors
Equations for b, β, and RΥ.12 When the Predictors Are Not Correlated Equations for b, β, and RΥ.12 When the Predictors Are Correlated Summarizing and Expanding on Some Important Principles of Multiple Regression Testing the b-Weights for Statistical Significance Assessing the Relative Importance of the Independent Variables in the Equation Measuring the Drop in R2 Directly: An Alternative to the Squared Semipartial Correlation Evaluating the Statistical Significance of the Change in R2 The b-Weight As a Partial Slope in Multiple Regression Multiple Regression When One of the Two Independent Variables is Dichotomous Controlling Variables Statistically: A Closer Look
A Hypothetical Example
Conducting Power Analyses for Multiple RegressionSummary of Stata Commands Exercises 17 TWO-WAY INTERACTIONS IN MULTIPLE REGRESSION
Testing the Statistical Significance of an Interaction Using Stata
Comparing the Y-Hat Values from the Additive and Interaction Models Centering First-Order Effects if the Equation Has an Interaction Probing the Nature of a Two-Way Interaction Interaction When One of the Independent Variables Is Dichotomous and the Other Is Continuous Methods Useful for Model Selection Conducting a Power Analysis to Detect an Interaction Summary of Stata Commands Exercises 18 NONPARAMETRIC METHODS
Parametric versus Nonparametric Methods
Nonparametric Methods When the Dependent Variable Is at the Nominal Level The Chi-Square Distribution (Χ2)
The Chi-Square Goodness-of-Fit Test
Nonparametric Methods When the Dependent Variable Is Ordinal-LeveledThe Chi-Square Test of Independence
Assumptions of the Chi-Square Test of Independence
Fisher’s Exact Test
Calculating the Fisher is Exact Test by Hand Using the Hypergeometric
Distribution
Wilcoxon Sign Test
The Mann–Whitney U Test or Wilcoxon's Rank-Sum Test
The Kruskal–Wallis Analysis of Variance
Summary of Stata CommandsExercises 19 COMMUNICATING YOUR STATA RESULTS VIA EXCEL
Setting the Working Directory
Reproducing a Table of Univariate Summary Statistics in Excel
Using estpost and esttab
Reproducing a Correlation Matrix As a Table in ExcelUsing putexcel
Using estpost and esttab
Reproducing Regression Output As a Table in ExcelUsing putexcel
Using outreg2 to obtain a table of model statistics in Excel
Reproducing a Graph in Excel (Using putexcel)Using eststo and esttab to obtain a table of model statistics in Excel Using putexcel to reproduce a table of regression coefficients in Excel Conclusion Summary of Stata Commands Exercises Appendix A Data Set Descriptions
Appendix B Stata .Do-files and Data Sets in Stata Format
Appendix C Statistical Tables
Appendix D Solutions
References
Index
|
Learn
NetCourses
Classroom and web training
Organizational training
Webinars
Video tutorials
Third-party courses
Web resources
Teaching with Stata
© Copyright 1996–2023 StataCorp LLC. All rights reserved.
×
We use cookies to ensure that we give you the best experience on our website—to enhance site navigation, to analyze usage, and to assist in our marketing efforts. By continuing to use our site, you consent to the storing of cookies on your device and agree to delivery of content, including web fonts and JavaScript, from third party web services.
Cookie Settings
Last updated: 16 November 2022
StataCorp LLC (StataCorp) strives to provide our users with exceptional products and services. To do so, we must collect personal information from you. This information is necessary to conduct business with our existing and potential customers. We collect and use this information only where we may legally do so. This policy explains what personal information we collect, how we use it, and what rights you have to that information.
These cookies are essential for our website to function and do not store any personally identifiable information. These cookies cannot be disabled.
This website uses cookies to provide you with a better user experience. A cookie is a small piece of data our website stores on a site visitor's hard drive and accesses each time you visit so we can improve your access to our site, better understand how you use our site, and serve you content that may be of interest to you. For instance, we store a cookie when you log in to our shopping cart so that we can maintain your shopping cart should you not complete checkout. These cookies do not directly store your personal information, but they do support the ability to uniquely identify your internet browser and device.
Please note: Clearing your browser cookies at any time will undo preferences saved here. The option selected here will apply only to the device you are currently using.