Structural equation modeling (SEM)

Structural equation modeling (SEM) was introduced in Stata 12.

See the latest version of SEM. See all of Stata's SEM features.

See the new features in Stata 19.

Structural equation modeling (SEM)

If you don’t know what SEM is, go here.

View the complete list of SEM capabilities

SEM stands for structural equation modeling. SEM is a notation for specifying structural equations, a way of thinking about them, and methods for estimating their parameters.

SEM encompasses a broad array of models from linear regression to measurement models to simultaneous equations, including along the way confirmatory factor analysis (CFA), correlated uniqueness models, latent growth models, and multiple indicators and multiple causes (MIMIC).

Stata’s new sem command fits SEMs.

Features

Use GUI or command language to specify model.
Standardized and unstandardized results.
Direct and indirect effects.
Goodness-of-fit statistics.
Tests for omitted paths and tests of model simplification including modification indices, score tests, and Wald tests.
Predicted values and factor scores.
Linear and nonlinear (1) tests of estimated parameters and (2) combinations of estimated parameters with CIs.
Estimation across groups is as easy as adding group(sex) to the command. Test for group invariance. Easily add or relax constraints across groups.
SEMs may be fitted using raw or summary statistics data.
Maximum likelihood (ML) and asymptotic distribution free (ADF) estimation. ADF is also known as generalized method of moments (GMM). Missing at random (MAR) data supported via FIML.
Robust estimate of standard errors and standard errors for clustered samples available.
Support for survey data including sampling weights, stratification and poststratification, and clustered sampling at one or more levels.

GUI or commands, it’s your choice

Enter your model graphically,

or use the command syntax

. sem (L1 -> m1 m2)
      (L2 -> m3 m4)
      (L3 <- L1 L2)
      (L3 -> m5 m6 m7)

It’s the same model either way.

Stata’s GUI uses standard path notation.

In command syntax, you type the path diagram. Capitalized names are latent variables. Lowercased names are observed variables. You can type arrows in either direction. The above model could be equally well typed as

. sem (m1 m2 <- L1)
      (L2 -> m3 m4)
      (L3 <- L1 L2)
      (L3 -> m5 m6 m7)

and order does not matter, and neither does spacing:

. sem (m1 m2 <- L1) (L2 -> m3 m4) (L3 -> m5 m6 m7) (L3 <- L1 L2)

You can specify paths individually,

. sem (m1 <- L1) (m2 <- L1) (L2 -> m3) (L2 -> m4)  (L3 -> m5) (L3 -> m6) (L3 -> m7) (L3 <- L1) (L3 <- L2)

or combined,

. sem (m1 m2 <- L1) (L2 -> m3 m4) (L3 -> m5 m6 m7) (L3 <- L1 L2)

Show me

Let’s fit a structural model with a measurement component using data from Wheaton, Muthén, Alwin, and Summers (1977):

Below we will demonstrate

Fitting the model
Modification indices
Refitting the model

Show me, fitting the model

Simplified versions of the model fit by the authors of the referenced paper appear in many SEM software manuals. One simplified model is

You can also readily fit this model using the following command:

And the results are

Notes:

Measurement component: In both 1967 and 1971 anomia and powerlessness are used to measure endogenous latent variables representing Alienation for the same two years. Education and occupational status are used to measure the exogenous latent variable SES.
Structural component: SES->Alien67 and SES->Alien71, and Alien67->Alien71.
The model vs. saturated chi-squared test indicates the model is a poor fit.

Show me, modification indices

That the model is a poor fit leads us to looking at the modification indices:

Notes:

There are lots of statistically significant paths we could add to the model.
Some of those statistically significant paths also make theoretical sense.
Two in particular that make sense are the covariances between e.anomia67 and e.anomia71, and between e.pwless67 and e.pwless71.

Show me, refitting the model

Let’s refit the model and include those two previously excluded covariances:

And the results are

Notes:

We find the covariance between e.anomia67 and e.anomia71 to be significant(Z=5.14).
We find the covariance between e.pwless67 and e.pwless71 to be insignificant at the 5% level (Z=1.29).

Back to highlights

See New in Stata 19 to learn about what was added in Stata 19.

We use cookies

We use cookies to ensure that we give you the best experience on our website—to enhance site navigation, to analyze usage, and to assist in our marketing efforts. By continuing to use our site, you consent to the storing of cookies on your device and agree to delivery of content, including web fonts and JavaScript, from third party web services.

Cookie Settings

Last updated: 16 November 2022

StataCorp LLC (StataCorp) strives to provide our users with exceptional products and services. To do so, we must collect personal information from you. This information is necessary to conduct business with our existing and potential customers. We collect and use this information only where we may legally do so. This policy explains what personal information we collect, how we use it, and what rights you have to that information.

Advertising and performance cookies

This website uses cookies to provide you with a better user experience. A cookie is a small piece of data our website stores on a site visitor's hard drive and accesses each time you visit so we can improve your access to our site, better understand how you use our site, and serve you content that may be of interest to you. For instance, we store a cookie when you log in to our shopping cart so that we can maintain your shopping cart should you not complete checkout. These cookies do not directly store your personal information, but they do support the ability to uniquely identify your internet browser and device.

Please note: Clearing your browser cookies at any time will undo preferences saved here. The option selected here will apply only to the device you are currently using.

Structural equation modeling (SEM) was introduced in Stata 12.

See the latest version of SEM. See all of Stata's SEM features.

See the new features in Stata 19.