Logistic regression

Order

<- See Stata's other features

Stata supports all aspects of logistic regression. View the list of logistic regression features.

Stata’s logistic fits maximum-likelihood dichotomous logistic models:

. webuse lbw
(Hosmer & Lemeshow data)

. logistic low age lwt i.race smoke ptl ht ui

Logistic regression                                     Number of obs =    189
                                                        LR chi2(8)    =  33.22
                                                        Prob > chi2   = 0.0001
Log likelihood = -100.724                               Pseudo R2     = 0.1416




         low    Odds ratio   Std. err.      z    P>|z|     [95% conf. interval]

   

         age     .9732636   .0354759    -0.74   0.457     .9061578    1.045339

         lwt     .9849634   .0068217    -2.19   0.029     .9716834    .9984249

          

        race  

      Black      3.534767   1.860737     2.40   0.016     1.259736    9.918406

      Other      2.368079   1.039949     1.96   0.050     1.001356    5.600207

                

       smoke     2.517698    1.00916     2.30   0.021     1.147676    5.523162

         ptl     1.719161   .5952579     1.56   0.118     .8721455    3.388787

          ht     6.249602   4.322408     2.65   0.008     1.611152    24.24199

          ui       2.1351   .9808153     1.65   0.099     .8677528      5.2534

       _cons     1.586014   1.910496     0.38   0.702     .1496092     16.8134

Note: _cons estimates baseline odds.

The syntax of all estimation commands is the same: the name of the dependent variable is followed by the names of the independent variables. In this case, the dependent variable low (containing 1 if a newborn had a birthweight of less than 2500 grams and 0 otherwise) was modeled as a function of a number of explanatory variables. By default, logistic reports odds ratios; logit alternative will report coefficients if you prefer.

Once a model has been fitted, you can use Stata's predict to obtain the predicted probabilities of a positive outcome, the value of the logit index, or the standard error of the logit index. You can also obtain Pearson residuals, standardized Pearson residuals, leverage (the diagonal elements of the hat matrix), Delta chi-squared, Delta D, and Pregibon's Delta beta influence measures by typing a single command. All statistics are adjusted for the number of covariate patterns in the data—m-asymptotic rather than n-asymptotic in Hosmer and Lemeshow (2000) jargon. Every diagnostic graph suggested by Hosmer and Lemeshow can be drawn by Stata.

Also available are the goodness-of-fit test, using either cells defined by the covariate patterns or grouping, as suggested by Hosmer and Lemeshow; classification statistics and the classification table; and a graph and area under the ROC curve.

Stata’s mlogit performs maximum likelihood estimation of models with categorical dependent variables. It is intended for use when the dependent variable takes on more than two outcomes and the outcomes have no natural ordering. Uniquely, linear constraints on the coefficients can be specified both within and across equations using algebraic syntax. Much thought has gone into making mlogit truly usable. For instance, there are no artificial constraints placed on the nature of the dependent variable. The dependent variable is not required to take on integer, contiguous values such as 1, 2, and 3, although such a coding would be acceptable. Equally acceptable would be 1, 3, and 4, or even 1.2, 3.7, and 4.8.

Stata’s clogit performs maximum likelihood estimation with a dichotomous dependent variable; conditional logistic analysis differs from regular logistic regression in that the data are stratified and the likelihoods are computed relative to each stratum. The form of the likelihood function is similar but not identical to that of multinomial logistic regression. Conditional logistic analysis is known in epidemiology circles as the matched case–control model and in econometrics as McFadden's choice model. The form of the data, as well as the nature of the sampling, differs across the two settings, but clogit handles both. clogit allows both 1:1 and 1:k matching, and there may even be more than one positive outcome per strata (which is handled using the exact solution).

Stata’s ologit performs maximum likelihood estimation to fit models with an ordinal dependent variable, meaning a variable that is categorical and in which the categories can be ordered from low to high, such as “poor”, “good”, and “excellent”. Unlike mlogit, ologit can exploit the ordering in the estimation process. (Stata also provides oprobit for fitting ordered probit models.) As with mlogit the categorical dependent variable may take on any values whatsoever.

See Greene (2012) for a straightforward description of the models fitted by clogit, mlogit, ologit, and oprobit.

References

Breslow, N. E. 1974. Covariance analysis of censored survival data. Biometrics 30: 89–99.

Greene, W. H. 2012. Econometric Analysis. 8th ed. Upper Saddle River, NJ: Prentice Hall.

Hosmer, D. W. Jr., S. Lemeshow, and Sturdivant R. X. 2013. Applied Logistic Regression. 3rd ed. New York: Wiley.

McFadden, D. 1974. Conditional logit analysis of qualitative choice behavior. In Frontiers in Econometrics, ed. P. Zarembka, 105–142. New York: Academic Press.

Products

New in Stata 19

Why Stata

All features

Disciplines

Stata/MP

StataNow

Order Stata

Purchase

Order Stata

Bookstore

Stata Press

Stata Journal

Gift Shop

Learn

Free webinars

NetCourses

Classroom and web training

Organizational training

Video tutorials

Third-party courses

Web resources

Teaching with Stata

Support

Training

Video tutorials

FAQs

Statalist: The Stata Forum

Resources

Technical support

Customer service

Alerts

Company

News and events

Customer service

Careers

We use cookies

We use cookies to ensure that we give you the best experience on our website—to enhance site navigation, to analyze usage, and to assist in our marketing efforts. By continuing to use our site, you consent to the storing of cookies on your device and agree to delivery of content, including web fonts and JavaScript, from third party web services.

Cookie Settings

Privacy policy

Last updated: 16 November 2022

StataCorp LLC (StataCorp) strives to provide our users with exceptional products and services. To do so, we must collect personal information from you. This information is necessary to conduct business with our existing and potential customers. We collect and use this information only where we may legally do so. This policy explains what personal information we collect, how we use it, and what rights you have to that information.

Required cookies

Advertising cookies

Required cookies

These cookies are essential for our website to function and do not store any personally identifiable information. These cookies cannot be disabled.
Advertising and performance cookies

This website uses cookies to provide you with a better user experience. A cookie is a small piece of data our website stores on a site visitor's hard drive and accesses each time you visit so we can improve your access to our site, better understand how you use our site, and serve you content that may be of interest to you. For instance, we store a cookie when you log in to our shopping cart so that we can maintain your shopping cart should you not complete checkout. These cookies do not directly store your personal information, but they do support the ability to uniquely identify your internet browser and device.

Please note: Clearing your browser cookies at any time will undo preferences saved here. The option selected here will apply only to the device you are currently using.

Accept Cookies


low		Odds ratio Std. err. z P>\|z\| [95% conf. interval]

age		.9732636 .0354759 -0.74 0.457 .9061578 1.045339
lwt		.9849634 .0068217 -2.19 0.029 .9716834 .9984249

race
Black		3.534767 1.860737 2.40 0.016 1.259736 9.918406
Other		2.368079 1.039949 1.96 0.050 1.001356 5.600207

smoke		2.517698 1.00916 2.30 0.021 1.147676 5.523162
ptl		1.719161 .5952579 1.56 0.118 .8721455 3.388787
ht		6.249602 4.322408 2.65 0.008 1.611152 24.24199
ui		2.1351 .9808153 1.65 0.099 .8677528 5.2534
_cons		1.586014 1.910496 0.38 0.702 .1496092 16.8134

Note: _cons estimates baseline odds.