StataData Analysis and Statistical Software

Capabilities

Data management

data transformations, match-merge, import/export data, ODBC, SQL, XML, by-group processing, append files, sort, row–column transposition, labeling, saving results, more

Basic statistics

summaries, cross-tabulations, correlations, t tests, equality-of-variance tests, tests of proportions, confidence intervals, factor variables, more

Linear models

regression; bootstrap, jackknife, and robust Huber/White/sandwich variance estimates; instrumental variables; three-stage least squares; constraints; quantile regression; GLS; more

Longitudinal data/panel data

random and fixed effects with robust standard errors, linear mixed models, random-effects probit, GEE, random- and fixed-effects Poisson, dynamic panel-data models, and instrumental-variables regression; panel unit-root tests; AR(1) disturbances; more

Multilevel mixed-effects models

continuous, binary, and count outcomes; two-, three-, and multiway random-intercepts and random-coefficients models; crossed random effects; ML and REML estimation; BLUPs of effects and fitted values; hierarchical models; residual error structures; support for survey data in linear multilevel models; more

Binary, count, and limited dependent variables

logistic, probit, tobit; Poisson and negative binomial; conditional, multinomial, nested, ordered, rank-ordered, and stereotype logistic; multinomial probit; zero-inflated and left-truncated count models; selection models; marginal effects; more

ANOVA/MANOVA

balanced and unbalanced designs; factorial, nested, and mixed designs; repeated measures; marginal means; contrasts; more

SEM (Structural equation modeling)

graphical model builder, standardized and unstandardized estimates, modification indices, direct and indirect effects, path diagrams, factors scores and other predictions, estimations with groups and tests of invariance, goodness of fit, handling of MAR data by FIML, survey data, clustered data, more

Multivariate methods

factor analysis, principal components, discriminant analysis, rotation, multidimensional scaling, Procrustean analysis, correspondence analysis, biplots, dendrograms, user-extensible analyses, more

Cluster analysis

hierarchical clustering; kmeans and kmedian nonhierarchical clustering; dendrograms; stopping rules; user-extensible analyses; more

Generalized linear models (GLMs)

ten link functions, user-defined links, seven distributions, ML and IRLS estimation, nine variance estimators, seven residuals, more

Nonparametric methods

Wilcoxon–Mann–Whitney, Wilcoxon signed ranks, and Kruskal–Wallis tests; Spearman and Kendall correlations; Kolmogorov–Smirnov tests; exact binomial CIs; survival data; ROC analysis; smoothing; bootstrapping; more

Exact statistics

exact logistic and Poisson regression, exact case–control statistics, binomial tests, Fisher’s exact test for r × c tables, more

Resampling and simulation methods

bootstrap, jackknife, and Monte Carlo simulation, permutation tests, more

Internet capabilities

ability to install new commands, web updating, web file sharing, latest Stata news, more

Sample session

A sample session of Stata for Mac, Unix, or Windows.

User-written commands

user-written commands for meta-analysis, data management, survival, econometrics, more

Graphics

line charts, scatterplots, bar charts, pie charts, hi–lo charts, contour plots, Graph Editor, regression diagnostic graphs, survival plots, nonparametric smoothers, distribution Q–Q plots, more

Graphical User Interface

Results window, Command window, Review window, Data Editor, Variables Manager, Do-file Editor, variable properties, Viewer, Clipboard Preview Tool, menus/dialogs for all commands, multiple preference sets, more

Time series

ARIMA, ARFIMA, ARCH/GARCH, VAR, VECM, multivariate GARCH, unobserved components model, dynamic factors, state-space models, business calendars, correlograms, periodograms, forecasts, impulse-response functions, unit-root tests, filters and smoothers, rolling and recursive estimation, more

Survey methods

multistage designs; bootstrap, BRR, jackknife, linearized, and SDR variance estimation; poststratification; DEFF; predictive margins; means, proportions, ratios, totals; summary tables; regression, instrumental variables, probit, Cox regression; more

Survival analysis

Kaplan–Meier and Nelson–Aalen estimators, Cox regression (frailty); parametric models (frailty); competing risks; hazards; time-varying covariates; left and right censoring, Weibull, exponential, and Gompertz analysis; sample size and power analysis; more

Tests, predictions, and effects

Wald tests; LR tests; linear and nonlinear combinations, predictions and generalized predictions, marginal means, least-squares means, adjusted means; marginal and partial effects; Hausman tests; more

Contrasts and pairwise comparisons

compare means, intercepts, or slopes; compare adjacent categories; compare with reference category or grand mean; orthogonal polynomials; adjust for multiple comparisons; treatment effects; graph effects and potential outcomes; more

Epidemiology

standardization of rates, case–control, cohort, matched case–control, Mantel–Haenszel, pharmacokinetics, ROC analysis, ICD-9-CM, more

Multiple imputation

nine univariate imputation methods, multivariate normal imputation, chained equations, explore pattern of missingness, manage imputed datasets, fit model and pool results, transform parameters, joint tests of parameter estimates, predictions, more

Other statistical methods

sample size and power, kappa mesaure of interrater agreement, Cronbach's alpha, stepwise regression, statistical and mathematical functions, more

GMM and nonlinear regression

generalized method of moments (GMM), nonlinear regression, more

Maximum likelihood

user-specified functions; NR, DFP, BFGS, BHHH; OIM, OPG, robust, bootstrap, and jackknife matrices; Wald tests; survey data; numeric or analytic derivatives; more

Programming language

adding new commands, command scripting, if, while, command parsing, debugging, menu and dialog-box programming, markup and control language, more

Matrix programming—Mata

interactive sessions, large-scale development projects, optimization, matrix inversions, decompositions, eigenvalues and eigenvectors, LAPACK engine, real and complex numbers, string matrices, interface to Stata datasets and matrices, numerical derivatives, object-oriented programming, more

Embedded statistical computations

Numerics by Stata

Installation Qualification

IQ report for regulatory agencies such as the FDA, installation verification accessibility for persons with disabilities

Accessibility

Section 508 compliance, accessibility for persons with disabilities


Search Stata’s help files