|
Data management
|
data transformations,
match-merge,
import/export data,
ODBC,
SQL,
XML,
by-group processing,
append files,
sort,
row–column transposition,
labeling,
saving results,
more |
Basic statistics
|
summaries,
cross-tabulations,
correlations,
t tests,
equality-of-variance tests,
tests of proportions,
confidence intervals,
factor variables,
more |
Linear models
|
regression;
bootstrap,
jackknife,
and robust Huber/White/sandwich variance estimates;
instrumental variables;
three-stage least squares;
constraints;
quantile regression;
GLS;
more |
Longitudinal data/panel data
|
random and fixed effects with robust standard errors,
linear mixed models,
random-effects probit,
GEE,
random- and fixed-effects Poisson,
dynamic panel-data models,
and instrumental-variables regression;
panel unit-root tests;
AR(1) disturbances;
more |
Multilevel mixed-effects models
|
continuous, binary, and count outcomes; two-, three-, and
multiway random-intercepts and random-coefficients
models; crossed random effects; ML and REML estimation; BLUPs of
effects and fitted values; hierarchical models; residual error structures;
support for survey data in linear multilevel models;
more |
Binary, count, and limited dependent variables
|
logistic,
probit,
tobit;
Poisson and negative binomial;
conditional,
multinomial,
nested,
ordered,
rank-ordered,
and stereotype logistic;
multinomial probit;
zero-inflated and left-truncated count models;
selection models;
marginal effects;
more |
ANOVA/MANOVA
|
balanced and unbalanced designs;
factorial, nested, and mixed designs;
repeated measures;
marginal means;
contrasts;
more |
SEM (Structural equation modeling)
|
graphical model builder,
standardized and unstandardized estimates,
modification indices,
direct and indirect effects,
path diagrams,
factors scores and other predictions,
estimations with groups and tests of invariance,
goodness of fit,
handling of MAR data by FIML,
survey data, clustered data,
more |
Multivariate methods
|
factor analysis,
principal components,
discriminant analysis,
rotation,
multidimensional scaling,
Procrustean analysis,
correspondence analysis,
biplots,
dendrograms,
user-extensible analyses,
more |
Cluster analysis
|
hierarchical clustering;
kmeans and kmedian nonhierarchical clustering;
dendrograms;
stopping rules;
user-extensible analyses;
more |
Generalized linear models (GLMs)
|
ten link functions,
user-defined links,
seven distributions,
ML and IRLS estimation,
nine variance estimators,
seven residuals,
more |
Nonparametric methods
|
Wilcoxon–Mann–Whitney,
Wilcoxon signed ranks, and Kruskal–Wallis tests;
Spearman and Kendall correlations;
Kolmogorov–Smirnov tests;
exact binomial CIs;
survival data;
ROC analysis;
smoothing;
bootstrapping;
more |
Exact statistics
|
exact logistic and Poisson regression,
exact case–control statistics,
binomial tests,
Fisher’s exact test for r × c tables,
more |
Resampling and simulation methods
|
bootstrap,
jackknife, and Monte Carlo simulation,
permutation tests,
more |
Internet capabilities
|
ability to install new commands,
web updating,
web file sharing,
latest Stata news,
more |
Sample session
User-written commands
|
user-written commands for meta-analysis, data management, survival,
econometrics, more |
|
Graphics
|
line charts, scatterplots,
bar charts,
pie charts,
hi–lo charts,
contour plots,
Graph Editor,
regression diagnostic graphs,
survival plots,
nonparametric smoothers,
distribution Q–Q plots,
more |
Graphical User Interface
|
Results window,
Command window,
Review window,
Data Editor,
Variables Manager,
Do-file Editor,
variable properties,
Viewer,
Clipboard Preview Tool,
menus/dialogs for all commands,
multiple preference sets,
more |
Time series
|
ARIMA,
ARFIMA,
ARCH/GARCH,
VAR,
VECM,
multivariate GARCH,
unobserved components model,
dynamic factors,
state-space models,
business calendars,
correlograms,
periodograms,
forecasts,
impulse-response functions,
unit-root tests,
filters and smoothers,
rolling and recursive estimation,
more |
Survey methods
|
multistage designs;
bootstrap,
BRR,
jackknife,
linearized, and
SDR variance estimation;
poststratification;
DEFF;
predictive margins;
means,
proportions,
ratios,
totals;
summary tables;
regression,
instrumental
variables,
probit,
Cox regression;
more |
Survival analysis
|
Kaplan–Meier and
Nelson–Aalen estimators,
Cox regression (frailty);
parametric models (frailty);
competing risks;
hazards;
time-varying covariates;
left and right censoring,
Weibull,
exponential,
and Gompertz analysis;
sample size and power analysis;
more |
Tests, predictions, and effects
|
Wald tests;
LR tests;
linear and nonlinear combinations,
predictions and generalized predictions,
marginal means,
least-squares means,
adjusted means;
marginal and partial effects;
Hausman tests;
more |
Contrasts and pairwise comparisons
|
compare means,
intercepts,
or slopes;
compare adjacent categories;
compare with reference category or grand mean;
orthogonal polynomials;
adjust for multiple comparisons;
treatment effects;
graph effects and
potential outcomes;
more |
Epidemiology
|
standardization of rates,
case–control,
cohort,
matched case–control,
Mantel–Haenszel,
pharmacokinetics,
ROC analysis,
ICD-9-CM,
more |
Multiple imputation
|
nine univariate imputation methods, multivariate normal imputation,
chained equations, explore pattern of missingness, manage imputed datasets,
fit model and pool results, transform parameters, joint tests of
parameter estimates, predictions,
more |
Other statistical methods
|
sample size and power,
kappa mesaure of interrater agreement,
Cronbach's alpha,
stepwise regression,
statistical and
mathematical functions,
more
|
GMM and nonlinear regression
|
generalized method of moments (GMM),
nonlinear regression,
more
|
Maximum likelihood
|
user-specified functions;
NR, DFP, BFGS, BHHH;
OIM, OPG, robust, bootstrap, and jackknife matrices;
Wald tests;
survey data;
numeric or analytic derivatives;
more |
Programming language
|
adding new commands,
command scripting,
if,
while,
command parsing,
debugging,
menu and dialog-box programming,
markup and control language,
more |
Matrix programming—Mata
|
interactive sessions,
large-scale development projects,
optimization,
matrix inversions,
decompositions,
eigenvalues and eigenvectors,
LAPACK engine,
real and complex numbers,
string matrices,
interface to Stata datasets and matrices,
numerical derivatives,
object-oriented programming,
more |
Embedded statistical computations
Installation Qualification
|
IQ report for regulatory agencies such as the FDA, installation verification
accessibility for persons with disabilities |
Accessibility
|
Section 508 compliance,
accessibility for persons with disabilities |
|