Stata 11 help for summarize

help summarize dialog: summarize -------------------------------------------------------------------------------

Title

[R] summarize -- Summary statistics

Syntax

summarize [varlist] [if] [in] [weight] [, options]

options description ------------------------------------------------------------------------- Main detail display additional statistics meanonly suppress the display; calculate only the mean; programmer's option format use variable's display format separator(#) draw separator line after every # variables; default is separator(5) display_options control spacing and base and empty cells

------------------------------------------------------------------------- varlist may contain factor variables; see fvvarlist. varlist may contain time-series operators; see tsvarlist. by is allowed; see [D] by. aweights, fweights, and iweights are allowed. However, iweights may not be used with the detail option; see weight.

Menu

Statistics > Summaries, tables, and tests > Summary and descriptive statistics > Summary statistics

Description

summarize calculates and displays a variety of univariate summary statistics. If no varlist is specified, summary statistics are calculated for all the variables in the dataset.

Also see [R] ci for calculating the standard error and confidence intervals of the mean.

Options

+------+ ----+ Main +-------------------------------------------------------------

detail produces additional statistics including skewness, kurtosis, the four smallest and four largest values, and various percentiles.

meanonly, which is allowed only when detail is not specified, suppresses the display of results and calculation of the variance. Ado-file writers will find this useful for fast calls.

format requests that the summary statistics be displayed using the display formats associated with the variables rather than the default g display format; see [D] format.

separator(#) specifies how often to insert separation lines into the output. The default is separator(5), meaning that a line is drawn after every five variables. separator(10) would draw a line after every 10 variables. separator(0) suppresses the separation line.

display_options: vsquish, noemptycells, baselevels, allbaselevels; see [R] estimation options.

Examples

. sysuse auto . summarize . summarize mpg weight . summarize mpg weight if foreign . summarize mpg weight if foreign, detail . summarize i.rep78

Saved results

summarize saves the following in r():

Scalars r(N) number of observations r(mean) mean r(skewness) skewness (detail only) r(min) minimum r(max) maximum r(sum_w) sum of the weights r(p1) 1st percentile (detail only) r(p5) 5th percentile (detail only) r(p10) 10th percentile (detail only) r(p25) 25th percentile (detail only) r(p50) 50th percentile (detail only) r(p75) 75th percentile (detail only) r(p90) 90th percentile (detail only) r(p95) 95th percentile (detail only) r(p99) 99th percentile (detail only) r(Var) variance r(kurtosis) kurtosis (detail only) r(sum) sum of variable r(sd) standard deviation

Also see

Manual: [R] summarize

Help: [R] ameans, [R] centile, [D] codebook, [D] describe, [D] inspect, [R] mean, [ST] stsum, [SVY] svy estimation, [R] table, [R] tabstat, [R] tabulate, summarize(), [XT] xtsum


© Copyright 1996–2009 StataCorp LP   |   Terms of use   |   Privacy   |   Contact us   |   What's new   |   Site index