___ ____ ____ ____ ____ ®
/__ / ____/ / ____/ 18.0
___/ / /___/ / /___/ SE—Standard Edition
Statistics and Data Science Copyright 19852023 StataCorp LLC
StataCorp
4905 Lakeway Drive
College Station, Texas 77845 USA
800STATAPC https://www.stata.com
9796964600 stata@stata.com
Stata license: 10user network perpetual
Serial number: 1
Licensed to: Stata Developer
StataCorp LLC
Notes:
1. Stata is running in batch mode.
2. Unicode is supported; see help unicode_advice.
3. Maximum number of variables is set to 5,000 but can be increased;
see help set_maxvar.
Running /home/krg/bin/profile.do ...
Compile number 180110
. do sl7b.do
.
. /* NIST/ITL StRD benchmark
> *** MODIFIED by William Gould, StataCorp.
> ***
> *** The dependent variable in the original test had values such as
> *** as 1000000000000.4, 1000000000000.3, etc. These numbers cannot be
> *** stored on a binary computer with more than 4 digits of accuracy.
> *** E.g., in double precision,
> ***
> *** 1000000000000.4  1000000000000 = .40002441...
> ***
> *** That is, you might enter 1000000000000.4, but a doubleprecision binar
> y
> *** computer actually stores the number 1000000000000.40002441...
> ***
> *** Thus, even a perfectly accurate ANOVA routine could not obtain the
> *** the results the authors intended because, at the instant the data was
> *** stored, numbers were rounded. The test, as given, amounts to a test
> *** of whether data is being stored in better than binary double precision
> .
> ***
> *** The intention of the test, I believe, was to determine if the ANOVA
> *** routine could deal with numbers that varied only in their trailing
> *** digits.
> ***
> *** Thus, the test is modified as follows:
> ***
> *** 1) y is multiplied by 10. What was previously 1000000000000.4
> *** now becomes 10000000000004, a number that a digital computer
> *** can store accurately in double precision.
> ***
> *** 2) All results should now be as the authors originally expect
> *** except sums of squares of y will be multiplied by 100 and
> *** the root mean square error will be mutliplied by 10.
> ***
> *** The remaining comments below, from the NIST data, are not modified.
>
> ANOVA
>
> Difficulty=Higher n_i=21 k=9 Generated
>
> Dataset Name: SimonLesage7 (SimonLesage7.dat)
>
>
> Procedure: Analysis of Variance
>
>
> Reference: Simon, Stephen D. and Lesage, James P. (1989).
> "Assessing the Accuracy of ANOVA Calculations in
> Statistical Software".
> Computational Statistics & Data Analysis, 8, pp. 325332.
>
>
> Data: 1 Factor
> 9 Treatments
> 21 Replicates/Cell
> 189 Observations
> 13 Constant Leading Digits
> Higher Level of Difficulty
> Generated Data
>
>
> Model: 10 Parameters (mu,tau_1, ... , tau_9)
> y_{ij} = mu + tau_i + epsilon_{ij}
>
>
> Certified Values:
>
> Source of Sums of Mean
> Variation df Squares Squares F Statisti
> c
>
> Between Treatment 8 1.68000000000000E+00 2.10000000000000E01
> 2.10000000000000E+01
> Within Treatment 180 1.80000000000000E+00 1.00000000000000E02
>
> Certified RSquared 4.82758620689655E01
>
> Certified Residual
> Standard Deviation 1.00000000000000E01
> */
.
. clear
.
. scalar N = 189
. scalar df_r = 180
. scalar df_m = 8
.
. scalar mss = 168
. scalar F = 21
. scalar rss = 180
. scalar r2 = 4.82758620689655E01
. scalar rmse = 1
.
. qui input byte treat double resp
.
. anova resp treat
Number of obs = 189 Rsquared = 0.4828
Root MSE = 1 Adj Rsquared = 0.4598
Source  Partial SS df MS F Prob>F
+
Model  168 8 21 21.00 0.0000

treat  168 8 21 21.00 0.0000

Residual  180 180 1
+
Total  348 188 1.8510638
.
. assert N == e(N)
. assert df_r == e(df_r)
. assert df_m == e(df_m)
.
. lrecomp e(F) F e(rmse) rmse e(r2) r2 e(mss) mss e(rss) rss
e(F)
e(rmse)
e(r2) 15.6
e(mss)
e(rss)
.
end of dofile