# Re: st: cluster and F test

 From sara borelli To statalist@hsphsun2.harvard.edu Subject Re: st: cluster and F test Date Sun, 6 Jul 2008 09:14:54 +0000 (GMT)

```Hi Nicholas,
thank you very much for your help.

When I test (with the F-test) 18 restrictions with 14 clusters stata drops the 5 constraints because, as you said, it can test only 13 constraints.
There is something I do not understand, however. With the cluster option the number of observations useful to estimate the standard errors becomes the number of clusters, 14. Thus, if I have 37  standard errors to estimate and only 14 clusters, how is that possible that stata is able to estimate all the standar errors, but still test only 13 constraints?
Basically, when the number of clusters is smaller than the number of regressors, is only the F-test computed in a wrong way or also the standar errors?
Thank you
Sara

--- Sab 5/7/08, Austin Nichols <austinnichols@gmail.com> ha scritto:

> Da: Austin Nichols <austinnichols@gmail.com>
> Oggetto: Re: st: cluster and F test
> A: statalist@hsphsun2.harvard.edu
> Data: Sabato 5 luglio 2008, 19:01
> sara borelli <saraborelli77@yahoo.it>:
> The cluster-robust standard error (CRSE) estimator has at
> most M-1 df
> with M clusters, so with 14 clusters you can test the joint
> sig. of at
> most 13 coefs. But the performance of the estimator gets
> worse as you
> increase the the number of constraints.  The CRSE's
> performance
> improves as M-k increases toward infinity, where M is the
> number of
> clusters and k the number of constraints you are testing,
> and for M-k
> at least 20 and clusters balanced you should expect good
> performance.
> Since you have M-k equal to one (the minimum possible
> value), you
> should expect that the estimated variance is too low and
> the F stat is
> too high, on average.  Note that clusters are like
> super-observations,
> for the purposes of the SE of estimated coefs, so a
> regression on 37
> variables with 14 clusters is a bit like a regression on 37
> vars with
> 14 obs--you really don't want to test more than one
> coef there, and
> maybe not even that many.  How are your clusters defined?
> Is there
> any possibility of adding more clusters, or redefining them
> sensibly
> so you have more clusters?
>
> On Fri, Jul 4, 2008 at 5:16 AM, sara borelli
> <saraborelli77@yahoo.it> wrote:
> > Dear Stata List members,
> >
> > I have found some related questions on FAQs, but I
> cannot fins exactly what I need.
> > I am running a regression with the cluster option. I
> have 37 independent variables (including the constant),
> 1647 observations, and 14 clusters.
> > I want to test the joint significance of 18 variables.
> > If I do NOT use the cluster option the F is calculated
> correctly as F(18, 1637).
> > But once I introduce the cluster option I get the
> following result:
> >  (1)  x1= 0
> >  (2)  x2 = 0
> >  (3)  x3 = 0
> >  (4)  x3 = 0
> >  ...
> >  (18)  x18 = 0
> >       Constraint 1 dropped
> >       Constraint 2 dropped
> >       Constraint 3 dropped
> >       Constraint 4 dropped
> >       Constraint 14 dropped
> >
> >       F( 13,    13) =  109.42
> >            Prob > F =    0.0000
> >
> > I guess stata is doing something on the degree of
> freedoms, but I have not clear what is going on, why it is
> dropping the constraints. Is the final F test calculated
> correct?
> > Thank you in advance for any help
> >
> > Sara Borelli
> *
> *   For searches and help try:
> *   http://www.stata.com/support/faqs/res/findit.html
> *   http://www.stata.com/support/statalist/faq
> *   http://www.ats.ucla.edu/stat/stata/

Posta, news, sport, oroscopo: tutto in una sola pagina.