Stata: Data Analysis and Statistical Software

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

st: xtivreg2, clustered errors and F statistic

From	"Anna Rosso" <[email protected]>
To	<[email protected]>
Subject	st: xtivreg2, clustered errors and F statistic
Date	Tue, 11 Oct 2011 23:08:49 +0100

Dear list,

I am using Stata/MP 11.2 for Unix (Linux 64-bit x86-64)
Born 30 Mar 2011

I am running IV regressions using xtivreg2 (latest updated version) on a
panel of 16 regions and 11 years.

I estimate using regional fixed effects. My specification includes among the
RHS variables year dummies, 6 continuous control variables, and the
endogenous regressor of interest.
I use 90  IVs to instrument my endogenous regressor, and I cluster standard
errors at the regional level.

The problem is the following:
When I only include as RHS variables region and year dummies and the
endogenous regressor, my first stage F-stat for the significance of excluded
instruments goes to infinite. This is what I would expect given the degrees
of freedom are "negative": F stat is distributed as F(k,d-k)
Where k is the number of constraints (90 in my case, as I have 90
instruments to test), d is the number of clusters(16)  

When I add the additional six control variables: my first stage F-statistic
is more than normal: 14.76. 
Do you think it is possible? I don't understand why this is happening only
when I put controls in the regression.

I have also tried to "partial out" some variables, as suggested in Baum,
Schaffer and Stillman's paper("Enhanced routines for instrumental
variables/generalized method of moments" The Stata Journal (2007), 7, Number
4, pp. 465-506) when the number of clusters is less than the number of
exogenous regressors + excluded instruments. Partialling out some exogenous
regressors helps the covariance matrix of orthogonality
conditions to have full rank. Unfortunately, this still has not solved the
problem as I have many instruments.
Also, the Kleibergen-Paap Wald rk F statistic  (which is the one suggested
by the authors of the above paper in case of clustered errors) is reported
as missing.

I report the command used and the output of first stage statistics when I
only control for year dummies using fixed effect estimator (xtivreg2 with fe
and cluster() options)
----------------------------------------------------------------------------
---------------------------------------------------------------------------
xi: xtivreg2 netpay  (share_reg = GWmean40_UK_* GWmean40_USA_* GWmean40_DE_*
mean2004_40_UK_* mean2004_40_USA_* mean2004_40_DE_*) i.year if
year>=1997&year<=2007 ,fe cluster(won) first

.....

F test of excluded instruments:
  F( 90,    15) =  1.3e+13
  Prob > F      =   0.0000
Angrist-Pischke multivariate F test of excluded instruments:
  F( 90,    15) =  6.7e+12
  Prob > F      =   0.0000

Summary results for first-stage regressions
-------------------------------------------

                                           (Underid)            (Weak id)
Variable     | F( 90,    15)  P-val | AP Chi-sq( 90) P-val | AP F( 90,
15)
share_reg    |    1.3e+13    0.0000 |     1.5e+15   0.0000 |     6.7e+12

NB: first-stage test statistics cluster-robust

.....

Underidentification test
Ho: matrix of reduced form coefficients has rank=K1-1 (underidentified)
Ha: matrix has rank=K1 (identified)
Kleibergen-Paap rk LM statistic          Chi-sq(90)=.       P-val=     .

Weak identification test
Ho: equation is weakly identified
Cragg-Donald Wald F statistic                                       2.99
Kleibergen-Paap Wald rk F statistic                                    .
.....

Weak-instrument-robust inference
Tests of joint significance of endogenous regressors B1 in main equation
Ho: B1=0 and orthogonality conditions are valid
Anderson-Rubin Wald test           F(0,15)=           .     P-val=     .
Anderson-Rubin Wald test           Chi-sq(0)=         .     P-val=     .
Stock-Wright LM S statistic        Chi-sq(0)=         .     P-val=     .

....

Number of clusters             N_clust  =         16
Number of observations               N  =        176
Number of regressors                 K  =         11
Number of endogenous regressors      K1 =          1
Number of instruments                L  =        100
Number of excluded instruments       L1 =          0
----------------------------------------------------------------------------
----------------------------------------------------------------------------
And this is the command and output with extra controls:

----------------------------------------------------------------------------
----------------------------------------------------------------------------
xi: xtivreg2 netpay  public pop age sex shar_ed2 shar_ed3 (share_reg =
GWmean40_UK_* GWmean40_USA_* GWmean40_DE_* mean2004_40_UK_*
mean2004_40_USA_* mean2004_40_DE_*) i.year if year>=1997&year<=2007,fe
cluster(won) first

.......

F test of excluded instruments:
  F( 90,    15) =    14.73
  Prob > F      =   0.0000
Angrist-Pischke multivariate F test of excluded instruments:
  F( 90,    15) =    14.73
  Prob > F      =   0.0000

Summary results for first-stage regressions
-------------------------------------------

                                           (Underid)            (Weak id)
Variable     | F( 90,    15)  P-val | AP Chi-sq( 90) P-val | AP F( 90,
15)
share_reg    |      14.73    0.0000 |     3535.83   0.0000 |       14.73

NB: first-stage test statistics cluster-robust

.......

Underidentification test
Ho: matrix of reduced form coefficients has rank=K1-1 (underidentified)
Ha: matrix has rank=K1 (identified)
Kleibergen-Paap rk LM statistic          Chi-sq(90)=.       P-val=     .

Weak identification test
Ho: equation is weakly identified
Cragg-Donald Wald F statistic                                       2.53
Kleibergen-Paap Wald rk F statistic                                    .

......

Weak-instrument-robust inference
Tests of joint significance of endogenous regressors B1 in main equation
Ho: B1=0 and orthogonality conditions are valid
Anderson-Rubin Wald test           F(6,15)=       16.04     P-val=0.0000
Anderson-Rubin Wald test           Chi-sq(6)=    256.57     P-val=0.0000
Stock-Wright LM S statistic        Chi-sq(6)=         .     P-val=     .

NB: Underidentification, weak identification and weak-identification-robust
    test statistics cluster-robust

Number of clusters             N_clust  =         16
Number of observations               N  =        176
Number of regressors                 K  =         17
Number of endogenous regressors      K1 =          1
Number of instruments                L  =        106
Number of excluded instruments       L1 =          6
----------------------------------------------------------------------------
----------------------------------------------------------------------------


Thanks for your consideration.

Best regards,

Anna Rosso


*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

Follow-Ups:
- st: RE: xtivreg2, clustered errors and F statistic
  - From: "Schaffer, Mark E" <[email protected]>

Prev by Date: st: Assistance with manipulating a social network dataset?
Next by Date: Re: st: Assistance with manipulating a social network dataset?
Previous by thread: st: Assistance with manipulating a social network dataset?
Next by thread: st: RE: xtivreg2, clustered errors and F statistic
Index(es):
- Date
- Thread