Bookmark and Share

Notice: On March 31, it was announced that Statalist is moving from an email list to a forum. The old list will shut down on April 23, and its replacement, is already up and running.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: Stata13 Wishlist- dealing with large number of fixed effects and dummy variables

From   William Buchanan <>
To   "" <>
Subject   Re: st: Stata13 Wishlist- dealing with large number of fixed effects and dummy variables
Date   Tue, 5 Feb 2013 05:40:47 -0800

How many observations are in your dataset and have you considered that the problem might be with the model you are fitting to the data?  It doesn't seem remotely parsimonious to use that many indicators (or it hardly seems that a model with that many variables is simplifying the true data generating process).  Maybe you could provide some example that others could replicate in order to get a better idea of the difficulty you're running into.


Sent from my iPhone

On Feb 5, 2013, at 5:08, Suryadipta Roy <> wrote:

> This is a general SOS call in dealing with large number of fixed
> effects (or dummy variables) with commonly used Stata commands, e.g.
> -reg- , -heckman- , -xtreg-, etc., viz. in dealing with large dyadic
> datasets (e.g. used in the gravity literature in international
> economics). For example, I was trying to run -heckman- with over
> 34,000 dummy variables in a panel data for over 150 importer-exporter
> countries over 25 years, where I need to control for various kinds of
> fixed effects, and Stata has not been able to complete a single
> regression after running for about 10 hours. I have had similar
> experiences with the above-mentioned commands as well for large
> datasets. -areg- , to a certain extent addresses the problem, but then
> runs into problems when one needs to control for a different kinds of
> fixed effects. For the record, I am using StataMP 12.1 in a dual core
> processor machine (8 GB memory), and the form of the -heckman-command
> that I was using is below:
> heckman dep_var ind_vars, select(selct_depvar = ind_vars excluded_var
> i.fixed_effects1 i.fixed_effects2) twostep, where fixed_effects1 and
> fixed_effects2 each comprises of a large number of dummy variables.
> I wish there was a general help for this kind of problem with Stata 13.
> Sincerely,
> Suryadipta.
> *
> *   For searches and help try:
> *
> *
> *

*   For searches and help try:

© Copyright 1996–2016 StataCorp LP   |   Terms of use   |   Privacy   |   Contact us   |   Site index