Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

# st: RE: Fractional logit, selection and panel data

 From "Seliger Florian" To "statalist@hsphsun2.harvard.edu" Subject st: RE: Fractional logit, selection and panel data Date Tue, 11 Feb 2014 08:14:32 +0000

```Dear all,
I apologize for posting again, but obviously there is a crude mistake in my command below: In order to calculate the inverse mills ratio, you have to estimate a probit model.
Still, I do  not know how to account for selection using "glm".
Do you know whether "frm" is the command I should use?

Best,
Florian

-----Original Message-----
From: owner-statalist@hsphsun2.harvard.edu [mailto:owner-statalist@hsphsun2.harvard.edu] On Behalf Of Seliger Florian
Sent: Montag, 10. Februar 2014 21:38
To: 'statalist@hsphsun2.harvard.edu'
Subject: st: Fractional logit, selection and panel data

Dear Statalist,

There were several discussion on modeling proportions , e.g. http://www.stata.com/statalist/archive/2012-10/msg00241.html.

I've learnt that
1. Tobit is not appropriate for proportions although it is widely used in practice 2. Fractional logit is the right method.
3. It can be implemented by glm

Just as in the old thread, my dependent variable is a sales share with certain products (0 - 100%).
I also have a selection equation where the dependent tells me whether firms introduced a product at all (binary coded).
On the right-hand side, I have more or less the same variables in both equations (except for an exclusion restriction that might be added in the selection equation).

I'm wondering if I can proceed as follows:

*Estimate selection equation with logit
logit newproduct x1 x2 x3
*Calculate inverse-mills ratio
predict inno_hat if e(sample), xb
gen pdflogit_inno=normalden(inno_hat)
gen cdflogit_inno=normal(inno_hat)
gen imr=pdflogit_inno/cdflogit_inno
*Estimate outcome equation with glm and include inverse mills ratio glm salesshare_with_new_product/100 x1 x2 x3 imr, family(binomial) link(logit)

I have also noticed a command for estimation of one- and two-part fractional regression models called "frm" by Ramalho http://evunix.uevora.pt/~jsr/FRM.htm

I don't have a clue what's appropriate in my case. Do any of you have experience with "frm" or with selection models  in this context?
Any help is appreciated!

For panel fractional data, there is a presentation of J. Wooldridge http://www.stata.com/meeting/chicago11/abstracts/#wooldridge
Although there is a command "xtgee" for panel data, Wooldridge seems to suggest (if I get him right) to use xtreg for unbalanced panel data.
Does anyone know why not to use xtgee and why xtreg appears to be appropriate for fractions?

Best regards,
Florian

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/
```