Bookmark and Share

Notice: On March 31, it was announced that Statalist is moving from an email list to a forum. The old list will shut down on April 23, and its replacement, statalist.org is already up and running.


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: retrieve a continuous distribution from a categorical variable


From   "JVerkuilen (Gmail)" <jvverkuilen@gmail.com>
To   statalist@hsphsun2.harvard.edu
Subject   Re: st: retrieve a continuous distribution from a categorical variable
Date   Fri, 7 Dec 2012 10:14:40 -0500

On Fri, Dec 7, 2012 at 7:02 AM, David Hoaglin <dchoaglin@gmail.com> wrote:

> In the absence of an established family of distributions, you could
> consider fitting a curve to the known points on the cumulative
> distribution function --- the cumulative fractions at the bin
> boundaries: (45, 400/7000), (50, 850/7000), ..., (85, 6200/7000) ---
> for example, by using splines.  From the fitted curve you can then
> retrieve estimated counts for single years of age.

Good idea, with one slight addition. The original poster might want to
work on the log- or logit-transformed CDF instead of the CDF itself,
as this will help ensure that the resulting fitted CDF is proper.
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/


© Copyright 1996–2014 StataCorp LP   |   Terms of use   |   Privacy   |   Contact us   |   Site index