Bookmark and Share

Notice: On March 31, it was announced that Statalist is moving from an email list to a forum. The old list will shut down on April 23, and its replacement, statalist.org is already up and running.


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: st: Binning question


From   "David Radwin" <dradwin@mprinc.com>
To   <statalist@hsphsun2.harvard.edu>
Subject   RE: st: Binning question
Date   Mon, 01 Aug 2011 08:59:35 -0700 (PDT)

Mike, please see this thread:

http://www.stata.com/statalist/archive/2011-05/msg00743.html

David

--
David Radwin
Research Associate
MPR Associates, Inc.
2150 Shattuck Ave., Suite 800
Berkeley, CA 94704
Phone: 510-849-4942
Fax: 510-849-0794

www.mprinc.com

> -----Original Message-----
> From: owner-statalist@hsphsun2.harvard.edu [mailto:owner-
> statalist@hsphsun2.harvard.edu] On Behalf Of Nick Cox
> Sent: Monday, August 01, 2011 8:15 AM
> To: statalist@hsphsun2.harvard.edu
> Subject: Re: st: Binning question
>
> That is true of any binning method. My prejudices aside, I don't think
> what you are seeking is clearly specified in statistical terms.
>
> Nick
>
> On 1 Aug 2011, at 12:35, mcross@exemail.com.au wrote:
>
> > Thanks Nick,
> >
> > I have some 300 size values that need binning for each of my 96 smpls.
> > With so many categories - the equal-width bin method will end up
> > splitting
> > certain observations into separate bins (so for example; 39.27 might
> > end
> > up in one bin, while 39.38 might end up in the next bin) depending on
> > where the bin range ends.
> >
> > I was hoping for some code that might recognize certain size values as
> > belonging to the same distribution and bin them as such.
> >
> > Hopefully someone might suggest some code to help me bin this data.
> >
> > Cheers,
> >
> > Mike.
> >
> >
> >
> >
> > Much easier and clearer just to use equal-width bins.
> >
> > Nick
> >
> > On Sat, Jul 30, 2011 at 10:14 PM,  <mcross@exemail.com.au> wrote:
> >
> > (Stata version 8)
> >
> > Statalisters,
> >
> > I’m collecting various size measurements from multiple samples…
> >
> > Smpl   Size
> > 01   39.38
> > 01   45.26
> > 01   48.81
> > 02   39.27
> > 02   45.28
> > 02   48.94
> >
> > Size is normally distributed and the variance is small. I want to
> > bin my
> > sizes into categories (eg “39”, “45” and “49”), for
> > further analysis.
> > I’d like to define the range for each category by using the SD of ea
> > ch set
> > of size observations - eg the SD of (39.38 and 39.27).

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/


© Copyright 1996–2014 StataCorp LP   |   Terms of use   |   Privacy   |   Contact us   |   Site index