[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

From |
"Nick Cox" <n.j.cox@durham.ac.uk> |

To |
<statalist@hsphsun2.harvard.edu> |

Subject |
RE: st: overlaying two histograms (or distribution curves) |

Date |
Sun, 22 Nov 2009 16:34:10 -0000 |

I'm referring to statistical science generally (including corners of it such as econometrics) -- and naturally as I know it. The absence of a trumpet-blast entry in the Stata manual will have had some consequences, but only a small effect in that context. Nick n.j.cox@durham.ac.uk Martin Weiss "Why is this graph form ... not enormously better known? Probable answer: Manual entry too short (half a page, [R], p. 348), and little publicity elsewhere: http://www.stata-journal.com/sjpdf.html?articlenum=gr0003 Nick Cox For comparison of precisely two distributions -- especially when there is not a prior prejudice or hypothesis that we are expecting approximations to some named, equation-specified distribution -- I regard quantile-quantile plots as near optimal. They allow you to focus on both similarity and differences and to think directly in terms of what is being measured. Why is this graph form which is (a) information-rich (b) free of arbitrary assumptions (bin or kernel width, etc.) (c) easy to explain (d) easy to compute (e) well documented not enormously better known? See -qqplot-. Nick n.j.cox@durham.ac.uk Ariel Linden, DrPH Thank you both (Maarten and Austin) for all these choices I had not known about (violin, byhist, kdens). Austin, I don't have a known distribution per-se. I have two groups (treated and controls), and the outcome variables could follow any distribution. The motivation for this is to visually describe how the distribution of an outcome variable overlaps (or doesn't) between two groups. Date: Fri, 20 Nov 2009 10:47:06 -0500 From: Austin Nichols <austinnichols@gmail.com> Maarten-- I think that is the same graph I gave for comparison purposes, but I don't think it compares well with -byhist- unless one takes a bit more care on the -kdensity- side--the kernel density estimates should at least use the same bandwidths, and perhaps the same estimation points if we really wish to compare them. Other considerations might apply if Ariel told us something about the theoretical distribution of the variable filling the role of "price" (is it discrete? does it have a finite range?). * * For searches and help try: * http://www.stata.com/help.cgi?search * http://www.stata.com/support/statalist/faq * http://www.ats.ucla.edu/stat/stata/

**References**:**Re: st: overlaying two histograms (or distribution curves)***From:*"Ariel Linden, DrPH" <ariel.linden@gmail.com>

**RE: st: overlaying two histograms (or distribution curves)***From:*"Nick Cox" <n.j.cox@durham.ac.uk>

**AW: st: overlaying two histograms (or distribution curves)***From:*"Martin Weiss" <martin.weiss1@gmx.de>

- Prev by Date:
**Re: st: re: TeX template for PDF help files** - Next by Date:
**Re: st: RE: Re: TeX template for PDF help files** - Previous by thread:
**AW: st: overlaying two histograms (or distribution curves)** - Next by thread:
**st: overlaying two histograms (or distribution curves)** - Index(es):

© Copyright 1996–2015 StataCorp LP | Terms of use | Privacy | Contact us | What's new | Site index |