Bookmark and Share

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: st: tab most frequently occurring


From   "Nick Cox" <[email protected]>
To   <[email protected]>
Subject   RE: st: tab most frequently occurring
Date   Wed, 17 Mar 2010 15:03:52 -0000

Thanks for the mention. -groups- is on SSC and was discussed in SJ 3-4. 

SJ-3-4  pr0011  . . . . . . . .  Speaking Stata: Problems with tables,
Part II
        . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .  N.
J. Cox
        Q4/03   SJ 3(4):420--439                                 (no
commands)
        reviews three user-written commands (tabcount, makematrix,
        and groups) as different approaches to tabulation problems

See also -modes-. 

SJ-9-4  sg113_2 . . . . . . . . . . . . . . . . . . . . .  Tabulation of
modes
        (help modes if installed) . . . . . . . . . . . . . . . . .  N.
J. Cox
        Q4/09   SJ 9(4):652
        update to allow the generate() option to record in an
        indicator variable of which observations contain values
        matching any of the modes displayed

SJ-3-2  sg113_1 . . . . . . . . . . . . . . . . . .  Software update for
modes
        (help modes if installed) . . . . . . . . . . . . . . . . .  N.
J. Cox
        Q2/03   SJ 3(2):211
        provides new option for specifying the number of modes to
        be shown

STB-50  sg113 . . . . . . . . . . . . . . . . . . . . . .  Tabulation of
modes
        (help modes if installed) . . . . . . . . . . . . . . . . .  N.
J. Cox
        7/99    pp.26--27; STB Reprints Vol 9, pp.180--181
        provides table of most frequent observations (modes)

However, to expand Scott's comment: Neither offers solutions to (b) or
(c). Maarten has given some code. A hybrid of his code and -modes- would
give you (b) and (c) as well. 

Nick 
[email protected] 

Scott Merryman

Nick Cox's -groups- can handle (a):

. sysuse auto, clear
(1978 Automobile Data)

. groups gear, select(5) order(h)

  +------------------------------------+
  | gear_r~o   Freq.   Percent    Cum. |
  |------------------------------------|
  |     2.73       9     12.16   12.16 |
  |     2.93       8     10.81   22.97 |
  |     3.08       7      9.46   32.43 |
  |     2.47       5      6.76   39.19 |
  |     2.41       3      4.05   43.24 |
  +------------------------------------+

. groups gear, select(freq>=3) order(h)

  +------------------------------------+
  | gear_r~o   Freq.   Percent    Cum. |
  |------------------------------------|
  |     2.73       9     12.16   12.16 |
  |     2.93       8     10.81   22.97 |
  |     3.08       7      9.46   32.43 |
  |     2.47       5      6.76   39.19 |
  |     2.41       3      4.05   43.24 |
  |------------------------------------|
  |     3.05       3      4.05   47.30 |
  |     3.54       3      4.05   51.35 |
  |     3.78       3      4.05   55.41 |
  +------------------------------------+

On Wed, Mar 17, 2010 at 8:30 AM, Richard Goldstein
<[email protected]> wrote:
> I want to -tabulate- a variable with many (hundreds if not thousands
of)
> different values; but, I only want to see (a) the 20 (say) most
> frequently occurring values and then (b) I want a row for "all others"
> and then (c) I want a grand total row
>
> I have searched in various ways for already existing programs but, of
> course, I may have missed something (as far as I can see, -fre- will
not
> do what I want but I would be happy to be shown that I was wrong)
>
> so, two questions:
>
> does anyone know of an already existing program for this?
>
> hints, etc. for writing my own would be welcome also if anyone has any


*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/


© Copyright 1996–2018 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   Site index