
Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.



st: suppressing low frequency observations in tabulation


From   Kevin McConeghy <[email protected]>
To   [email protected]
Subject   st: suppressing low frequency observations in tabulation
Date   Wed, 24 Oct 2012 17:08:00 -0500

Hello everyone,

I have a large dataset, roughly 6.5 million observations, from the FDA
adverse event database. The variable drugname is a string holding the
drug name.

. describe drugname

              storage  display     value
variable name   type   format      label      variable label
---------------------------------------------------------------------------------------------------------------------------------------------------
drugname        str30  %30s

I want to create a frequency table of the top 25 drug "offenders" in
the database. However, I am having trouble figuring out how to get
Stata to run the tab drugname command without including all the
low-frequency observations from random drugs (which cause Stata to
stop the command with a "too many values" error). I can't see an
option for this in the syntax. Any advice on how to filter out all the
background noise?

Kevin
-- 
Kevin McConeghy, PharmD
Infectious Diseases Fellow
University of Illinois College of Pharmacy
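
One possible way to get at the top 25 without running tabulate on every
distinct value is to collapse the data to one row per drug with contract
and then sort by count. This is only a sketch, not a tested answer for
this dataset: it assumes the data are already in memory, and n_reports
is a made-up name for the new count variable.

```stata
* Sketch only: assumes the dataset containing drugname is in memory.
* n_reports is a hypothetical variable name chosen for illustration.
preserve                              // keep the original data recoverable
contract drugname, freq(n_reports)    // one row per distinct drugname
gsort -n_reports                      // sort by count, largest first
list drugname n_reports in 1/25, noobs
restore                               // bring back the full dataset
```

Because contract replaces the data in memory with one row per distinct
value, the preserve/restore pair is what returns the full dataset
afterwards. The in 1/25 range assumes at least 25 distinct drug names.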

