Bookmark and Share

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

st: Restricting sample size


From   Amal Khanolkar <[email protected]>
To   "[email protected]" <[email protected]>
Subject   st: Restricting sample size
Date   Fri, 29 Nov 2013 09:50:57 +0000

Hello All,

I'm running a set of multivariate linear regression analysis stratified by ethnicity (ethnicity_bi2x) as follows:

eststo clear
eststo: regress syst bwtgestage_sd age byear i.conscript_office bmi height_cons ib2.education2 ib3.gestwk_catx2 if multibirth==1 & sex==1 & ethnicity_bi2x==1, cluster (mor_lopnr)
eststo: regress syst bwtgestage_sd age byear i.conscript_office bmi height_cons ib2.education2 ib3.gestwk_catx2 if multibirth==1 & sex==1 & ethnicity_bi2x==2, cluster (mor_lopnr)
eststo: regress syst bwtgestage_sd age byear i.conscript_office bmi height_cons ib2.education2 ib3.gestwk_catx2 if multibirth==1 & sex==1 & ethnicity_bi2x==3, cluster (mor_lopnr)
esttab, ci 

The ethncity_bi2x variable has three categories as follows:

ethnicity |
     cauca & |
   non-cauca |      Freq.     Percent        Cum.
-------------+-----------------------------------
     Swedish |    394,050       96.86       96.86
    European |     10,287        2.53       99.39
non-European |      2,496        0.61      100.00
-------------+-----------------------------------
       Total |    406,833      100.00


The Swedish group is very large relative to the two other categories. As this might bias results for potential differences in outcomes of interest between the three ethnic groups (non-Europeans being very much smaller then the Swedish group), I would also like to run the analysis by reducing the Swedish group from 394,050 to say just 50,000 or 100,000 subjects. How would I tell Stata to run the above analysis restricting to a random sample when ethncity_bi2x==1  (i.e. the Swedish group) ?


My other question would then be how would I choose an appropriate number for restricting the sample for the Swedish group? 

Thanks for any help in advance

Kind regards,


/Amal 



*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/


© Copyright 1996–2018 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   Site index