[Date Prev][Date Next][Thread Prev][Thread Next][Date index][Thread index]

st: Re: random sample

From	"Marcella Sapun" <[email protected]>
To	<[email protected]>
Subject	st: Re: random sample
Date	Thu, 26 Oct 2006 11:31:13 -0400

Thank you Michael for your suggestion. It worked nicely too!
Marcella
 

>>> Michael Blasnik <[email protected]> 10/26/2006 11:15 AM
>>>
...
If you can read in the dataset, then I'd recommend using the -sample- 
command.  But if you can't, then you can :

use myfile if uniform()<.1

which will select about 10% of the observations.  If you want exactly
10%, 
then use something bigger than 0.10 and then use -sample- :

use myfile if uniform()<.15
sample 100000

You probably want to set the random number seed before any of these 
approaches if you want replicable results.

Michael Blasnik


----- Original Message ----- 
From: "Marcella Sapun" <[email protected]>
To: <[email protected]>
Sent: Thursday, October 26, 2006 10:57 AM
Subject: st: random sample


> Dear statalisters:
>
> I want to read randomly 10% of a data set that contains about 1
million
> records and 100 variables. How do I do that in stata?
>
> Thanks,
>
> Marcella

*
*   For searches and help try:
*   http://www.stata.com/support/faqs/res/findit.html 
*   http://www.stata.com/support/statalist/faq 
*   http://www.ats.ucla.edu/stat/stata/
*
*   For searches and help try:
*   http://www.stata.com/support/faqs/res/findit.html
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

References:
- st: random sample
  - From: "Marcella Sapun" <[email protected]>
- st: Re: random sample
  - From: "Michael Blasnik" <[email protected]>

Prev by Date: Re: st: random sample
Next by Date: Re: st: The origin of the word Stata?
Previous by thread: st: Re: random sample
Next by thread: Re: st: random sample
Index(es):
- Date
- Thread