Stata: Data Analysis and Statistical Software

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

st: Creating a second output data set

From	Bryan Sayer <[email protected]>
To	[email protected]
Subject	st: Creating a second output data set
Date	Tue, 06 Sep 2011 16:53:01 -0400

I need to create an output data set that will differ in the content andnumber of observations from the input file. The observations will becreated one at a time, based on the input data set.

Specifically, I am creating all combinations of N objects taken two at atime. I will probably also do permutations.

The input data set (to start with) consists of N records with twovariables, the primary sampling unit (PSU) and a size variableassociated with the PSU (a count variable). I want to create two outputdata sets. One is each combination of PSU with the associated jointprobability. The second has the same structure as the input data setbut includes the marginal probability, calculated as the sum of thejoint probabilities associated with the PSU (which are accumulated aseach combination is created).

The part I am stuck on is how to output the data set of combinations.Can someone point me to a program that outputs a file as calculationsare made?

(For those interested, this is for probability proportional to size(PPS) sampling. See, for example, Levy and Lemeshow "Sampling ofPopulations, chapter 11).


Here is an example of one stratum:

Input data set (with marginal probability added)

District Size		pi(i)
LUWEERO	 12,466 	0.916858
KAMPALA	 3,459 		0.542857
TORORO	 2,815 		0.448739
KAMULI	 549 		0.091546
Total	 19,289 	


Output data set:

COMBINATIONS	pi(I,j)
LUWEERO,KAMPALA	0.468854
LUWEERO,TORORO	0.377069
LUWEERO,KAMULI	0.070934
KAMPALA,TORORO	0.062531
KAMPALA,KAMULI	0.011473
TORORO,KAMULI	0.009139




--
Bryan Sayer
Monday to Friday, 8:30 to 5:00
Phone: (614) 442-7369
FAX:  (614) 442-7329
[email protected]

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

Follow-Ups:
- Re: st: Creating a second output data set
  - From: Roger Newson <[email protected]>

Prev by Date: Re: st: Bootstrapping & clustered standard errors (-xtreg-)
Next by Date: Re: st: Creating a second output data set
Previous by thread: st: Use extended functions outside of macro assignment?
Next by thread: Re: st: Creating a second output data set
Index(es):
- Date
- Thread