Stata: Data Analysis and Statistical Software

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.


Re: st: Going through each observation of a variable

From	David Kantor <[email protected]>
To	[email protected]
Subject	Re: st: Going through each observation of a variable
Date	Sat, 08 Jun 2013 17:44:00 -0400

Hello Derya,
See below.

At 02:52 PM 6/8/2013, you wrote:

Hi David,
Organization of the data is that I simply copy-pasted these prices in the data as additional variables. The variables Price1 and Price2 have 500 observations, each row representing a price vector. os1 and os2 are the expenditure shares for each individual and have 80,000 observations.
I am computing Y for each individual as the expenditure share of good 1 for each individual (os1), multiplied by the price of good 1 (P1), plus the same for good 2. If I had only a single price vector, this would be straightforward to compute. I could just write 'gen Y=os1*P1+os2*P2'. But I have 500 different price vectors. I would like to generate Y 500 times, and take the average across the 500.
The program I posted chooses randomly from these price vectors. But I don't want randomness at this stage. I would like to compute Y for each price vector one by one... This is what I meant by replication.
Here is an example with 3 price vectors and 10 individuals to show what I am trying to do: https://www.dropbox.com/s/boslxhpkyljcq45/Book1.xlsx
Thanks again, greatly appreciated!

Derya
[...]


It seems that you have two datasets:
1, prices: 500 observations
2, individuals: 80000 observations

Or that in some virtual sense, this is what you have.
But it's still not clear how you have it organized.

Are all these observations packed into one dataset? If they are together -- stacked on top of each other -- then it is a meaningless "structure".

On the other hand, maybe you need a cross-product of the two datasets; maybe it is already in that form; it would have 40000000 observations (500 x 80000). That's a lot of data -- and redundant. But it might be the right shape to do the job. If it's not already in that shape, then you can combine them with -cross-.


But again, that's a lot of observations. Your system might choke.
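To make the -cross- route concrete, here is a minimal sketch on toy data (3 price vectors, 10 individuals). The variable names pv, id, Y, and Ybar, the tempfile, and all the generated numbers are assumptions for illustration only; they are not taken from the actual data.

```stata
* Sketch only: toy sizes and made-up values stand in for the real data.
clear
set obs 3                          // 3 price vectors (500 in the real data)
generate pv = _n                   // hypothetical price-vector id
generate P1 = 1 + 0.1*_n
generate P2 = 2 + 0.1*_n
tempfile prices
save `prices'

clear
set obs 10                         // 10 individuals (80000 in the real data)
generate id = _n                   // hypothetical individual id
generate os1 = _n/20
generate os2 = 1 - os1

* Pair every individual with every price vector: 10 x 3 = 30 observations.
cross using `prices'

* One Y per (individual, price vector), then the average per individual.
generate double Y = os1*P1 + os2*P2
collapse (mean) Ybar = Y, by(id)
list
```

At full scale the crossed dataset would hold 500 x 80000 rows before the -collapse-, so memory is the main concern with this approach.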

I would guess that your main dataset is the individuals, and each individual needs to be mated with the prices data.

It may be better to store the prices data in a matrix or a virtual matrix using macros. Maybe that's what you have in mind. It may be a situation that works well in Mata, but I am no expert in that.
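A rough sketch of the matrix idea follows. It assumes the layout described in the quoted message: the prices sit in the first rows of the individuals' dataset as Price1 and Price2. The matrix name PR, the helper variable Ysum, the local K, and the toy values are all hypothetical.

```stata
* Sketch only: toy data with 10 individuals and 3 price vectors.
clear
set obs 10
generate os1 = _n/20
generate os2 = 1 - os1
generate Price1 = 1 + 0.1*_n in 1/3    // prices filled in the first 3 rows only
generate Price2 = 2 + 0.1*_n in 1/3

local K = 3                            // number of price vectors (500 in the real data)

* Copy the price rows into a K x 2 matrix.
* Older Stata releases may need matsize raised before building a 500-row matrix.
mkmat Price1 Price2 in 1/`K', matrix(PR)

* Accumulate Y over the price vectors, one row of PR at a time.
generate double Ysum = 0
forvalues k = 1/`K' {
    quietly replace Ysum = Ysum + os1*PR[`k',1] + os2*PR[`k',2]
}
generate double Ybar = Ysum/`K'
drop Ysum
list os1 os2 Ybar
```

This keeps the dataset at its original number of observations and never builds the cross-product. If Y is exactly this linear combination, the same average also equals os1 times the mean of Price1 plus os2 times the mean of Price2, so the loop matters mainly when a later step is nonlinear in the prices.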


Other options:
        create wide data; still a lot of data.

        step through each individual, grabbing one at a time; cross that with the prices, and output the result (or write it to a Stata data file).
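The second option might look roughly like the sketch below, which uses -postfile- to write one result per individual to a Stata data file. The tempfiles, the handle name, the variables id and Ybar, and the toy values are all hypothetical, and looping over 80000 individuals this way is likely to be slow.

```stata
* Sketch only: toy data with 3 price vectors and 10 individuals.
clear
set obs 3
generate P1 = 1 + 0.1*_n
generate P2 = 2 + 0.1*_n
tempfile prices
save `prices'

clear
set obs 10
generate id = _n
generate os1 = _n/20
generate os2 = 1 - os1
tempfile people
save `people'

* Open a results file with one row per individual.
tempname h
tempfile results
postfile `h' id double Ybar using `results'

forvalues i = 1/10 {
    use in `i' using `people', clear   // grab one individual
    cross using `prices'               // pair that individual with every price vector
    generate double Y = os1*P1 + os2*P2
    summarize Y, meanonly
    post `h' (id[1]) (r(mean))         // write the average for this individual
}

postclose `h'
use `results', clear
list
```

Only one individual's worth of crossed data is in memory at any time, at the cost of reading the individuals file once per iteration.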

We can best proceed if you clarify how each of these two datasets is stored -- and if they are together, how.

--David
