I am sorry for not being very clear. As I said, I am not yet very experienced in research practice. The goal of the model is to explain sorting into occupations by individual risk preferences. In the first stage I run a Mincerian wage regression with dummies for 106 different occupations. The standard deviations of the residuals of this regression are used as the measure of occupation (earnings) risk. In the second step I use the standard deviation of the residuals as the dependent variable in a regression on individual characteristics, including risk preference.

My problem is that I don't know how to construct a variable from the standard deviations of the residuals. I have the residuals and I know how to find their standard deviations (for example with tabstat). I don't know how to construct a variable from these 106 standard deviations. I have tried tabstat and the egen advice you gave me earlier, but these don't yield good results. I hope my problem is now clear and that you (or other Stata users) can help me. If you still don't understand it, please ask me again :-).

The 106 standard deviations are thus the different possible "types" of the individuals in the sample. I want a variable containing the 106 different standard deviations so that I can use it as the dependent variable in my regression.

The 106 standard deviations of course have different frequencies, because some occupations are chosen more often than others. Do you think I should take this into account?

Anyway, it would be awesome if you could give me some useful Stata procedures.

-tabstat- is far from the most appropriate command for this purpose. There are several ways to achieve what you need.

1. Use -egen- command:

egen sdy = sd( y )

2. Use saved results from -summarize- (type -return list- to find out what these are):

sum y
gen sdy = r(sd)
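Both commands above give one standard deviation for the whole sample. If one standard deviation per group is wanted instead, -egen- can be combined with -bysort-. A minimal sketch, assuming the auto dataset shipped with Stata as a stand-in: price, mpg, weight and rep78 are only placeholders for the wage equation and the group identifier, and resid, sd_resid and first are made-up names.

```stata
* Stand-in data; replace with your own regression and group variable
sysuse auto, clear
regress price mpg weight i.rep78
predict double resid, residuals

* Within each group, every observation gets that group's sd of the residuals
bysort rep78: egen sd_resid = sd(resid)

* Tag one observation per group so the result can be inspected compactly
egen byte first = tag(rep78)
list rep78 sd_resid if first
```

The new variable is constant within a group and varies across groups, so it can be carried into a later regression like any other variable.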

I personally think that having a variable equal to a constant is a data management oxymoron... a variable must vary, by virtue of its name :). You probably only need it for another step or two in data analysis, e.g., if you want to simulate a new set of errors with the same variance:

sum y
gen new_y = rnormal()*r(sd)

I have a problem with constructing a new variable from the standard deviations of regression residuals. I have generated 104 different sets of regression residuals and then used tabstat to determine their standard deviations. I need to construct a new variable from these 104 standard deviations of the regression residuals for the next step in my analysis.

I know that tabstat has a save option to save the summary statistics in a matrix. Unfortunately, when I use this option no matrix is created (nothing happens). Does anyone know how I can save the standard deviations as a new variable, possibly using tabstat or another method?
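A note on the -save- option: as far as I know, -tabstat, save- does not create a named matrix but leaves its results in r(), which would explain why nothing seems to happen. A minimal sketch, again assuming the auto dataset as a stand-in (rep78 plays the role of the grouping variable; resid and S1 are made-up names):

```stata
* Stand-in data; replace with your own regression and group variable
sysuse auto, clear
regress price mpg weight i.rep78
predict double resid, residuals

* save leaves the statistics in r() instead of creating a named matrix
tabstat resid, by(rep78) statistics(sd) save
return list

* r(Stat1) holds the statistics for the first by-group; copy it to inspect it
matrix S1 = r(Stat1)
matrix list S1
```

With many groups this means looping over r(Stat1), r(Stat2), and so on, so it is likely more work than computing the standard deviations with -egen- by group.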

