Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

# RE: st: question on line graphs

 From "Thaker, Anant" To "statalist@hsphsun2.harvard.edu" Subject RE: st: question on line graphs Date Mon, 19 Apr 2010 01:06:45 -0400

```Dear Mitchell,

Thank you! This worked very well. Just a question: could you briefly explain the purpose of 'collapse rtax , by(cohort year)'? If I want to repeat this procedure with other variables, I'd have to reload the full GSS dataset, correct?

Thanks again,
Anant

Anant Thaker
MBA Class of 2011
Harvard Business School
athaker@mba2011.hbs.edu
+1 646-964-7123

-----Original Message-----
From: owner-statalist@hsphsun2.harvard.edu [mailto:owner-statalist@hsphsun2.harvard.edu] On Behalf Of Michael Norman Mitchell
Sent: Sunday, April 18, 2010 11:32 PM
To: statalist@hsphsun2.harvard.edu
Subject: Re: st: question on line graphs

Dear Anant

Will this do the trick...

******************************************
* CREATE FAKE DATASET
clear
set obs 10
generate year = 1995 + _n
expand 4
sort year
generate cohort = mod((_n-1),4)+1
tab year cohort
expand 100
generate tax = int(uniform()*5)+1
tab tax
recode tax (1 2=1 "1 or 2") (3 4 5=0 "3,4,5"), gen(rtax)
tab tax rtax
******************************************

* Start of real code

* collapse to get proportions
collapse rtax , by(cohort year)
* Convert to percentage
replace rtax = rtax*100
* separate into four variables by cohort
separate rtax, by(cohort)
* graph it
graph twoway (line rtax1 rtax2 rtax3 rtax4 year), ///
xlabel(1996(1)2005, angle(45)) ///
legend(label(1 "Cohort 1") label(2 "Cohort 2") label(3 "Cohort 3")
label(4 "Cohort 4"))

And here is the graph that it creates

http://screencast.com/t/Y2FkZGQ0Y2U

Hope that helps,

Michael N. Mitchell
See the Stata tidbit of the week at...
http://www.MichaelNormanMitchell.com

On 2010-04-18 8.06 PM, Thaker, Anant wrote:
> I am a Stata 11/SE user. In my dataset, I have a cohort variable coded 1-4 with the various subgroups (e.g. 1 =<1934, 2 = 1934-1948, etc.), and an ordered variable that shows views on progressive taxation (coded 1-5 in terms of relative support). These data points are spread over a number of years (from the General Social Survey). I'm trying to chart out the % of respondents who selected 1 or 2 on the taxation variable, by cohort subgroup, over time. So the x-axis would be year, the y-axis would be "% who chose 1 or 2," and there would be four line graph series (one for each cohort subgroup). Would really appreciate some help! I can clarify as needed. Thanks!
>
> Note: data is currently coded as this example:
> YEAR  COHORT  TAXATION
> 2008  1965       1
> YEAR  COHORT  TAXATION
> 1996  1975       3
> ...
>
> Best,
> Anant
>
>
>
> *
> *   For searches and help try:
> *   http://www.stata.com/help.cgi?search
> *   http://www.stata.com/support/statalist/faq
> *   http://www.ats.ucla.edu/stat/stata/
>
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/
```

© Copyright 1996–2017 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   Site index