Bookmark and Share

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: st: Insert spaces into a string variable upto the max length of a variable.


From   Tim Evans <[email protected]>
To   "[email protected]" <[email protected]>
Subject   RE: st: Insert spaces into a string variable upto the max length of a variable.
Date   Fri, 7 Feb 2014 15:31:46 +0000

Nick,

Thanks for your reply. My aim is to combine the estimates of a logistic  regression analysis as a table next to a graphical representation of it using the variable labels. I'm after this solution purely for presentational purposes. 

Here is my code (using nlsw88 data). Apologies for the length of the code.... but it should work. On the finished graph, I would like the x axis labels to be left aligned (if you look at the data at the end of the code, the spacing of the information is how I would like it to appear next to the graph).


*****BEGIN CODE****

sysuse nlsw88, clear

replace ttl_exp = ttl_exp / 10 

logit union i.race i.south grade ttl_exp, or

foreach v in grade ttl_ex { // continuous variables
       local l`v' : variable label `v'
	   if `"`l`v''"' == "" { // if no variable label
			local l`v' "`v'"
	   }
}
foreach v in south race { // factor variables
	levelsof `v'
	local `v'levs `r(levels)'
	foreach l in ``v'levs' {
		local l`v'_`l' : label (`v') `l'
		if `"`l`v'_`l''"' == "`l'" { // if no value label
			local l`v'_`l' `"`v' == `l'"'
		}
	}
}

parmest, norestore eform
foreach v in grade ttl_ex {
   replace parm = "`l`v''" if parm == "`v'"
}
foreach v in south race {
	local i = 1
	foreach l in ``v'levs' {
		if `i' == 1 {
			drop if parm == "`l'b.`v'" // drop reference
		}
		else {
			replace parm = "`l`v'_`l''" if parm == "`l'.`v'"
		}
		local i = `i' + 1
	}
}
replace parm = "baseline odds" if parm == "_cons"
egen axis= axis(z), label(parm)


gen int flag = 1 if p<0.05
replace flag = 0 if p>=0.05
replace flag = 2 if p==. & estimate==1
label define flag 0 "Not significant" 1 "Significant" 2 "Base value"
label values flag flag
g flag2 = flag
replace flag2 = 0 if !inlist(flag2, 0,1)    

separate estimate, by(flag2)

g str_est0 = string(estimate0, "%09.2fc")
g str_est1 = string(estimate1, "%09.2fc") 
g str_min = string(min95, "%09.2fc")
g str_max = string(max95, "%09.2fc")
g str_P = string( p, "%09.4fc")
replace str_P = "" if str_P=="."
replace str_P = "<0.0001" if p<0.0001
g c = ","
g br1 = " ("
g br2 = ") "



gen length = length(trim(parm))
su length, meanonly
local max = r(max)
local pad : di _dup(`max') " "
replace parm = trim(parm) + substr("`pad'", 1, `max' - length)
drop length

gen length = length(trim(str_P))
su length, meanonly
local max = r(max)
local pad : di _dup(`max') " "
replace str_P = substr("`pad'", 1, `max' - length) + trim(str_P) 

drop length
gen length = length(trim(str_est0))
su length, meanonly
local max = r(max)
local pad : di _dup(`max') " "
replace str_est0 = trim(str_est0) + substr("`pad'", 1, `max' - length)

drop length
gen length = length(trim(str_est1))
su length, meanonly
local max = r(max)
local pad : di _dup(`max') " "
replace str_est1 = trim(str_est1) + substr("`pad'", 1, `max' - length)

drop length
gen length = length(trim(str_min))
su length, meanonly
local max = r(max)
local pad : di _dup(`max') " "
replace str_est1 = trim(str_min) + substr("`pad'", 1, `max' - length)

drop length
gen length = length(trim(str_max))
su length, meanonly
local max = r(max)
local pad : di _dup(`max') " "
replace str_est1 = trim(str_max) + substr("`pad'", 1, `max' - length)
drop length


**GENERATE VARIABLE


*************************************
g est5  = parm + "; "  + str_est1 + " " + "(" + str_min 

*drop length
gen length = length(trim(est5))
su length, meanonly
local max = r(max)
local pad : di _dup(`max') " "
replace est5 = trim(est5) + substr("`pad'", 1, `max' - length)
drop length

replace est5 = est5 + c + str_max + ")"


***************USE THIS VARIABLE AS A LABEL


encode est5, g(est_touse)
g order2 = _n

egen axis2= axis(order2), label(est_touse)
gsort axis2
labmask order, values(est_touse)                           
count if inlist(flag,0,1)
local N = r(N)  
revrs axis2
*g flag2 = flag
*replace flag2 = 0 if !inlist(flag2, 0,1)                   

format %-65.0g revaxis2

 twoway bar estimate0 revaxis2, xscale(log) base(1) horizontal barw(0.5) color(gs13) ///
		xline(1, lcolor(gs13)) yla(1/`=_N', noticks valuelabel labsize(vsmall) ang(h)) xla(, labsize(vsmall)) ||             ///
		  rcap min95 max95 revaxis2, horizontal 		  ///
		||   bar estimate1 revaxis2, xscale(log) base(1) horizontal barw(0.5) color(red) ///
		xline(1, lcolor(gs13)) yla(1/`=_N', noticks valuelabel labsize(vsmall) ang(h))  ||             ///
		  rcap min95 max95 revaxis2, horizontal   ///  
		     	legend(order(3 "Significant" 1 "Not significant") size(small)) ytitle("") ///
  xtitle("Odds ratio", size(small)) ytitle("") l2("{bf:ORs (95% CIs)}",   size(vsmall) orientation(horiz) placement(top) margin(t=0 r=-60))

*****END CODE****



-----Original Message-----
From: [email protected] [mailto:[email protected]] On Behalf Of Nick Cox
Sent: 07 February 2014 14:54
To: [email protected]
Subject: Re: st: Insert spaces into a string variable upto the max length of a variable.

I have no difficulty in using string variables in -graph bar- or -graph hbar-, meaning e.g.

sysuse auto, clear
decode foreign, gen(origin)
graph hbar (mean) mpg, over(origin)
replace origin = origin + "    " if origin == "Foreign"
graph hbar (mean) mpg, over(origin)

This example shows that right alignment can be approximated, if not gained exactly, at least with the settings on Stata 13.1 for Windows.

Your problem arises with -twoway bar-, so please either try an equivalent command in -graph bar|hbar|dot- or tell us _exactly_ why that won't work in your case. As requested in the FAQ, showing us exact syntax used helps greatly.

Nick
[email protected]

On 7 February 2014 14:38, Tim Evans <[email protected]> wrote:

> Thank you very much for your help.
>
> I have used this code in order to create a variable that is based upon a number of other variables and I wanted to ensure correct spacing between the variables when I concatenated them together. I now left align the string variable and I notice that the combined variables now align for all records.
>
> My next thing is that I want to use this variable in a -tw- bar graph. I understand that strings cannot be used in a bar graph, so I encode my newly created string variable. However, I lose the left alignment I previously enjoyed with the string variable and while the contents are the same the alignment is now right aligned (I have tried left aligning in the variable properties). Is this because what is being aligned is the numeric value behind the label? If so, how can I duplicate the exact alignment in the string variable into the numeric labelled variable?

Tim Evans

> Thanks for this - much appreciated. Am working through this with other variables now.

Nick Cox

> gen length = length(trim(strvar))
> su length, meanonly
>
> local max = r(max)
> local pad : di _dup(`max') " "
>
> replace strvar = trim(strvar) + substr("`pad'", 1, `max' - length)
>
> NB _not_ length("strvar") !!!
>
> The -length()- function finds the length. -summarize- returns the maximum length. The biggest possible "pad" is that many spaces. The length of the part of the pad needed is the difference between the length of the trimmed string and the maximum length.


On 6 February 2014 21:34, Tim Evans <[email protected]> wrote:

>> I'm using Stata 11.2. I have a string variable called 'make' containing the following:
>>
>> Buick
>> AMC
>> Lincoln
>>
>> The max string length is 7 (for Lincoln) with the smallest being 3 for AMC.
>>
>> What I want to do is assess the maximum length of the variable 'make' and then insert extra spaces in order to make the length of each value up to the max. I.e add to the end of Buick two spaces and AMC (4 spaces) and Lincoln (no spaces).
>>
>> I need this for when I concatenate the string variable with another string so that the start of the second string component is aligned for all values.

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/

**************************************************************************
The information contained in the EMail and any attachments is confidential and intended solely and for the attention and use of the named addressee(s). It may not be disclosed to any other person without the express authority of Public Health England, or the intended recipient, or both. If you are not the intended recipient, you must not disclose, copy, distribute or retain this message or any part of it. This footnote also confirms that this EMail has been swept for computer viruses by Symantec.Cloud, but please re-sweep any attachments before opening or saving. http://www.gov.uk/PHE
**************************************************************************

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/


© Copyright 1996–2018 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   Site index