Re: st: range of a stringvariable

From   Richard Goldstein
Subject   Re: st: range of a stringvariable
Date   Wed, 28 Apr 2010 10:18:32 -0400

For the first question:

gen byte newvar = substr(code,1,1)=="E" & real(substr(code,2,3))>=300 & ///
    real(substr(code,2,3))<=499 & length(code)==4

(The /// continues the command onto the next line in a do-file; typed
interactively, put it all on one line.)

For the second question (codes that end with "A") I would replace
"length(code)==4" with substr(code,5,1)=="A".

Here I am using "code" as the name of the variable holding the ICD-10 code.
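
As a rough check, the approach can be sketched on the five fake codes from the question. The names id, num, in_range and in_range_a are made up for this illustration, and the last flag is one possible reading of "include codes that end with an A" (4-character codes plus 5-character codes whose fifth character is "A"). It uses inrange(), which returns 0 when the value is missing.

```stata
* Sketch only: fake data from the question; variable names are assumptions
clear
input id str5 code
1 "I460"
2 "E343"
3 "I46"
4 "C764"
5 "E438"
end

* Numeric value of the (up to) three characters after the leading letter;
* real() returns missing if they are not a number
gen num = real(substr(code, 2, 3))

* E300 to E499, four-character codes only (no trailing "A")
gen byte in_range = substr(code, 1, 1) == "E" & inrange(num, 300, 499) ///
    & length(code) == 4

* E300 to E499, with or without a trailing "A"
gen byte in_range_a = substr(code, 1, 1) == "E" & inrange(num, 300, 499) ///
    & (length(code) == 4 | (length(code) == 5 & substr(code, 5, 1) == "A"))

list id code in_range in_range_a
```

With these data both flags should be 1 only for ids 2 and 5 (E343 and E438), since none of the fake codes ends in "A".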


On 4/28/10 10:02 AM, Tomas Lind wrote:
> Dear listers
> Choose individuals based on a string variable with a range of values
> I am working with ICD-10 codes (codes for different types of diseases). The
> codes start with a letter A - Z followed by 2 or 3 digits. In some cases
> they might end with the letter A. Say that I have a dataset with 5 subjects
> (id=1 to 5) with these ICD-10 codes (fake data, in reality I have millions
> of subjects):
> I460  E343  I46  C764  E438
> How can I choose individuals with ICD-10 codes in the range E300 to E499
> (not including codes that end with A)? What if I want to include
> codes that end with an A? (There is a convenient command for ICD-9 codes,
> but not for ICD-10 codes.)
> Any suggestions are welcome.
> /Tomas