Bookmark and Share

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: using regex

From   Steven Samuels <[email protected]>
To   [email protected]
Subject   Re: st: using regex
Date   Thu, 2 Dec 2010 17:58:32 -0500

The regular expression solution is:

gen fips = real(regexs(1)) if regexm(x,"([0-9][0-9]?)[0-9][0-9][0-9]")


Steven J. Samuels
[email protected]
18 Cantine's Island
Saugerties NY 12477
Voice: 845-246-0774
Fax:    206-202-4783

On Dec 2, 2010, at 5:22 PM, Rijo John wrote:

Hi statlist,

I have a string variable (code) with state and county codes merged.
For example, the observation 12001 indicate that the first 2 digits
are state codes and the remaining 3 digits are county codes. I want to
create a new variable (fips) only with the state code.  I use the

gen fips = regexs(0) if(regexm(code, "[0-9][0-9]"))

and it works fine when the code contains 5 digits. But if a particular
state code is only 1 digit and thereby the string "code" only has a
total of 4 digits the trick above does not work. In such cases I want
to only extract the first digit.
Can someone help?

*   For searches and help try:

*   For searches and help try:

© Copyright 1996–2018 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   Site index