Stata: Data Analysis and Statistical Software

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: Need help with regular expression

From	Alison Connor <[email protected]>
To	[email protected]
Subject	Re: st: Need help with regular expression
Date	Sun, 17 Feb 2013 19:12:08 +0545

Hi Mike,

That is my fault - The $ denotes the end. Not "&". Sorry about that!

~A

On Sun, Feb 17, 2013 at 7:07 PM, Michael Stewart
<[email protected]> wrote:
> Hi Nick and Alison,
> One quick question.(sorry, I am a novice to regular expressions)
> The two functions di regexm("show us", " us$") & di regexm("show us",
> " us&") have different results.
> what is the difference between these functions ??
> Thank you very much .
> Sincerely
> Mike
>
> On Sun, Feb 17, 2013 at 5:03 AM, Nick Cox <[email protected]> wrote:
>> Alison's suggestion is surely right. If your condition is not
>> restrictive enough, you need to make it more restrictive.
>>
>> If it is a separate word, it occurs surrounded by spaces or at the
>> beginning or end with one space after or before. I tend to build up
>> syntax using -display- and little examples.
>>
>> . di regexm("us marines", "^us ")
>> 1
>>
>> . di regexm("show us", " us$")
>> 1
>>
>> . di regexm("invited us to dinner", " us ")
>> 1
>>
>> You could also use -strpos()-
>>
>> ... if strpos(var, " us ") | strpos(var, "us ") == 1 | strpos(var, " us") == -3
>>
>> Note that -regexm()- is a function, not a command. in Stata functions
>> and commands are quite separate in definition and documentation.
>> -strpos()- is another function. Fixing _function_ firmly in mind as
>> the key term leads to consultation of the documentation on functions,
>> which as here can suggest alternatives.
>>
>> There are also solutions with -word()- but they all seem to entail a loop.
>>
>> Nick
>>
>> On Sun, Feb 17, 2013 at 8:04 AM, Alison Connor
>>
>>> I am sure there is a much better way to do this, but until someone who
>>> actually knows responds, could you try something like:
>>>
>>> regexm(var, " us ") | regexm(var, " us&") | regexm(var, "^us ")  ?
>>
>> On Sun, Feb 17, 2013 at 1:00 PM, Michael Stewart
>>
>>>> I am trying to use regexm command (regexm(var,"us"))to see if the
>>>> values of variable have a particular string "us" like in  "us thyroid"
>>>> .Unfortunately , it is picking up "us" from other string values like
>>>> "venous".If there a way to formulate command so that it can pick up
>>>> only values  where "us" is a separate word and not part of word like
>>>> venous.
>> *
>> *   For searches and help try:
>> *   http://www.stata.com/help.cgi?search
>> *   http://www.stata.com/support/faqs/resources/statalist-faq/
>> *   http://www.ats.ucla.edu/stat/stata/
>
>
>
> --
> Thank you ,
> Yours Sincerely,
> Mike.
> *
> *   For searches and help try:
> *   http://www.stata.com/help.cgi?search
> *   http://www.stata.com/support/faqs/resources/statalist-faq/
> *   http://www.ats.ucla.edu/stat/stata/
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/

Follow-Ups:
- Re: st: Need help with regular expression
  - From: Michael Stewart <[email protected]>

References:
- st: Need help with regular expression
  - From: Michael Stewart <[email protected]>
- Re: st: Need help with regular expression
  - From: Alison Connor <[email protected]>
- Re: st: Need help with regular expression
  - From: Nick Cox <[email protected]>
- Re: st: Need help with regular expression
  - From: Michael Stewart <[email protected]>

Prev by Date: Re: st: Need help with regular expression
Next by Date: Re: st: Need help with regular expression
Previous by thread: Re: st: Need help with regular expression
Next by thread: Re: st: Need help with regular expression
Index(es):
- Date
- Thread