Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.


RE: st: RE: importing html to stata - http://www.wolfbane.com/icd/icd8h.htm


From   Nick Cox <[email protected]>
To   "'[email protected]'" <[email protected]>
Subject   RE: st: RE: importing html to stata - http://www.wolfbane.com/icd/icd8h.htm
Date   Thu, 8 Dec 2011 14:20:46 +0000

Regex aficionados would want to emphasise that 

replace entry = regexm(text, "^\([E|N|Y][0-9]") if entry == 0
replace entry = regexm(text, "^[E|N|Y][0-9]") if entry == 0

is equivalent to

replace entry = regexm(text, "^\(*[E|N|Y][0-9]") if entry == 0
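
A quick way to check that equivalence is to run both versions on a few lines and compare. The sketch below uses invented test lines, not lines from the ICD-8 page, and the variable names two_step and one_step are made up for illustration. Two caveats apply: \(* accepts any number of opening parentheses, so the versions agree only while a line starts with at most one; and inside square brackets | is a literal character, so [ENY] should match the same letters.

```stata
* Invented test lines, for illustration only
clear
input str20 text
"(E800 sample"
"N800 sample"
"Y00 sample"
"000 sample"
"sample only"
end

* Two-step version, as in the thread
gen byte two_step = regexm(text, "^\([E|N|Y][0-9]")
replace two_step = regexm(text, "^[E|N|Y][0-9]") if two_step == 0

* One-step version: \(* allows zero or more opening parentheses
gen byte one_step = regexm(text, "^\(*[E|N|Y][0-9]")

* assert stops with an error if the two versions ever disagree
assert two_step == one_step
list
```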

Nick 
[email protected] 


-----Original Message-----
From: [email protected] [mailto:[email protected]] On Behalf Of Nick Cox
Sent: 08 December 2011 13:44
To: '[email protected]'
Subject: RE: st: RE: importing html to stata - http://www.wolfbane.com/icd/icd8h.htm

Note that the code segment I gave is tailored to the document you specified. But the small tricks in using functions may well still be useful for other documents. BTW, I tried copy and paste first, but that failed for some reason. 

Nick 
[email protected] 


-----Original Message-----
From: [email protected] [mailto:[email protected]] On Behalf Of Lars Folkestad
Sent: 08 December 2011 13:35
To: [email protected]
Subject: Re: st: RE: importing html to stata - http://www.wolfbane.com/icd/icd8h.htm

That was what I thought.
The best way to go is probably to find the ICD lists I need online, use your
infile suggestion and then merge the two data sets.
Thank you.
Lars
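
For anyone following the same route, a minimal sketch of that merge might look like the code below. It assumes the ICD file holds the code in var1 and the description in var2, as created earlier in the thread. The patient data, the variable patid and the code 123.4 are invented for illustration.

```stata
* Lookup data set: one row per ICD code (tiny stand-in for the real list)
clear
input str5 var1 str30 var2
"000.0" "essential cholera"
end
tempfile icd8
save `icd8'

* Hypothetical patient data, with the diagnosis code held as a string
clear
input patid str5 var1
1 "000.0"
2 "000.0"
3 "123.4"
end

* Many patients can share one code, hence m:1; keep every patient,
* including any whose code is not in the lookup (var2 is then empty)
merge m:1 var1 using `icd8', keep(master match) nogenerate
list
```

The description then sits in var2 next to each code as an ordinary string variable, which sidesteps the restriction that value labels attach only to integers.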

On 08/12/11 at 14.27, "Nick Cox" <[email protected]> wrote:

>No, not directly. You can only attach value labels to integers, remember.
>
>But the best way you can do something similar is likely to be with a
>-merge-. 
>
>Nick 
>[email protected]
>
>Lars Folkestad
>
>Thank you Nick, that worked fantastically.
>Is it possible to use this newly created .dta file as a value label for
>another data set, so that if, for example, my other data set has a patient with
>the diagnosis 000.0, which is essential cholera, "essential cholera" will
>show up as a value label?
>Lars
>
>On 08/12/11 at 14.11, "Nick Cox" <[email protected]> wrote:
>
>>Followed by 
>>
>>gen var1 = word(text, 1)
>>gen var2 = subinstr(text, word(text, 1), "", 1)
>>
>>
>>Nick 
>>[email protected]
>>
>>
>>-----Original Message-----
>>From: Nick Cox 
>>Sent: 08 December 2011 13:07
>>To: '[email protected]'
>>Subject: RE: importing html to stata -
>>http://www.wolfbane.com/icd/icd8h.htm
>>
>>This seemed to work for me. It took a few iterations.
>>
>>infile str244 text using http://www.wolfbane.com/icd/icd8h.htm, clear
>>drop in 1/39
>>drop in 20827/L
>>compress
>>gen entry = regexm(text, "^\(*[0-9]")
>>replace entry = regexm(text, "^\([E|N|Y][0-9]") if entry == 0
>>replace entry = regexm(text, "^[E|N|Y][0-9]") if entry == 0
>>replace entry = sum(entry)
>>replace text = text[_n-1] + " " + text if entry == entry[_n-1]
>>sort entry, stable
>>by entry : keep if _n == _N
>>drop entry
>>
>>Nick 
>>[email protected]
>>
>>
>>-----Original Message-----
>>From: [email protected]
>>[mailto:[email protected]] On Behalf Of Lars Folkestad
>>Sent: 08 December 2011 12:42
>>To: [email protected]
>>Subject: st: importing html to stata -
>>http://www.wolfbane.com/icd/icd8h.htm
>>
>>Dear List
>>
>>I would like to download the following HTML page into Stata:
>>http://www.wolfbane.com/icd/icd8h.htm
>>I would like to make the numbers from 000 - _n be var1 and the text from
>>cholera to be var2
>>
>>Can this be done through Stata, or do I need to go via a program of some
>>sort?
>>I've tried searching the list, and found one example:
>>http://www.stata.com/statalist/archive/2007-03/msg00535.html
>>But I must admit I did not understand what that was all about, and I don't
>>think it applies to my problem.
>
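
The line-joining idea in the code quoted above can be tried on a small scale without downloading anything. The sketch below is an illustration under assumptions, not a replacement for the original code: the five text lines are invented stand-ins for the page content (only "000.0 essential cholera" comes from the thread), and the single \(* pattern for the lettered codes is the condensed form mentioned in the later reply.

```stata
* Invented stand-in for the imported lines; entries may wrap over several rows
clear
input str40 text
"000 made-up heading"
"000.0 essential"
"cholera"
"(E800) made-up entry"
"continued"
end

* Flag rows that start a new entry: optional "(", then a digit or E/N/Y plus digit
gen entry = regexm(text, "^\(*[0-9]")
replace entry = regexm(text, "^\(*[E|N|Y][0-9]") if entry == 0

* The running sum turns the flags into an entry number shared by continuation rows
replace entry = sum(entry)

* Glue each continuation row onto the text accumulated so far for that entry
replace text = text[_n-1] + " " + text if entry == entry[_n-1]

* The last row of each entry now holds the complete text
sort entry, stable
by entry : keep if _n == _N
drop entry

assert _N == 3
assert text[2] == "000.0 essential cholera"

* Split into code and description, as in the follow-up message
gen var1 = word(text, 1)
gen var2 = subinstr(text, word(text, 1), "", 1)
list
```

The accumulation relies on -replace- working through the observations in order, so each row sees the already extended text of the row before it.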
