Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
From | Nick Cox <n.j.cox@durham.ac.uk> |
To | "'statalist@hsphsun2.harvard.edu'" <statalist@hsphsun2.harvard.edu> |
Subject | st: RE: importing html to stata - http://www.wolfbane.com/icd/icd8h.htm |
Date | Thu, 8 Dec 2011 13:06:37 +0000 |
This seemed to work for me. It took a few iterations.

    * read the page into a single string variable
    infile str244 text using http://www.wolfbane.com/icd/icd8h.htm, clear
    * drop the material before the first code and after the last
    drop in 1/39
    drop in 20827/L
    compress
    * flag observations that start a new entry: a code beginning with a digit,
    * optionally preceded by "(", or with E, N or Y followed by a digit
    gen entry = regexm(text, "^\(*[0-9]")
    replace entry = regexm(text, "^\([ENY][0-9]") if entry == 0
    replace entry = regexm(text, "^[ENY][0-9]") if entry == 0
    * running sum turns the flags into an entry identifier
    replace entry = sum(entry)
    * accumulate the text of each entry, then keep its last (complete) observation
    replace text = text[_n-1] + " " + text if entry == entry[_n-1]
    sort entry, stable
    by entry : keep if _n == _N
    drop entry

Nick
n.j.cox@durham.ac.uk

-----Original Message-----
From: owner-statalist@hsphsun2.harvard.edu [mailto:owner-statalist@hsphsun2.harvard.edu] On Behalf Of Lars Folkestad
Sent: 08 December 2011 12:42
To: statalist@hsphsun2.harvard.edu
Subject: st: importing html to stata - http://www.wolfbane.com/icd/icd8h.htm

Dear List

I would like to download the following html into Stata:

http://www.wolfbane.com/icd/icd8h.htm

I would like to make the numbers from 000 - _n be var1 and the text from cholera to be var2.

Can this be done through Stata, or do I need to go via a program of some sort?

I've tried searching the list, and found one example:

http://www.stata.com/statalist/archive/2007-03/msg00535.html

But I must admit I did not understand what that was all about, and I don't think it applies to my problem.

Thank you
Lars

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/