Statalist The Stata Listserver


[Date Prev][Date Next][Thread Prev][Thread Next][Date index][Thread index]

st: import html


From   "Rodrigo Martell" <rodrigo.martell@frontier-economics.com.au>
To   <statalist@hsphsun2.harvard.edu>
Subject   st: import html
Date   Wed, 14 Mar 2007 11:54:16 +1100

I'm trying to get the list of links in this website (http://www.nemweb.com.au/Reports/CURRENT/DispatchIS_Reports/) as a list in a text file.
I tried using -copy- to save it a text file but naturally, it saves it as messy html and importing it into Stata fails because the strings are too long.
I tried -insheet- with some delimiters but it doesn't seem to like this giving a r(198) error.
I found some html related programs using -findit- but none seem to relate to reading in html, just spitting out html.

Does anyone know a clever way to do this in Stata? I'm running out of ideas, I might have to look into writing a console application that does it so that my program can shell out to it when it needs to fetch the list of links as a text file.

Thanks!

Rodrigo Martell

 <http://www.frontier-economics.com> 	
Frontier Economics Pty. Ltd.
395 Collins Street
Melbourne VIC 3000
Australia
www.frontier-economics.com 	
switch:
direct:
fax:
mobile:
email:


+61 (0)3 9620 4488
+61 (0)3 9613 1518
+61 (0)3 8614 2711
+61 (0)407 909 811
rodrigo.martell@frontier-economics.com <mailto:rodrigo.martell@frontier-economics.com> 


This e-mail, including any attachments, may contain confidential and privileged information for the sole use of the intended recipient(s). Any review, use, disclosure or distribution by others is strictly prohibited. If you are not the intended recipient (or authorised to receive information for the recipient), please contact the sender by reply e-mail and delete all copies of this message. Thank you.

	



*
*   For searches and help try:
*   http://www.stata.com/support/faqs/res/findit.html
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/



© Copyright 1996–2014 StataCorp LP   |   Terms of use   |   Privacy   |   Contact us   |   What's new   |   Site index