Bookmark and Share

Notice: On March 31, it was announced that Statalist is moving from an email list to a forum. The old list will shut down on April 23, and its replacement, statalist.org is already up and running.


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

st: skipping rogue commas when importing csv file using -infile-


From   Rob Shaw <rob.shaw.uk@gmail.com>
To   statalist@hsphsun2.harvard.edu
Subject   st: skipping rogue commas when importing csv file using -infile-
Date   Fri, 26 Oct 2012 16:59:19 +0100

Hi

I'm importing (part of) a large text file into Stata using --infile--.
The file is a csv.

However, it seems that a small number of lines have a rouge extra
comma in them, which is then pushing all the data along by one
variable. This happens not just for that line but for all subsqequent
lines as well!

I'm not too bothered if I have to later drop or reprocess this
individual line but does anyone know if there is there a way to stop
it affecting all the lines afterwards as well?

File example (with identical records in this example)

ABC,DEF,GH,IJK
ABC,DEF,GH,IJK
ABC,DEF,G,H,IJK
ABC,DEF,GH,IJK
ABC,DEF,GH,IJK

What I then get is for var1 is

ABC
ABC
ABC
IJK
IJK

and var2 is

DEF
DEF
DEF
ABC
ABC

etc

using hexdump it seems that all the lines finish with \r\n so if there
is a way to use this to 'reset' at each line then that would work.

Many thanks
Rob
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/


© Copyright 1996–2014 StataCorp LP   |   Terms of use   |   Privacy   |   Contact us   |   Site index