Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
From | "Mike Kim" <kalisperos@gmail.com> |
To | <statalist@hsphsun2.harvard.edu> |
Subject | RE: st: insheet limit in string |
Date | Thu, 8 Dec 2011 22:19:07 -0600 |
Thanks, Nick and Matt. Matt, your suggestion works with the simple example below, but it does not work with my actual data. I have many variables to the right of 'school' variable. If I use the code you suggested, split ones push other observations to the right and variables are all mixed. Strange. Well, thanks anyway. I really appreciate your help. Mike. -----Original Message----- From: owner-statalist@hsphsun2.harvard.edu [mailto:owner-statalist@hsphsun2.harvard.edu] On Behalf Of Matthew White Sent: Thursday, December 08, 2011 6:09 PM To: statalist@hsphsun2.harvard.edu Subject: Re: st: insheet limit in string Hi Mike, With -filefilter-, you probably want to change "; [" to the delimiter. For example, if mydata.csv is comma-delimited, you should change "; [" to `"",""' (since you're working with strings). Let's say mydata.csv looks like this: ***BEGIN .CSV*** author,school "Novicevic, MM; Humphreys, JH; Buckley, MR; Cagle, C; Roberts, F","[Buckley, MR] Univ Oklahoma, Michael F Price Coll Business, Norman, OK 73019 USA; [Novicevic, MM; Roberts, F] Univ Mississippi, Sch Business Adm, University, MS 38677 USA; [Humphreys, JH] Texas A&M Univ, Commerce, TX 75429 USA; [Cagle, C] Univ Mississippi, Sch Accountancy, University, MS 38677 USA" ***END*** Then in Stata: filefilter mydata.csv mydata_split.csv, from("; [") to(`"",""') replace insheet using mydata_split.csv, comma names clear Does that work? Also, if you're working with many files, think about using the -:dir- extended macro function: -help extended_fcn-. Best, Matt On Thu, Dec 8, 2011 at 6:30 PM, Mike Kim <kalisperos@gmail.com> wrote: > Hi Matt, > > Yes, I tried filefilter, etc. but it didn't work. For example, I changed "; > [" into "^" or "???"and used insheet. The result is completely mess (with ^) > or error (with ???). Does delimiter option in insheet work only with one > variable? If data has several variable, the imported data becomes total mess > with delimiter option. Then, maybe the only option is to save my school > variable as a separate file and import using delimit option and merge again? > I have 40 of these files to import, but it is doable. The only question > is... why does Stata create this pain? > Mike. > > -----Original Message----- > From: owner-statalist@hsphsun2.harvard.edu > [mailto:owner-statalist@hsphsun2.harvard.edu] On Behalf Of Matthew White > Sent: Thursday, December 08, 2011 4:22 PM > To: statalist@hsphsun2.harvard.edu > Subject: Re: st: insheet limit in string > > Hi Mike, > > There was a discussion about a similar problem not too long ago. > Google "String variables over 244 in a dataset with two delimiters" > and see if that helps. > > Best, > Matt > > On Thu, Dec 8, 2011 at 5:05 PM, Mike Kim <kalisperos@gmail.com> wrote: >> Hi all, >> >> I am using Stata IC v.11 and trying to import data using: >> insheet using mydata.csv, clear >> >> However, due to 244 string limit, I cannot correctly import the following >> example. I cannot use delimit(;) option because it changes the data >> structure I intended. If I can split school variable using delimit("; ["), >> it will work, but Stata does not allow this. Is there any way I can import >> more than 244 string? Can Stata 12 handle large string variables? Thank > you >> in advance. >> Mike. >> >> clear >> input str244 author str244 school >> "Novicevic, MM; Humphreys, JH; Buckley, MR; Cagle, C; Roberts, F" >> "[Buckley, MR] Univ Oklahoma, Michael F Price Coll Business, Norman, OK >> 73019 USA; [Novicevic, MM; Roberts, F] Univ Mississippi, Sch Business Adm, >> University, MS 38677 USA; [Humphreys, JH] Texas A&M Univ, Commerce, TX > 75429 >> USA; [Cagle, C] Univ Mississippi, Sch Accountancy, University, MS 38677 > USA" >> end >> >> >> * >> * For searches and help try: >> * http://www.stata.com/help.cgi?search >> * http://www.stata.com/support/statalist/faq >> * http://www.ats.ucla.edu/stat/stata/ > > > > -- > Matthew White > Data Coordinator > Innovations for Poverty Action > 101 Whitney Avenue, New Haven, CT 06510 USA > +1 434-305-9861 > www.poverty-action.org > > * > * For searches and help try: > * http://www.stata.com/help.cgi?search > * http://www.stata.com/support/statalist/faq > * http://www.ats.ucla.edu/stat/stata/ > > > * > * For searches and help try: > * http://www.stata.com/help.cgi?search > * http://www.stata.com/support/statalist/faq > * http://www.ats.ucla.edu/stat/stata/ -- Matthew White Data Coordinator Innovations for Poverty Action 101 Whitney Avenue, New Haven, CT 06510 USA +1 434-305-9861 www.poverty-action.org * * For searches and help try: * http://www.stata.com/help.cgi?search * http://www.stata.com/support/statalist/faq * http://www.ats.ucla.edu/stat/stata/ * * For searches and help try: * http://www.stata.com/help.cgi?search * http://www.stata.com/support/statalist/faq * http://www.ats.ucla.edu/stat/stata/