Notice: On March 31, it was **announced** that Statalist is moving from an email list to a **forum**. The old list will shut down on April 23, and its replacement, **statalist.org** is already up and running.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

From |
"vmz@vol.net.mt" <vmz@vol.net.mt> |

To |
"statalist@hsphsun2.harvard.edu" <statalist@hsphsun2.harvard.edu> |

Subject |
Re: st: RE: comas in numbrers |

Date |
Mon, 03 May 2010 10:02:26 +0100 |

That.s really wonderfull Steve and Jose' .Your help is greatly appreciated, Victor On 3/5/2010, "Steve Samuels" <sjsamuels@gmail.com> wrote: >assume that your commas are not separating the integer from the >non-integer part of the numbers, but instead are indicating >thousands--see Jose's post today. > > I suggest that you use -filefilter- instead of -sed- , Since this is >a Stata command, it will be platform independent, whereas the -sed- >command on your system might have a different syntax than the one on >mine (OS X 10.5.8, bash shell) . (The "-e" might be an issue; you >could omit it.) Either one of these two statements should work. > >filefilter file1.txt file2.txt, from(",") to("") //best >filefilter file1.txt file2.txt, from(\044d) to("") > > Monitor what is in file2.txt either by opening it up in your text >editor (best), or using -type-. > > I'm not sure what you mean by "highlighting" the data, and I'm not >sure that I consider the do file editor as a good text editor. (I >wouldn't want to use it with a thousand-line file). Still, while you >are there (or in a good text editor), why not search for and replace >the comma, save out, as Jose and Nick suggested. >' >Good luck, > >Steve > >On Sun, May 2, 2010 at 12:20 PM, vmz@vol.net.mt <vmz@vol.net.mt> wrote: >> Thanks Steve ,however I seem to be doing something wrong with processing >> the following data into a file calling it file1.txt. I am doing this by >> highlighting the data,copy it and paste it into a do-file and then >> saving it as file1.txt. ..... >> >> 31,666 15,127 46,793 31,259 15,409 46,668 30,215 17,182 7,397 >> 28,323 13,798 42,121 28,267 14,024 42,291 27,049 15,886 2,935 >> 6,615 2,307 8,922 6,668 2,602 9,270 6,130 3,380 9,510 >> 7,829 4,196 12,025 7,556 3,855 11,411 7,020 4,336 11,356 >> 5,583 2,809 8,392 6,098 2,856 8,954 6,177 3,145 9,322 >> 4,399 2,421 6,820 4,311 2,520 6,831 4,119 2,609 6,728 >> 3,897 2,065 5,962 3,634 2,191 5,825 3,603 2,416 6,019 >> 3,343 1,329 4,672 2,992 1,385 4,377 3,166 1,296 4,462 >> >> do "C:\DOCUME~1\Victor\LOCALS~1\Temp\STD04000000.tmp" >> >> capture erase file2.txt >> >> ! sed -e 's/,//g' file1.txt>file2.txt >> >> . >> infile v1 v2 v3 using file2.txt, clear >> (0 observations read) >> >> des >> >> Contains data >> obs: 0 >> vars: 0 >> size: 0 (100.0% of memory free) >> Sorted by: >> >> list >> >> . >> end of do-file >> >> >> >> On 28/4/2010, "Steve Samuels" <sjsamuels@gmail.com> wrote: >> >>>Victor, you can skip all the problems of adding quotes or destringing >>>by deleting the commas outside of Stata. Use the SED package, built >>>into all the Unix, Linux, and OS X distributions and also available >>>for Windows. Write the data into a text file, say file1.txt. Here is >>>an example: >>> >>>**************************CODE BEGINS************************** >>>/* >>> File file1.txt contains the following two lines with three variables >>>3,000,000 4,000,000 . >>> 40,000 300,000 100 >>>*/ >>>capture erase file2.txt >>>! sed -e 's/,//g' file1.txt>file2.txt >>> >>>infile v1 v2 v3 using file2.txt, clear >>>des >>>list >>>***************************CODE ENDS************************** >>> >>>Steve >>> >>>On Tue, Apr 27, 2010 at 5:16 PM, Martin Weiss <martin.weiss1@gmx.de> wrote: >>>> >>>> <> >>>> >>>> To answer this conclusively, we need to know where the data reside, i.e. >>>> what kind of a file. >>>> >>>> I just created a file "Book1.txt" from spreadsheet software, containing >>>> numbers with commata in them, and used -insheet using Book1.txt, tab clear- >>>> to get them into Stata. "tab" is just a red herring here so Stata does not >>>> treat the commata as delimiters. Afterwards, I -destring-ed as described >>>> before and ended up with a numeric variable in Stata. This approach could be >>>> viable for you as well. >>>> >>>> >>>> HTH >>>> Martin >>>> >>>> -----Original Message----- >>>> From: owner-statalist@hsphsun2.harvard.edu >>>> [mailto:owner-statalist@hsphsun2.harvard.edu] On Behalf Of vmz@vol.net.mt >>>> Sent: Dienstag, 27. April 2010 19:54 >>>> To: statalist@hsphsun2.harvard.edu >>>> Subject: st: RE: RE: RE: comas in numbrers >>>> >>>> I mean thousands of observations,represented by numbers with a coma in >>>> them.It is a problem because it would take too much time to quote >>>> unquote each one of them. >>>> Unfortunately,in Eurostat we come across commas in numbers. >>>> >>>> >>>> >>>> On 27/4/2010, "Martin Weiss" <martin.weiss1@gmx.de> wrote: >>>> >>>>> >>>>><> >>>>> >>>>>What do you mean? There is no substantive difference to the situation with >>>> 5 >>>>>obs, is there? Apart from the fact that you do not want to -input- them >>>> all, >>>>>but that problem is unrelated to the issue of commata... >>>>> >>>>> >>>>>HTH >>>>>Martin >>>>> >>>>> >>>>>-----Original Message----- >>>>>From: owner-statalist@hsphsun2.harvard.edu >>>>>[mailto:owner-statalist@hsphsun2.harvard.edu] On Behalf Of vmz@vol.net.mt >>>>>Sent: Dienstag, 27. April 2010 19:16 >>>>>To: statalist@hsphsun2.harvard.edu >>>>>Subject: st: RE: RE: comas in numbrers >>>>> >>>>>What do I do when there are thousands of observations? >>>>> >>>>> >>>>> >>>>>010, "Martin Weiss" <martin.weiss1@gmx.de> wrote: >>>>> >>>>>> >>>>>><> >>>>>> >>>>>>************* >>>>>> >>>>>> drop _all >>>>>> >>>>>>input str10 a >>>>>>"2,345" >>>>>>"213" >>>>>>"34,456" >>>>>>"458" >>>>>>end >>>>>> >>>>>>destring a, generate(numerica) ignore(,) >>>>>>l >>>>>>************* >>>>>> >>>>>> >>>>>>HTH >>>>>>Martin >>>>>> >>>>>>-----Original Message----- >>>>>>From: owner-statalist@hsphsun2.harvard.edu >>>>>>[mailto:owner-statalist@hsphsun2.harvard.edu] On Behalf Of vmz@vol.net.mt >>>>>>Sent: Dienstag, 27. April 2010 17:39 >>>>>>To: statalist@hsphsun2.harvard.edu >>>>>>Subject: st: RE: comas in numbrers >>>>>> >>>>>>Dear Statalist, >>>>>> >>>>>>how do you handle numbers with a coma to indicate thousands? >>>>>>I thank you before hand >>>>>>Victor M. Zammit >>>>>> >>>>>> drop _all >>>>>> >>>>>> input a >>>>>> >>>>>>a >>>>>>1. 2,345 >>>>>>2. 213 >>>>>>3. 34,456 >>>>>>4. 458 >>>>>>5. end >>>>>> >>>>>>.. >>>>>>end of do-file >>>>>> >>>>>> l >>>>>> >>>>>>+-----+ >>>>>>a >>>>>>----- >>>>>>1. 2 >>>>>>2. 213 >>>>>>3. 34 >>>>>>4. 458 >>>>>>+-----+ >>>>>> >>> >>> >>>-- >>>Steven Samuels >>>sjsamuels@gmail.com >>>18 Cantine's Island >>>Saugerties NY 12477 >>>USA >>>Voice: 845-246-0774 >>>Fax: 206-202-4783 >>> >>> >>> >>>-- >>>Steven Samuels >>>sjsamuels@gmail.com >>>18 Cantine's Island >>>Saugerties NY 12477 >>>USA >>>Voice: 845-246-0774 >>>Fax: 206-202-4783 >>>* >>>* For searches and help try: >>>* http://www.stata.com/help.cgi?search >>>* http://www.stata.com/support/statalist/faq >>>* http://www.ats.ucla.edu/stat/stata/ >>> >>> >> >> * >> * For searches and help try: >> * http://www.stata.com/help.cgi?search >> * http://www.stata.com/support/statalist/faq >> * http://www.ats.ucla.edu/stat/stata/ >> > >-- >Steven Samuels >sjsamuels@gmail.com >18 Cantine's Island >Saugerties NY 12477 >USA >Voice: 845-246-0774 >Fax: 206-202-4783 > >* >* For searches and help try: >* http://www.stata.com/help.cgi?search >* http://www.stata.com/support/statalist/faq >* http://www.ats.ucla.edu/stat/stata/ > > * * For searches and help try: * http://www.stata.com/help.cgi?search * http://www.stata.com/support/statalist/faq * http://www.ats.ucla.edu/stat/stata/

**References**:**Re: st: RE: comas in numbrers***From:*Steve Samuels <sjsamuels@gmail.com>

- Prev by Date:
**Re: st: missing labels in svy: proportion** - Next by Date:
**st: RE: SSC Activity, April 2010** - Previous by thread:
**Re: st: RE: comas in numbrers** - Next by thread:
**RE: st: RE: comas in numbrers** - Index(es):