[Date Prev][Date Next][Thread Prev][Thread Next][Date index][Thread index]

st: string

From	Viktor Slavtchev <[email protected]>
To	[email protected]
Subject	st: string
Date	Mon, 17 Mar 2008 14:48:03 +0100

Dear list,
I want to merge two files where the common variable is a string (names of cities). However, there are non systematic differences in the notions.
For example, you can find: "Berlin" in the first file but " Berlin" in the second. In other cases you can find "Rome" and "Roma,IT". Or "Paris, FR" and "Paris/FR"
I was tot able to find any systematics in the notion. I have over 40.000 unique observations.
How can I search for substrings in Stata? For example, for "*Rom*", the largest match between "Rome" and "Roma,IT".
I think this could help to solve some problems. Or does anybody know a better way to deal with such kind of 'bad' data?
thanks
viktor
*
* For searches and help try:
* http://www.stata.com/support/faqs/res/findit.html
* http://www.stata.com/support/statalist/faq
* http://www.ats.ucla.edu/stat/stata/

Follow-Ups:
- st: Re: string
  - From: "Michael Blasnik" <[email protected]>
- Re: st: string
  - From: "Vladimir Vakhitov" <[email protected]>

Prev by Date: Re: st: WHILE command
Next by Date: Re: st: permutations
Previous by thread: st: sjlatex under MiKTeX 2.4 and above
Next by thread: Re: st: string
Index(es):
- Date
- Thread