The Stata listserver

RE: st: Re: Finding "near"-matches


From   "Nick Cox" <n.j.cox@durham.ac.uk>
To   <statalist@hsphsun2.harvard.edu>
Subject   RE: st: Re: Finding "near"-matches
Date   Fri, 28 Oct 2005 18:25:20 +0100

Without leaving Stata, 

strpos(myvar, "Ford") 

is positive so long as -myvar- contains "Ford" anywhere. 
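As a minimal sketch of how that might be used, assuming a small made-up dataset built from the names quoted below (the variable -hasford- is hypothetical, purely for illustration). Note that strpos() is case-sensitive, so "FORD" or "ford" would not be picked up as written.

```stata
* Hypothetical example data: company names held in a string variable -myvar-
clear
input str30 myvar
"Ford Co."
"Ford Corporation"
"Ford Inc."
"XYZ Tech"
"XYZ Technology Inc."
end

* strpos() returns the position of the first occurrence of "Ford" in -myvar-,
* or 0 if "Ford" does not occur at all
generate byte hasford = strpos(myvar, "Ford") > 0

* Show only the observations that contain "Ford"
list myvar if hasford
```

This flags any observation containing the literal substring, so it finds variants of a known stem rather than near-matches in general.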

However, that may well not be the approach you need 
here. 

Nick 
n.j.cox@durham.ac.uk 

Aaron
 
> The topic gets more and more interesting. I often need to match
> 'fuzzily' the names from two databases that have very minor
> differences. Here are some examples:
> 
> Ford Co.
> Ford Corporation
> Ford Inc. (just an example)
> 
> or
> 
> XYZ Tech
> XYZ Technology Inc.
> 
> Can you recommend some programs to generate a list of 'fuzzy' or
> 'near' matches for a name (one or more alphanumeric
> characters)? Even if a program provides the three possible matches for
> the name 'Ford', that's still better than hand-checking.

