Stata The Stata listserver
[Date Prev][Date Next][Thread Prev][Thread Next][Date index][Thread index]

st: RE: Matching strings


From   "Nick Cox" <n.j.cox@durham.ac.uk>
To   <statalist@hsphsun2.harvard.edu>
Subject   st: RE: Matching strings
Date   Sun, 24 Nov 2002 15:30:15 -0000

Pedro Martins

> I would like to match a large list of names available in 
> two data sets.
> One problem is that the names are not always exactly the 
> same because of
> typos. I can also use other characteristics of each 
> observation and do a
>    -joinby- but this still provides a large number of 
> possible matches
> for each observation. Is there any procedure in Stata for 
> one to match
> string variables that may not be exactly equal?
> 

There is a -match()- function. Also, 
Michael Blasnik's -egen, soundex()- 
function may help. 

On the whole, however, no such 
function will be as smart as you 
in deciding what is really the 
same and what is different. 

Nick 
n.j.cox@durham.ac.uk 
*
*   For searches and help try:
*   http://www.stata.com/support/faqs/res/findit.html
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/



© Copyright 1996–2014 StataCorp LP   |   Terms of use   |   Privacy   |   Contact us   |   What's new   |   Site index