Statalist



Re: st: RE: Matching Names


From   "Michael Blasnik" <michael.blasnik@gmail.com>
To   statalist@hsphsun2.harvard.edu
Subject   Re: st: RE: Matching Names
Date   Fri, 8 Aug 2008 11:33:02 -0400

You may want to check out reclink, which I wrote and which is
available from SSC.  It uses a bigram string comparator to score
agreement between strings.  reclink is especially helpful if you have
other variables that may be useful for the match, such as gender, age
or location.  Even without such variables, you may benefit from
creating derived variables to add to the reclink matching process,
such as a soundex code for each name (first and last) and initials.
These could be used for blocking in an initial run of reclink to
identify the better matches more quickly.
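
To make the idea concrete, here is a rough sketch in Python (not Stata) of
bigram scoring combined with soundex blocking.  It is not reclink's
code: the Dice coefficient is just one common bigram comparator and is
assumed here, and the names bigram_score, soundex and best_match are
made up for illustration.

```python
from collections import Counter

# Standard American Soundex consonant codes.
_CODES = {}
for _letters, _digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                         ("l", "4"), ("mn", "5"), ("r", "6")):
    for _ch in _letters:
        _CODES[_ch] = _digit


def bigrams(s):
    """Count the adjacent character pairs in s (case and punctuation ignored)."""
    s = "".join(ch for ch in s.lower() if ch.isalnum())
    return Counter(s[i:i + 2] for i in range(len(s) - 1))


def bigram_score(a, b):
    """Dice coefficient on bigrams: 1.0 means identical bigram sets.

    Assumption: this stands in for a bigram comparator in general; it is
    not claimed to be the formula reclink uses.
    """
    ba, bb = bigrams(a), bigrams(b)
    total = sum(ba.values()) + sum(bb.values())
    if total == 0:
        return 0.0
    shared = sum((ba & bb).values())
    return 2.0 * shared / total


def soundex(name):
    """Four-character Soundex code, e.g. 'Robert' -> 'R163'."""
    letters = [ch for ch in name.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    out = letters[0].upper()
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        code = _CODES.get(ch, "")
        if code and code != prev:
            out += code
        if ch not in "hw":  # h and w do not separate repeated codes
            prev = code
    return (out + "000")[:4]


def best_match(name, candidates):
    """Block on soundex, then pick the candidate with the highest bigram score."""
    key = soundex(name)
    block = [c for c in candidates if soundex(c) == key]
    return max(block, key=lambda c: bigram_score(name, c), default=None)
```

Blocking on the soundex code means the bigram comparison only runs
within each block rather than across every possible pair, which is why
a blocked first pass finds the better matches more quickly.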

Michael Blasnik


