Bookmark and Share

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: Extract a letter between numbers


From   "Dimitriy V. Masterov" <[email protected]>
To   [email protected]
Subject   Re: st: Extract a letter between numbers
Date   Mon, 22 Nov 2010 15:41:53 -0500

I think you may want to fuzzy merge your dirty address data and your
clean data using nearmrg, which you can get from scc.

An alternative way would to append your two data sets and then use
strgroup on the variable that is the stacked version of your clean and
dirty addresses. That will give you the closest match.

Neither one will be perfect and may take a long time/fail if you have
too much data. The latter approach has some operating system
restrictions as well.

DVM
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/


© Copyright 1996–2018 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   Site index