Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
From | Nick Cox <njcoxstata@gmail.com> |
To | statalist@hsphsun2.harvard.edu |
Subject | Re: st: Stata equivalent of SPEDIS function in SAS |
Date | Wed, 20 Jul 2011 09:00:59 -0500 |
You might get more answers by defining "spelling distance" precisely, Many members of this list, probably most, don't use SAS or are not familiar with its detailed commands and functions. But look at -strgroup- from SSC: . ssc desc strgroup ------------------------------------------------------------------------------------------------------------------------------------ package strgroup from http://fmwww.bc.edu/repec/bocode/s ------------------------------------------------------------------------------------------------------------------------------------ TITLE 'STRGROUP': module to match strings based on their Levenshtein edit distance DESCRIPTION/AUTHOR(S) strgroup matches similar strings together. This can be useful when merging data that contain typos. For example, "widgets" will not merge with "widgetts" because the strings are not identical. strgroup provides a way to match strings in an objective and automated manner. KW: data management KW: string match KW: string merge KW: string group KW: levenshtein Requires: Stata version 9.2 Distribution-Date: 20100814 Author: Julian Reif , University of Chicago Support: email jreif@uchicago.edu INSTALLATION FILES (type net install strgroup) strgroup.ado strgroup.hlp ../l/levenshtein.ado ../l/levenshtein.hlp strgroup.plugin On Wed, Jul 20, 2011 at 8:46 AM, Zou, Joe Hong <hongzou@cityu.edu.hk> wrote > > Does anybody know whether there is a Stata routine that calculates the > spelling distance between two names as what SPEDIS function in SAS does? * * For searches and help try: * http://www.stata.com/help.cgi?search * http://www.stata.com/support/statalist/faq * http://www.ats.ucla.edu/stat/stata/