Bookmark and Share

Notice: On March 31, it was announced that Statalist is moving from an email list to a forum. The old list will shut down on April 23, and its replacement, statalist.org is already up and running.


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: Stata equivalent of SPEDIS function in SAS


From   Nick Cox <njcoxstata@gmail.com>
To   statalist@hsphsun2.harvard.edu
Subject   Re: st: Stata equivalent of SPEDIS function in SAS
Date   Wed, 20 Jul 2011 09:00:59 -0500

You might get more answers by defining "spelling distance" precisely,
Many members of this list, probably most, don't use SAS or are not
familiar with its detailed commands and functions. But look at
-strgroup- from SSC:

. ssc desc strgroup

------------------------------------------------------------------------------------------------------------------------------------
package strgroup from http://fmwww.bc.edu/repec/bocode/s
------------------------------------------------------------------------------------------------------------------------------------

TITLE
      'STRGROUP': module to match strings based on their Levenshtein
edit distance

DESCRIPTION/AUTHOR(S)

      strgroup matches similar strings together. This can be useful
      when merging data that contain typos. For example, "widgets" will
      not merge with "widgetts" because the strings are not identical.
      strgroup provides a way to match strings in an objective and
      automated manner.

      KW: data management
      KW: string match
      KW: string merge
      KW: string group
      KW: levenshtein

      Requires: Stata version 9.2

      Distribution-Date: 20100814

      Author: Julian Reif , University of Chicago
      Support: email jreif@uchicago.edu


INSTALLATION FILES                             (type net install strgroup)
      strgroup.ado
      strgroup.hlp
      ../l/levenshtein.ado
      ../l/levenshtein.hlp
      strgroup.plugin



On Wed, Jul 20, 2011 at 8:46 AM, Zou, Joe Hong <hongzou@cityu.edu.hk> wrote
>
> Does anybody know whether there is a Stata routine that calculates the
> spelling distance between two names as what SPEDIS function in SAS does?
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/


© Copyright 1996–2014 StataCorp LP   |   Terms of use   |   Privacy   |   Contact us   |   Site index