Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.
From | Scott Merryman <scott.merryman@gmail.com> |
To | statalist@hsphsun2.harvard.edu |
Subject | Re: st: Spelling errors in string variables |
Date | Wed, 9 Jan 2013 06:23:48 -0600 |
You might try -soundex- clear input str8 name "Mexico" "Mejico" "mexico" end gen soundex = soundex(name) l Scott On Wed, Jan 9, 2013 at 6:02 AM, Estrella Gomez <estrellastata@gmail.com> wrote: > Does anybody know how can I check the existence of spelling error in my > dataset? I have two string variables with nearly 1,500,000 observations, > and I would like to check if in some cases there are different names for > the same individual (i.e.: Mexico / Mejico / mexico) * * For searches and help try: * http://www.stata.com/help.cgi?search * http://www.stata.com/support/faqs/resources/statalist-faq/ * http://www.ats.ucla.edu/stat/stata/