Bookmark and Share

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: st: Fuzzy collapse


From   Charles Vellutini <charles.vellutini@ecopa.com>
To   "statalist@hsphsun2.harvard.edu" <statalist@hsphsun2.harvard.edu>
Subject   RE: st: Fuzzy collapse
Date   Thu, 26 Jan 2012 08:58:11 -0800

Thank you Dimitriy

I am strangely unable to download -strgroup- from SSC (says the file is not there). We have Stata/MP 12.0.

Google Refine seems an excellent suggestion since our data are precisely keywords that were processed by Google's Adwords.

Charles

-----Message d'origine-----
De : owner-statalist@hsphsun2.harvard.edu [mailto:owner-statalist@hsphsun2.harvard.edu] De la part de Dimitriy V. Masterov
Envoyé : jeudi 26 janvier 2012 17:25
À : statalist@hsphsun2.harvard.edu
Objet : Re: st: Fuzzy collapse

Charles,

Here's two things that can help.

1) Depending on your OS and what thousands means, strgroup and levenshtein from SCC might get you most of the way there.

2) A non-Stata, but free, solution is to try Google Refine.

DVM
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/



*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/


© Copyright 1996–2018 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   Site index