Bookmark and Share

Notice: On April 23, 2014, Statalist moved from an email list to a forum, based at statalist.org.


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: st: Frequency weighted cluster analysis


From   Nick Cox <njcoxstata@gmail.com>
To   statalist@hsphsun2.harvard.edu
Subject   Re: st: Frequency weighted cluster analysis
Date   Wed, 11 Jan 2012 00:15:16 +0000

Why would a cluster analysis change because some observations are
duplicated? The similarity or dissimilarity of objects is not affected
by their frequency. What does this SAS statement do that should be
replicated by Stata?

Nick

On Tue, Jan 10, 2012 at 11:25 PM, Brendan Halpin <brendan.halpin@ul.ie> wrote:

> Is it possible to use frequency weighted data with cluster (and in
> particular clustermat)?
>
> From the manual I see that it is not intended to be possible -- no
> weight term in the syntax, for instance. However, for datasets with
> significant rates of duplicates, it could be a way of reducing the
> computational burden very significantly.
>
> SAS documentation suggests that PROC CLUSTER has a FREQ statement that
> does this.
>
> The Stata manual also suggests that programmers might implement their
> own clustering algorithms, but there are no examples of how this might
> be done.
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/


© Copyright 1996–2018 StataCorp LLC   |   Terms of use   |   Privacy   |   Contact us   |   Site index