Re: st: clustering with a new dataset

From	Frank Gallo <[email protected]>
To	[email protected]
Subject	Re: st: clustering with a new dataset
Date	Mon, 04 May 2009 04:16:42 -0400

Hi Walt,

I am a Stata beginner, so I have little to offer regarding the procedures available in Stata; maybe other listers can help. Conceptually speaking, however, it sounds as though she wants to cross-validate her model. A well-fitting cluster (or factor) model is tentative and requires post-hoc validation. The researcher may use a random holdout (validation) sample for cross-validation (i.e., within-sample replication, which requires a large sample size). Even if a model fits the data well, that does not mean it is the correct model, or even the best model, for explaining the phenomenon of interest. There may be equivalent models that fit the sample data or other data sources equally well. If the researcher uncovers equivalent models, there is no statistical technique for discriminating among them; only substantive knowledge about the phenomenon lets the researcher decide which equivalent model is best. The researcher may judge a model "good" on both theoretical and statistical grounds and thus provisionally accept it. Cross-validation on different independent samples from the same population (which seems to be your case) can enhance the utility of the model. You may compare the models by examining the overall fit indices (e.g., chi-square, RMSEA) and the significance of the path coefficients to offer the client some insight. I hope this helps.
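
To make the cross-validation idea concrete, here is a minimal sketch in Python rather than Stata, since I cannot vouch for the Stata route. It assumes one plausible reading of "applying last year's clusters": compute last year's cluster centroids, assign each new observation to the nearest one, and then check how well that partition agrees with a fresh clustering of the new data. Nearest-centroid assignment is a stand-in, not Ward's linkage itself. The toy data, the label values, and the function names (`centroids`, `assign`, `agreement`) are all hypothetical.

```python
import math

def centroids(rows, labels):
    """Mean of each variable within each cluster label."""
    sums, counts = {}, {}
    for row, lab in zip(rows, labels):
        acc = sums.setdefault(lab, [0.0] * len(row))
        for j, value in enumerate(row):
            acc[j] += value
        counts[lab] = counts.get(lab, 0) + 1
    return {lab: [s / counts[lab] for s in acc] for lab, acc in sums.items()}

def assign(rows, cents):
    """Give each row the label of its nearest centroid (Euclidean distance)."""
    return [min(cents, key=lambda lab: math.dist(row, cents[lab])) for row in rows]

def agreement(a, b):
    """Rand index: share of observation pairs on which two partitions agree."""
    n = len(a)
    pairs = [(i, j) for i in range(n) for j in range(i + 1, n)]
    same = sum((a[i] == a[j]) == (b[i] == b[j]) for i, j in pairs)
    return same / len(pairs)

# Hypothetical toy data: last year's survey with its cluster labels,
# and this year's survey on the same two variables.
last_year = [[1.0, 1.0], [1.2, 0.8], [5.0, 5.0], [5.2, 4.8]]
last_labels = [1, 1, 2, 2]
this_year = [[0.9, 1.1], [5.1, 5.1], [4.9, 5.3]]

cents = centroids(last_year, last_labels)
applied = assign(this_year, cents)   # last year's clusters applied to new data

# Hypothetical labels from rerunning the clustering on this year's data alone.
# Label values need not match; only the grouping matters for the Rand index.
fresh = [7, 3, 3]
print(applied, agreement(applied, fresh))
```

A high agreement value would suggest that last year's structure replicates in the new survey; a low one would suggest the clusters have shifted. In practice the variables should be on the same scale in both years before distances are computed.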


Best,
Frank



On May 3, 2009, at 9:08 PM, Data Analytics Corp. wrote:

Hi,

I ran a cluster analysis last year for a client using "cluster ward varlist", where the variables in varlist came from a survey. This worked fine and the client was happy. This year, she returned with a new dataset (same variables, just new values from a new survey) and wants last year's clusters applied to this year's data. I can't see how to do this; in fact, it doesn't seem to make sense. Any suggestions, or should I tell her that I can just rerun the old commands and MAYBE the same clusters will appear?


Thanks,

Walt



--
________________________

Walter R. Paczkowski, Ph.D.
Data Analytics Corp.
44 Hamilton Lane
Plainsboro, NJ 08536
________________________
(V) 609-936-8999
(F) 609-936-3733
[email protected]
www.dataanalyticscorp.com

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/
