[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

st: cluster analysis with missing data

From	"Data Analytics Corp." <[email protected]>
To	Stata Listserve <[email protected]>
Subject	st: cluster analysis with missing data
Date	Wed, 29 Jul 2009 11:08:34 -0400

Hi Stata,

A client has a dataset from a survey in which consumers were shown arandomly selected set of 25 needs statements from a total of 152statements. Each consumer saw only 25. The client want to cluster the152 needs statements (i.e., 152 variables). Since the 25 were selectedat random, this should be a Missing Completely at Random problem. Butwith each consumer responding to only 25, each record will have 127missing values. I assume that Stata's clustering routines will dolist-wise deletion so there should be no data available for clustering.Does anyone have any ideas how to handle this? Any suggestions? Can asimilarity matrix still be created (how?) with so many missing data points?


Thanks,

Walt



--
________________________

Walter R. Paczkowski, Ph.D.
Data Analytics Corp.
44 Hamilton Lane
Plainsboro, NJ 08536
________________________
(V) 609-936-8999
(F) 609-936-3733
[email protected]
www.dataanalyticscorp.com

*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/statalist/faq
*   http://www.ats.ucla.edu/stat/stata/

Prev by Date: Re: st: Nested logit with several degenerated nests
Next by Date: Re: st: Pulling out observations based on a condition
Previous by thread: st: How to Reconcile R2 with Economic Significance
Next by thread: st: calculate the average by subgroup for a panel dataset
Index(es):
- Date
- Thread