Bookmark and Share

Notice: On March 31, it was announced that Statalist is moving from an email list to a forum. The old list will shut down at the end of May, and its replacement, statalist.org is already up and running.


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

st: Observations in Sequence analysis


From   Sarah Park <sarahjwpark@gmail.com>
To   statalist@hsphsun2.harvard.edu
Subject   st: Observations in Sequence analysis
Date   Wed, 3 Oct 2012 02:14:41 -0700

I am studying sequence of portfolios of firms, but some firms die and
others are established after my first observation (so some firms have
observation 1-100, but others from 20-70, for example). But, I have
the order variable for those firms starting at 1 to avoid the problem
of gaps (I didn't think treating non-existing observations as missing
data made any sense). However, when I run the cluster analysis based
on the dissimilarity matrix afterwards, I have Stata categorizing some
of the sequence with same elements (ex. 333, 333333, 333333333) into
the same cluster, some into different clusters when they are clearly
the same sequence. I tried with single, wards, average, and other
measures but the result is the same.

First, is this the right way of writing the time variable?
Does anyone have any idea how to fix this problem?
*
*   For searches and help try:
*   http://www.stata.com/help.cgi?search
*   http://www.stata.com/support/faqs/resources/statalist-faq/
*   http://www.ats.ucla.edu/stat/stata/


© Copyright 1996–2014 StataCorp LP   |   Terms of use   |   Privacy   |   Contact us   |   Site index