I am using the command "cluster kmeans" in order to cluster a sample of
objects on the basis of predefined attributes.
The thing is that every time I run the command, the identity of the clusters
changes, although the allocation of the objects is always the same. In
other words, if there are only 2 clusters, the first cluster is labelled 1
in one run and 2 in another, and vice versa for the second cluster.
The problem is that in my case I want to simulate some attributes and check
how well the clustering command does (on average) in allocating the
objects to the correct clusters.
Note that all the objects are ordered, in the sense that the first N/2
objects (which belong to the first cluster) are entered first.
Is there any way I can tell the program to make the identity of the first
cluster always equal to 1 and the identity of the second cluster equal to
2? Then in each replication I can easily find out whether the
cross-sections have been allocated to the correct cluster.
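
A minimal sketch of the check described above, written in Python rather than
Stata purely for illustration. It assumes exactly two clusters, that the first
N/2 objects truly belong to cluster 1, and that the cluster variable has
already been read into a plain list. The function names are hypothetical and
are not part of any Stata command.

```python
from collections import Counter


def relabel_two_clusters(labels):
    """Relabel a 2-cluster solution so that the cluster holding most of
    the first N/2 objects is called 1 and the other cluster is called 2.

    Assumption: the first N/2 objects are the true members of cluster 1.
    If the first half is split evenly, the label seen first is used.
    """
    half = len(labels) // 2
    # Label that occurs most often among the first N/2 objects.
    majority = Counter(labels[:half]).most_common(1)[0][0]
    return [1 if lab == majority else 2 for lab in labels]


def share_correct(labels):
    """Share of objects placed in their true cluster after relabelling."""
    n = len(labels)
    half = n // 2
    truth = [1] * half + [2] * (n - half)
    aligned = relabel_two_clusters(labels)
    return sum(a == t for a, t in zip(aligned, truth)) / n


# Two runs with the same allocation but swapped cluster identities.
run_a = [1, 1, 1, 2, 2, 2, 2, 1]
run_b = [2, 2, 2, 1, 1, 1, 1, 2]
print(share_correct(run_a), share_correct(run_b))
```

Both runs give the same share of correctly allocated objects, because the
relabelling depends only on which objects are grouped together and not on the
numbers the clustering command happens to assign. Averaging this share over
the replications would give the "on average" measure.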