Stata: Data Analysis and Statistical Software
   >> Home >> Products >> Capabilities >> Cluster analysis
order stataorder stata

Cluster analysis

Hierarchical clustering

  • Single linkage
  • Complete linkage
  • Average linkage
  • Ward’s linkage (including Ward’s method)
  • Weighted-average linkage
  • Centroid linkage
  • Median linkage

Nonhierarchical

  • Kmeans
  • Kmedians

Cluster on observations

Cluster using any proximity matrix

Dendrograms

  • Full trees
  • Subtrees
  • Upper portion of tree
  • Vertical or horizontal orientation
  • Branch counts

Stopping rules

  • Calínski and Harabasz pseudo-F index
  • Duda and Hart Je(2)/Je(1) index

Support tools

  • Generate summary and grouping variables
  • Attach notes to analyses

Similarity/dissimilarity measures for continuous data

  • L2/Euclidean
  • L1/absolute/cityblock/manhattan
  • L(#)
  • Canberra
  • Correlation
  • Angular

Similarity/dissimilarity measures for binary data

  • Matching
  • Jaccard
  • Russell
  • Hamann
  • Dice
  • Antidice
  • Sneath
  • Rogers
  • Ochiai
  • Yule
  • Anderberg
  • Kulczynski
  • Gower2
  • Pearson

Gower measure for mixed binary and continuous data

Result-management utilities

  • dir
  • list
  • drop
  • use
  • rename

User-extensible commands

  • Ability to add new clustering methods and utilities
  • Full set of tools to ease making additions
Bookmark and Share 
Stata 12
Overview: Why use Stata?
Stata/MP
Capabilities
Overview
Sample session
User-written commands
New in Stata 12
Supported platforms
Which Stata?
Technical support
User comments
Like us on Facebook Follow us on Twitter Follow us on LinkedIn Google+ Watch us on YouTube
Follow us
© Copyright 1996–2013 StataCorp LP   |   Terms of use   |   Privacy   |   Contact us   |   Site index   |   View mobile site