[R] non-uniqueness in cluster analysis
Bruno Giordano
bruno at speech.kth.se
Wed Dec 3 15:32:40 CET 2003
Hi,
I'm clustering objects defined by categorical variables with a hierarchical
algorithm - average linkage.
My distance matrix (general dissimilarity coefficient) includes several
distances with exactly the same values.
As I see, a standard agglomerative procedure ignores this problems, simply
selecting, above equal distances, the one that comes first.
For this reason the analysis in output depends strongly on the orderings of
the objects within the raw data matrix.
Is there a standard procedure to deal with this?
Thanks
Bruno
More information about the R-help
mailing list