[R] non-uniqueness in cluster analysis

Bruno Giordano bruno at speech.kth.se
Wed Dec 3 15:32:40 CET 2003


Hi,
I'm clustering objects defined by categorical variables with a hierarchical
algorithm - average linkage.
My distance matrix (general dissimilarity coefficient) includes several
distances with exactly the same values.
As I see, a standard agglomerative procedure ignores this problems, simply
selecting, above equal distances, the one that comes first.
For this reason the analysis in output depends strongly on the orderings of
the objects within the raw data matrix.
Is there a standard procedure to deal with this?
Thanks
    Bruno




More information about the R-help mailing list