[R] K-means result - variance between cluster
Christian Hennig
chrish at stats.ucl.ac.uk
Fri Jul 2 11:56:51 CEST 2010
Dear Ralph,
between and within clusters sum of squares (if you want variances, you
need to divide them by the appropriate constant!) add up to the
overall sum of squares, so you can get the beween clusters ss by
computing the overall ss (one possibility to get this is to run kmeans
with k=1) and subtracting the within cluster ss from it.
Note, however, that the F-value cannot be interpreted in the usual way and
is particulary not F-distributed when computed on clusters from k-means,
because for F-distribution you'd need to assume that groups are determined
independently of the data.
Hope this helps,
Christian
On Fri, 2 Jul 2010, Ralph Modjesch wrote:
> Hi,
>
> I like to present the results from the clustering method k-means in
> terms of variances: within and between Cluster. The k-means object
> gives only the within cluster sum of squares by cluster, so the between
> variance part is missing,for calculation the following table, which I
> try to get.
>
> Number of | Variance within | Var between | Var total | F-value
> Cluster k | cluster | cluster | |
> ===============================================================
> 2 .......| 25,00 ..........| 75,00 ......| 100 ......| 1,5
> 3 .......| 45,00 ..........| 55,00 ......| 100 ......| 1,7
>
> Is there any package/ function which will do that?
>
>
> --
> Mit freundlichen Grüßen
>
> Ralph Modjesch
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>
*** --- ***
Christian Hennig
University College London, Department of Statistical Science
Gower St., London WC1E 6BT, phone +44 207 679 1698
chrish at stats.ucl.ac.uk, www.homepages.ucl.ac.uk/~ucakche
More information about the R-help
mailing list