AW: [R] non-hierarchical non-exclusive clustering of large data sets
TEMPL Matthias
Matthias.Templ at statistik.gv.at
Mon May 24 16:39:33 CEST 2004
I think the "cmeans" method in library(e1071) works better for large data sets as "fanny".
(note,not for this data: fanny has also problems with standardized large data sets - here produce fanny the same memberships for all observations; cmeans works "correctly")
Matthias
> -----Ursprüngliche Nachricht-----
> Von: Bhaskar S. Manda [mailto:bhaskar at cacmnet.com]
> Gesendet: Montag, 24. Mai 2004 17:12
> An: r-help at stat.math.ethz.ch
> Betreff: Re: [R] non-hierarchical non-exclusive clustering of
> large data sets
>
>
> On Mon, 24 May 2004 15:58:57 +0200, Murk Wuite wrote:
> > I'm trying to use R to cluster words with related meanings. Does
> > anyone know of a non-hierarchical clustering method in R
> that produces
> > non-exclusive clusters? With non-exclusive, I mean that words should
>
> The "fanny" method in library(cluster) outputs probabilities
> of membership in each cluster.
>
> > the one above, its dimensions would be in the order of (100000,
> > 100000). Does anyone know if this would cause practical problems,
> > perhaps very slow clustering?
>
> I had a much smaller matrix, 4000x3, fanny took about 4
> minutes wall clock time on a lightly loaded (there were many
> other processes, but none
> computational) 1.4 GHz Athlon, It was completely CPU-bound.
>
> --
> bhaskar
>
> ______________________________________________
> R-help at stat.math.ethz.ch mailing list
> https://www.stat.math.ethz.ch/mailman/listinfo> /r-help
> PLEASE
> do read the posting guide!
> http://www.R-project.org/posting-guide.html
>
More information about the R-help
mailing list