[R] Hierarchical Cluster Analysis with large dataset
Sarah Goslee
sarah.goslee at gmail.com
Sun Nov 3 23:01:30 CET 2013
Hi,
I think your dataset is too large to be interpretable, but in general
you should check out the cluster package, specifically clara(), which
is intended for use with large data.
Sarah
On Sun, Nov 3, 2013 at 4:42 AM, Petar Milin
<petar.milin at uni-tuebingen.de> wrote:
> Hello!
> Can anyone give me advice on running Hierarchical Cluster Analysis on large
> datasets? For example, 80000x10000. Calculating distances on such a
> dataframe seems impossible even on very powerful computer.
>
> Also, any other advice that would lead to reduction of dimensionality,
> i.e., cluster/group variables would be more than welcomed.
>
> Many thanks,
> PM
>
--
Sarah Goslee
http://www.functionaldiversity.org
More information about the R-help
mailing list