[R] Hierarchical Cluster Analysis with large dataset
Bert Gunter
gunter.berton at gene.com
Sun Nov 3 16:34:25 CET 2013
(Offlist, since this is just a personal comment).
I cannot help you -- but it sounds like the sort of thing that you
should look for on the Bioconductor list.
But I wonder how you could possibly interpret the results even if you
could get them. I would think they would be more noise than signal,
and making sense of such a mess would be hopeless. Maybe you need to
rethink your approach.
No need to respond to me, of course.
Cheers,
Bert
On Sun, Nov 3, 2013 at 1:42 AM, Petar Milin
<petar.milin at uni-tuebingen.de> wrote:
> Hello!
> Can anyone give me advice on running hierarchical cluster analysis on large
> datasets, for example 80000x10000? Calculating distances on such a
> data frame seems impossible even on a very powerful computer.
>
> Also, any other advice on reducing dimensionality, i.e., clustering or
> grouping variables, would be more than welcome.
>
> Many thanks,
> PM
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
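
For a sense of scale, the sizes in the question can be checked with some quick arithmetic. The sketch below assumes a dense distance object that stores one 8-byte double per pair of items (the lower triangle, as dist() returns); dist_bytes() is a hypothetical helper written for this illustration, not a base R function.

```r
# Bytes needed for a dense lower-triangle distance object on n items,
# assuming 8-byte doubles. dist_bytes() is a hypothetical helper.
dist_bytes <- function(n) {
  n_pairs <- n * (n - 1) / 2  # number of pairwise distances (double arithmetic, no integer overflow)
  n_pairs * 8
}

rows_gb <- dist_bytes(80000) / 1e9  # clustering the 80000 rows
cols_gb <- dist_bytes(10000) / 1e9  # clustering the 10000 columns (variables)

cat(sprintf("rows: %.1f GB, columns: %.1f GB\n", rows_gb, cols_gb))
```

Under these assumptions the row-wise distances come to roughly 25.6 GB, while the column-wise distances need about 0.4 GB, so grouping the variables looks like the more tractable direction.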
--
Bert Gunter
Genentech Nonclinical Biostatistics
(650) 467-7374