[R] Subsets in mclust

Ron Wehrens rwehrens at sci.kun.nl
Thu Feb 14 13:16:18 CET 2002


> Dear group,
> 
> I want to use the mclust package on large data, and therefore I want to use
> a subset in the initial clustering phase. From help(mclust):
> 
>        k: If `k' is specified, the hierarchical clustering phase will
>           use a sample of size `k' of the data in the initial
>           hierarchical clustering phase. The default is to use the
>           entire data set. 
> 
> m2 is a matrix (not very large, but the same happens always when specifying
> k):
> 
> > dim(m2)
> [1] 242   2
> > em321 <- emclust(m2,k=50)
> Error in mhtree.VVV(data[smpl, ]) : data must be a matrix with  at least 2 columns
> > em321 <- emclust(m2)
> (No problems)
> > em321 <- emclust(m2,modelid="VEV",k=50)
> Error in mhtree.VVV(data[smpl, ]) : data must be a matrix with  at least 2 columns
> > library(help=mclust)
> mclust          Model-based cluster analysis
> 
> Description:
> 
> Package: 	mclust
> Version: 	1.1-5
> (...)
> Built: R 1.3.1; i686-pc-linux-gnu; Thu Dec 13 04:02:36 CET 2001
> 
> What is going on here?
> (Mail is sent to the maintainer, too.)

It's a bug which I introduced... I'll fix it right away. A workaround is to
give a vector of 50 (random) indices that will serve as the objects in the
subset: emclust(m2,modelid="VEV", k=sample(dim(m2)[1], 50))
The bug is that is.vector(k) is true for a vector of length 1 as well...

Ron
> 
> Best,
> Christian
> I
> 
> -- 
> ***********************************************************************
> Christian Hennig
> Seminar fuer Statistik, ETH-Zentrum (LEO), CH-8092 Zuerich (current)
> and Fachbereich Mathematik-SPST/ZMS, Universitaet Hamburg
> hennig at stat.math.ethz.ch, http://stat.ethz.ch/~hennig/
> hennig at math.uni-hamburg.de, http://www.math.uni-hamburg.de/home/hennig/
> #######################################################################
> ich empfehle www.boag.de
> 
> 
> -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
> r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
> Send "info", "help", or "[un]subscribe"
> (in the "body", not the subject !)  To: r-help-request at stat.math.ethz.ch
> _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
> 


---------------------------------+------------------------------------
Ron Wehrens                      |  Dept. of Chemometrics
Email: rwehrens at sci.kun.nl       |  University of Nijmegen
http://www.sci.kun.nl/cac/       |  Toernooiveld 1
Tel: +31 24 365 2053             |  6525 ED Nijmegen       
Fax: +31 24 365 2653             |  The Netherlands        
---------------------------------+------------------------------------
     
-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !)  To: r-help-request at stat.math.ethz.ch
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._



More information about the R-help mailing list