[BioC] hclust and (Eisen+ de Hoon) cluster3 program

Benjamin Haibe-Kains bhaibeka at ulb.ac.be
Tue Dec 7 08:23:28 CET 2004


Thanks for the link to the source code, I will look it into details.

For the distance, I use the same one in hclust and eisen, i.e. the 
pearson correlation (also called uncentered correlation). Moreover, I 
use the 'median center' normalization.

benjamin

Charles C. Berry wrote:

> On Mon, 6 Dec 2004, Benjamin Haibe-Kains wrote:
>
>> Hi Michael,
>>
>> I think that the differences are  too important to be due to 
>> different implementation decisions. Actually my problem is that I 
>> have a group of 1 object and the rest in the other group when I use 
>> the 'centroid' hclust (I use cutree to have the main two groups)  and 
>> it's not the case with other softwares. It looks like a bug in the 
>> Fortran routine but I can not access to it.
>
>
> Source code is available from CRAN. In
>
>     http://cran.r-project.org/src/base/R-2/R-2.0.1.tar.gz
>
> the directory
>
>     R-2.0.0/src/library/stats/src
>
> contains hclust.f and hclust-utils.c.
>
> Happy debugging.
>
> ;-)
>
>
>>
>> Have you reported this "bug" before ? Can I write my 'centroid' 
>> method easily ?
>>
>> cheers,
>>
>> benjamin
>>
>> michael watson (IAH-C) wrote:
>>
>>> Benjamin
>>>
>>> You will likely get different results from all clustering software, 
>>> even
>>> when using the same parameters.  This is because many arbitrary
>>> decisions have to be made during a hierarchical cluster analysis and
>>> different programmers will make those decisions in different ways.
>>>
>>> Mick
>>>
>>> -----Original Message-----
>>> From: bioconductor-bounces at stat.math.ethz.ch
>>> [mailto:bioconductor-bounces at stat.math.ethz.ch] On Behalf Of Benjamin
>>> Haibe-Kains
>>> Sent: 06 December 2004 11:05
>>> To: Bioconductor Mailing List
>>> Subject: [BioC] hclust and (Eisen+ de Hoon) cluster3 program
>>>
>>>
>>> Hi all,
>>>
>>> I have a problem with the R function 'hclust'. I have noticed 
>>> differences in clustering when I use the 'centroid' cluster method 
>>> with 'hclust' and the cluster3 program (see M. Eisen and M. de Hoon).
>>>
>>> Have you noticed some differences too ?
>>>
>>> I use
>>>
>>> hclust from library 'stats' (Built: R 2.0.1; i386-pc-linux-gnu; 
>>> 2004-11-15 15:56:06; unix)
>>> cluster 3.0 using C Clustering Library version 1.25
>>>
>>> Thanks a lot
>>>
>>>
>>>
>>
>> -- 
>> Benjamin Haibe-Kains
>> [http://www.ulb.ac.be/di/map/bhaibeka/]
>>
>>
>> PhD student in the Machine Learning Group (MLG)
>>     [http://www.ulb.ac.be/di/mlg/]
>> Universite Libre de Bruxelles (ULB)
>> E-mail: bhaibeka at ulb.ac.be MicroArray Unity     
>> [http://www.bordet.be/servmed/array/index.htm]
>> Institut Jules Bordet (IJB)
>> E-mail: benjamin.haibekains at bordet.be
>>
>> _______________________________________________
>> Bioconductor mailing list
>> Bioconductor at stat.math.ethz.ch
>> https://stat.ethz.ch/mailman/listinfo/bioconductor
>>
>>
>
> Charles C. Berry                        (858) 534-2098
>                                          Dept of Family/Preventive 
> Medicine
> E mailto:cberry at tajo.ucsd.edu             UC San Diego
> http://hacuna.ucsd.edu/members/ccb.html  La Jolla, San Diego 92093-0717
>
>
>

-- 
Benjamin Haibe-Kains
	[http://www.ulb.ac.be/di/map/bhaibeka/]


PhD student in the Machine Learning Group (MLG)
	[http://www.ulb.ac.be/di/mlg/]
Universite Libre de Bruxelles (ULB)
E-mail: bhaibeka at ulb.ac.be 

MicroArray Unity 
	[http://www.bordet.be/servmed/array/index.htm]
Institut Jules Bordet (IJB)
E-mail: benjamin.haibekains at bordet.be



More information about the Bioconductor mailing list