[R] Different results in different runs with identical parametersin CLARA
TEMPL Matthias
Matthias.Templ at statistik.gv.at
Fri Jun 10 13:05:02 CEST 2005
> Dear All R Friends,
> When I run my data in any time with the below codes, I
> receive different results.
Of course. See in
L. Kaufman and P. Rousseeuw. Finding Groups in Data. John
Wiley & Sons, Inc, 1990.
There is a "random part" in clara.
> My data , k , samples, trace are
> identical in any run.
>
> c<- clara(mydata,4, metric= " euclidean " , stand= TRUE,
> samples=5 , trace=3, keep.data=TRUE , rngR=TRUE)
>
> result of first try:
> Average silhouette width per cluster: 0.5881658
> result of second try:
> Average silhouette width of best sample: 0.6294549
> result of third try:
> Average silhouette width of best sample: 0.6609939
> ...
> I think that only best sample changes in any run.
> The question is here:
> Which try ( or run) is optimal? How many try do I need to
> achive to optimal case? Is it reliable ? Best Regards, Amir
>
See it as *Explorative Data Analysis*. Each of your different
results give you additional ideas of the structure of your data.
Best,
Matthias
More information about the R-help
mailing list