[BioC] Why daisy() in cluster library failed to exclude NA when computing dissimilarity
Gundala Viswanath
gundalav at gmail.com
Sun Dec 8 08:11:12 CET 2013
Hi,
According to daisy function from cluster documentation, it can compute
dissimilarity when NA (missing) value(s) is present.
http://stat.ethz.ch/R-manual/R-devel/library/cluster/html/daisy.html
But why when I tried this code
library(cluster)
x <- c(1.115,NA,NA,0.971,NA)
y <- c(NA,1.006,NA,NA,0.645)
df <- as.data.frame(rbind(x,y))
daisy(df,metric="gower")
It gave this message:
Dissimilarities :
x
y NA
Metric : mixed ; Types = I, I, I, I, I
Number of objects : 2
Warning messages:
1: In min(x) : no non-missing arguments to min; returning Inf
2: In max(x) : no non-missing arguments to max; returning -Inf
I welcome other alternative than gower.
I expect the dissimilarity output gives a non-NA value e.g. 0. What's
the right way to do it?
G.V.
More information about the Bioconductor
mailing list