[R] rowSums()
Doran, Harold
HDoran at air.org
Wed Sep 24 16:06:09 CEST 2008
Say I have the following data:
testDat <- data.frame(A = c(1,NA,3), B = c(NA, NA, 3))
> testDat
A B
1 1 NA
2 NA NA
3 3 3
rowsums() with na.rm=TRUE generates the following, which is not desired:
> rowSums(testDat[, c('A', 'B')], na.rm=T)
[1] 1 0 6
rowsums() with na.rm=F generates the following, which is also not
desired:
> rowSums(testDat[, c('A', 'B')], na.rm=F)
[1] NA NA 6
I see why this occurs, but what I hope to have returned would be:
[1] 1 NA 6
To get what I want I could do the following, but normally my ideas are
bad ideas and there are codified and proper ways to do things.
rr <- numeric(nrow(testDat))
for(i in 1:nrow(testDat)) rr[i] <- if(all(is.na(testDat[i,]))) NA else
sum(testDat[i,], na.rm=T)
> rr
[1] 1 NA 6
Is there a "proper" way to do this? In my real data, nrow is over
100,000
Thanks,
Harold
> sessionInfo()
R version 2.7.2 (2008-08-25)
i386-pc-mingw32
locale:
LC_COLLATE=English_United States.1252;LC_CTYPE=English_United
States.1252;LC_MONETARY=English_United
States.1252;LC_NUMERIC=C;LC_TIME=English_United States.1252
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] MiscPsycho_1.2 lattice_0.17-13 statmod_1.3.6
loaded via a namespace (and not attached):
[1] grid_2.7.2
More information about the R-help
mailing list