[R] Improve code efficiency with do.call, rbind and split construction
Jun Shen
jun.shen.ut at gmail.com
Fri Sep 2 19:02:49 CEST 2016
Dear list,
I have the following line of code to extract the last row of each piece
of the split data and bind those rows back together.
do.call(rbind, lapply(split(simout.s1, simout.s1[c('SID','DOSENO')]), function(x) x[nrow(x), ]))
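To illustrate what that line does, here is a minimal sketch on a made-up
data frame. Only the SID and DOSENO column names come from my real data;
the TIME and CONC columns and all the values are assumptions for
illustration.

```r
# Hypothetical toy stand-in for simout.s1: 2 subjects x 2 doses,
# 2 rows per SID/DOSENO combination. TIME and CONC are invented columns.
simout.s1 <- data.frame(
  SID    = c(1, 1, 1, 1, 2, 2, 2, 2),
  DOSENO = c(1, 1, 2, 2, 1, 1, 2, 2),
  TIME   = c(0, 12, 24, 36, 0, 12, 24, 36),
  CONC   = c(0.0, 1.5, 0.8, 2.1, 0.0, 1.2, 0.6, 1.9)
)

# Split into one data frame per SID/DOSENO combination, keep the last
# row of each piece, then stack the pieces back into one data frame.
last.rows <- do.call(
  rbind,
  lapply(split(simout.s1, simout.s1[c("SID", "DOSENO")]),
         function(x) x[nrow(x), ])
)

print(last.rows)
```

On the toy data this gives one row per SID/DOSENO combination, 4 rows in
all, each being the last row of its group.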
The problem is that with a huge dataset it takes too long to run
(it has been more than 3 hours and it is still running).
The dataset is pretty big. I have 200,000 unique SID values and 4 DOSENO
values, so 800,000 split pieces in total. Is there any way to speed it up? Thanks.
Jun