[R] randomForest speed improvements
anthony at resolution.com
Wed Jan 5 00:30:23 CET 2011
Thanks for the reply. I had no idea I could combine them back ... that
actually will work pretty well. We can have several "worker threads" load
up the RF's on different machines and/or cores, and then re-assemble them.
RMPI might be an option down the road, but would be a bit of overhead for us
Using the method of combine() ... I was able to drastically reduce the
amount of time to build randomForest objects. IE, using about 25,000 rows
(6 columns), it takes maybe 5 minutes on my laptop. Using 5 randomForest
objects (each with 5k rows), and then combining them, takes < 1 minute.
View this message in context: http://r.789695.n4.nabble.com/randomForest-speed-improvements-tp3172523p3174621.html
Sent from the R help mailing list archive at Nabble.com.
More information about the R-help