[R] randomForest speed improvements

apresley anthony at resolution.com
Wed Jan 5 00:30:23 CET 2011


Thanks for the reply.  I had no idea I could combine them back ... that
actually will work pretty well.  We can have several "worker threads" load
up the RF's on different machines and/or cores, and then re-assemble them. 
RMPI might be an option down the road, but would be a bit of overhead for us

Using the method of combine() ... I was able to drastically reduce the
amount of time to build randomForest objects.  IE, using about 25,000 rows
(6 columns), it takes maybe 5 minutes on my laptop.  Using 5 randomForest
objects (each with 5k rows), and then combining them, takes < 1 minute.

View this message in context: http://r.789695.n4.nabble.com/randomForest-speed-improvements-tp3172523p3174621.html
Sent from the R help mailing list archive at Nabble.com.

More information about the R-help mailing list