[R] Random Forest - Strata

Coll gbcoll2 at gmail.com
Tue Jul 20 17:48:04 CEST 2010


Hi all,

Had struggled in getting "Strata" in randomForest to work on this. 

Can I get randomForest for each of its TREE, to get ALL sample from some
strata to build tree, while leaving some strata TOTALLY untouched as oob?

e.g. in below, how I can tell RF to, 
- for tree 1 in the forest, to use only Site A and B to build the tree,
while using the WHOLE Site C data for the oob error rate,
- for tree 2, use only site A and C to build tree, while using whole site B
data for oob
- for tree 3, use Site B and C, A as oob...?

My command does not work as it would use some sample in all of the sites:
rforest.obj <- randomForest(Presence.f ~., data=dataset.subset, strata =
site.factor)

while 
the setting the corresponding "sampsize" argument seems would only screen
out the Site in all tree building...

Site	Presence	  Length	  Sulphur
A	        Yes	       3.50	        19.42
A	        No	        3.90	        51.09
A	        No	        3.60	        26.75
B	        Yes	       2.60	        9.71
B	        No	        2.20	        9.77
B	        No	        2.60	        8.60
B	        No	        3.00	        35.59
C	        Yes	       3.50	        16.07
C	        No	        3.40	        49.96
C	        No	        3.10	        35.35

Any idea / comments are welcomed.

Thanks in advance.

Coll
-- 
View this message in context: http://r.789695.n4.nabble.com/Random-Forest-Strata-tp2295731p2295731.html
Sent from the R help mailing list archive at Nabble.com.



More information about the R-help mailing list