[R] tapply bug? - levels of a factor in a data frame after tapply are intermixed
Dimitri Liakhovitski
ld7631 at gmail.com
Fri Feb 13 18:09:33 CET 2009
Hello! I have encountered a really weird problem. Maybe you've
encountered it before?
I have a large data frame "importances". It has one factor ($A) with 3
levels: 3, 9, and 15. $B is a regular numeric variable.
Below I am picking a really small sub-frame (just 3 rows) based on
"indices". "indices" were chosen so that all 3 levels of A are
present:
indices=c(14329,14209,14353)
test=data.frame(yy=importances[["B']][indices],xx=importances[["A"]][indices])
Here is what the new data frame "test" looks like:
yy xx
1 -0.009984006 9
2 -2.339904131 3
3 -0.008427385 15
Here is the structure of "test":
>str(test)
'data.frame': 3 obs. of 2 variables:
$ yy: num -0.00998 -2.3399 -0.00843
$ xx: Factor w/ 3 levels "3","9","15": 2 1 3
Notice - the order of factor levels for xx is not 1 2 3 as it should
be but 2 1 3. How come?
Or also look at this:
>test$xx
[1] 9 3 15
Levels: 3 9 15
Same thing.
Do you know what might be the reason?
Thank you very much!
--
Dimitri Liakhovitski
MarketTools, Inc.
Dimitri.Liakhovitski at markettools.com
More information about the R-help
mailing list