[R] Newbie question on subsets

jim holtman jholtman at gmail.com
Sun Jan 20 04:52:54 CET 2008

Part of the problem is that 'country' is probably a factor, so the new
subset keeps all the levels that were in the original factor, and
boxplot draws a slot for every level.  Try the following, which removes
the extra levels in the subset:

my.subset <- subset(Forbes2000,
                    country %in% c("United Kingdom", "Germany", "India", "Turkey"))

# remove the extra factor levels
my.subset$country <- my.subset$country[, drop=TRUE]

boxplot(log(marketvalue) ~ country,
  data = my.subset,
  ylab = "log(marketvalue)",
  varwidth = TRUE)
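
If you want to see the effect without the book's data, here is a small sketch on a made-up data frame. The data frame `d` and its values are hypothetical stand-ins for Forbes2000, not the real data set. It shows that subset() keeps the unused levels, and three ways that should give the same reduced set of levels. Note that droplevels() is an assumption about your R version: it only exists in newer releases of R.

```r
# Hypothetical toy data standing in for Forbes2000
d <- data.frame(
  country     = factor(c("Germany", "India", "Turkey", "Japan", "Germany")),
  marketvalue = c(10, 5, 2, 8, 12)
)

# Subsetting rows does not touch the factor's level set
sub <- subset(d, country %in% c("Germany", "India"))
nlevels(sub$country)   # still 4, although only 2 countries remain

# Three ways to drop the unused levels
a  <- sub$country[, drop = TRUE]   # the form used above
b  <- factor(sub$country)          # re-create the factor from its values
c2 <- droplevels(sub)$country      # newer R versions only

levels(a)
```

Whichever form you use, assign the result back to the column before calling boxplot, otherwise the plot still sees the full set of levels.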

On Jan 19, 2008 10:11 PM, Joe Trubisz <jtrubisz at mac.com> wrote:
> Hi...
> I'm working through the book, A Handbook of Statistical Analyses
> using R by Everitt, and I'm trying to do the following (p. 19 of his
> book):
> boxplot(log(marketvalue)~country,
>   data = subset(Forbes2000,
>                 country %in% c("United Kingdom","Germany","India","Turkey")),
>   ylab="log(marketvalue",
>   varwidth=TRUE)
> This *almost* works, but I'm getting ALL the countries on the x-axis,
> not just the 4-specified.
> I tried tinkering with variations in the subset command to no avail.
> Can someone tell me what's wrong/missing with the above command?
> Thanks,
> Joe

Jim Holtman
Cincinnati, OH
+1 513 646 9390

What is the problem you are trying to solve?
