[R] boxplot question
Dermot MacSweeney
dsweeney at nmrc.ucc.ie
Sun Jan 7 19:51:54 CET 2001
Hi All,
As a follow-on from the mail below, it is possible to have boxplots based on
means and variance for more than one input, i.e.,
bxp(input1,input2,....)
Thanks in Advance,
Dermot
******** Previous Mail on this subject ***********
>I have a couple more questions about boxplots. In the books I've read on
>statistics (I'm not a statistics expert but just a poor engineer trying
>to help his wife with the analysis of the data she gathered for her
>medieval history PhD), I seen two kinds of box plots: one using
>quartiles to determine the height of the rectangles and whiskers which
>is what the standard R boxplot does, from what I read in the doc, and
>another one using the the variance (rectangle height = 2s, whiskers
>going from +1s to +2s and -1s to -2s). Is there a way to produce such
>boxplot with R ?
Yes, by using the bxp() function on a data structure containing the relevant
values:
#
# using a standard R dataset
#
data(ToothGrowth)
attach(ToothGrowth)
#
# standard boxplot
#
boxplot(len)
#
# get stats for the standard boxplot
#
bp.stats <- boxplot.stats(len)
#
# have a look at the structure (list) returned by boxplot.stats()
#
bp.stats
#
# we can recreate this structure but with means and SDs ...
#
bp.limits <- as.matrix(c(mean(len) - 2 * sd(len),
mean(len) - sd(len),
mean(len),
mean(len) + sd(len),
mean(len) + 2 * sd(len)))
bp.stats <- list(stats = bp.limits, n = length(len), conf = NULL,
out = numeric(0))
#
# and plot it ...
#
bxp(bp.stats)
This could be rolled up into a function ... in essence you create a structure
(list)
that bxp() can use. Watch out for NA values in the mean(), and sd().
I think that there is an error in the R documentation about the bxp() structure
...
$stats should be a matrix not a vector ... but I may have that wrong.
>Second question: I would like to ponderate the plotted variable with
>another variable. To take an example, the grouping factor A is a
>production city, X is a unitary price and Y is a quantity bought from
>city Ai at price Xi. I would like to make a boxplot of the prices
>ponderated by the quantity bought. Since it sounds like a fairly
>standard visualization, I suppose ther's an easy way to do this with R.
This looks like a problem for coplot() if X and Y are continuous and you want
scatters
of them by levels of factors. With boxplot() you need to use the 'interaction'
term in
the formula you supply to boxplot. With a standard R dataset:
data(ToothGrowth)
boxplot(len~interaction(supp, dose), data = ToothGrowth)
You could make it clearer with by adding colour:
boxplot(len~interaction(supp, dose), data = ToothGrowth, col = c(7,3))
It only works with data.frames (I think).
**************************************************************
-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
More information about the R-help
mailing list