[R] abbreviate or wrap dimname labels
Marc Schwartz
MSchwartz at MedAnalytics.com
Fri Apr 15 19:29:51 CEST 2005
On Fri, 2005-04-15 at 12:12 -0400, Michael Friendly wrote:
> For a variety of displays (mosaicplots, barplots, ...)
> one often wants to either abbreviate or wrap long labels,
> particularly when these are made up of several words.
> In general, it would be nice to have a function,
>
> abbreviate.or.wrap <-
> function(x, maxlength=10, maxlines=2, split=" ") {
> }
>
> that would take a character vector or a list of vectors, x,
> and try to abbreviate or wrap them to fit approximately
> the maxlength and maxlines constraints, using the split
> argument to specify allowable characters to wrap to multiple
> lines.
>
> For example, this two-way table has dimnames too long to
> be displayed nicely in a mosaicplot:
>
> > library(catspec)
> > library(vcd)
> >
> > data(FHtab)
> > FHtab<-as.data.frame(FHtab)
> >
> > xtable <- xtabs(Freq ~ .,FHtab)
> > lab <- dimnames(xtable)
> > lab
> $OccFather
> [1] "Upper nonmanual" "Lower nonmanual" "Upper manual" "Lower manual"
> [5] "Farm"
>
> $OccSon
> [1] "Upper nonmanual" "Lower nonmanual" "Upper manual" "Lower manual"
> [5] "Farm"
>
> abbreviate works here, but gives results that aren't very readable:
>
> > lapply(lab, abbreviate, 8)
> $OccFather
> Upper nonmanual Lower nonmanual Upper manual Lower manual Farm
> "Upprnnmn" "Lwrnnmnl" "Uppermnl" "Lowermnl"
> "Farm"
>
> $OccSon
> Upper nonmanual Lower nonmanual Upper manual Lower manual
> Farm
> "Upprnnmn" "Lwrnnmnl" "Uppermnl" "Lowermnl"
> "Farm"
>
> In a related thread, Marc Schwartz proposed a solution for wrapping
> labels, based on
>
> >short.labels <- sapply(labels, function(x) paste(strwrap(x,
> 10), collapse = "\n"), USE.NAMES = FALSE)
>
> But, my attempt to use strwrap in my context gives a single string
> for each set of dimension names:
>
> > stack.lab <-function(x) { paste(strwrap(x,10), collapse = "\n") }
> > lapply(lab, stack.lab)
> $OccFather
> [1] "Upper\nnonmanual\nLower\nnonmanual\nUpper\nmanual\nLower\nmanual\nFarm"
>
> $OccSon
> [1] "Upper\nnonmanual\nLower\nnonmanual\nUpper\nmanual\nLower\nmanual\nFarm"
>
> For my particular example, I can do what I want with gsub, but it is
> hardly general:
>
> > lab[[1]] <- gsub(" ","\n", lab[[1]])
> > lab[[2]] <- lab[[1]] # cheating: I know it's a square table
> > lab
> $OccFather
> [1] "Upper\nnonmanual" "Lower\nnonmanual" "Upper\nmanual"
> "Lower\nmanual"
> [5] "Farm"
>
> $OccSon
> [1] "Upper\nnonmanual" "Lower\nnonmanual" "Upper\nmanual"
> "Lower\nmanual"
> [5] "Farm"
>
> > dimnames(xtable) <- lab
>
> Then,
> mosaicplot(xtable, shade=TRUE)
> gives a nice display!
>
> Can anyone help with a more general solution for wrapping labels
> or abbreviate.or.wrap()?
>
> thanks,
> -Michael
Michael,
This is not completely generic (I have not used abbreviate() here) and
it could take some further fine tuning and perhaps even consideration of
creating a generic method. However, a possible solution to the problem
of using my previous approach on a list object and giving some
flexibility to also handle vectors:
# Core wrapping function
wrap.it <- function(x, len)
{
sapply(x, function(y) paste(strwrap(y, len),
collapse = "\n"),
USE.NAMES = FALSE)
}
# Call this function with a list or vector
wrap.labels <- function(x, len)
{
if (is.list(x))
{
lapply(x, wrap.it, len)
} else {
wrap.it(x, len)
}
}
Thus, for your labels in a list:
> wrap.labels(lab, 10)
$OccFather
[1] "Upper\nnonmanual" "Lower\nnonmanual" "Upper\nmanual"
[4] "Lower\nmanual" "Farm"
$OccSon
[1] "Upper\nnonmanual" "Lower\nnonmanual" "Upper\nmanual"
[4] "Lower\nmanual" "Farm"
and for the example vector in my prior post:
> labels <- factor(paste("This is a long label ", 1:10))
> wrap.labels(labels, 10)
[1] "This is\na long\nlabel 1" "This is\na long\nlabel 2"
[3] "This is\na long\nlabel 3" "This is\na long\nlabel 4"
[5] "This is\na long\nlabel 5" "This is\na long\nlabel 6"
[7] "This is\na long\nlabel 7" "This is\na long\nlabel 8"
[9] "This is\na long\nlabel 9" "This is\na long\nlabel 10"
To incorporate abbreviate() here, you could perhaps modify the
wrap.labels() syntax to use a "wrap = TRUE/FALSE" argument to explicitly
indicate which approach you want, or perhaps develop some decision tree
approach to automate the process.
HTH,
Marc Schwartz
More information about the R-help
mailing list