[R] Sharing levels across multiple factor vectors

Peter Dalgaard pdalgd at gmail.com
Thu Apr 1 10:05:58 CEST 2010


Jeff Brown wrote:
> Sorry for spamming.  I swear I had worked on that problem a long time before
> posting.  
> 
> But I just figured it out: I have to change the values, which are
> represented as integers, not strings.  So the following code will do it:
> 
> df <- data.frame (
> 	a = factor( c( "bob", "alice", "bob" ) ),
> 	b = factor( c( "kenny", "alice", "alice" ) )
> );
> allLevels <- unique( c( levels( df$a ), levels( df$b ) ) )
> for (c in colnames(df)) {
> 	df[,c] <- match( df[,c], allLevels);
> 	levels( df[,c] ) <- 1:(length(allLevels))
> };
> 

Hmm, I think I'd go for something like

allLevels <- unique(unlist(lapply(df,levels)))
df[] <- lapply(df, factor,
levels=allLevels, labels=seq_along(allLevels))


-- 
Peter Dalgaard
Center for Statistics, Copenhagen Business School
Phone: (+45)38153501
Email: pd.mes at cbs.dk  Priv: PDalgd at gmail.com



More information about the R-help mailing list