[R] Selecting columns whose names contain "mutated" except when they also contain "non" or "un"
Paul Miller
pjmiller_57 at yahoo.com
Tue Apr 24 15:40:35 CEST 2012
Hi Greg,
This is quite helpful. Not so good yet with regular expressions in general or Perl-like regular expressions. Found the help page though, and think I was able to determine how the code works as well as how I would select only instances where "muta" is preceeded by either "non" or "un".
> (tmp <- c('mutation','nonmutated','unmutated','verymutated','other'))
[1] "mutation" "nonmutated" "unmutated" "verymutated" "other"
> grep("(?<!un)(?<!non)muta", tmp, perl=TRUE)
[1] 1 4
> grep("(?!muta)non|un", tmp, perl=TRUE)
[1] 2 3
Did I get the second grep right?
If so, do you have any sense of why it seems to fail when I apply it to my data?
> KRASyn$NonMutant_comb <- rowSums(KRASyn[grep("(?!muta)non|un", names(KRASyn), perl=TRUE)])
Error in rowSums(KRASyn[grep("(?!muta)non|un", names(KRASyn), perl = TRUE)]) :
'x' must be numeric
Thanks,
Paul
More information about the R-help
mailing list