[R] Removing values from a string

arun smartpink111 at yahoo.com
Thu Jul 19 21:58:31 CEST 2012



Hi,

Try this:
one = data.frame(keyword=c("|auto", "NA|auto|insurance|quote",
"NA|auto|insurance",
                           "NA|insurance", "NA|auto|insurance", "<NA>"))

onenew<-data.frame(keyword=gsub("(NA){0,1}\\|","",one$keyword))
onenew1<-data.frame(keyword=gsub("(<NA>){0,1}","",onenew$keyword))
 onenew1
             keyword
1               auto
2 autoinsurancequote
3      autoinsurance
4          insurance
5      autoinsurance
6                   

A.K.




----- Original Message -----
From: Abraham Mathew <abmathewks at gmail.com>
To: r-help at r-project.org
Cc: 
Sent: Thursday, July 19, 2012 3:21 PM
Subject: [R] Removing values from a string

So I have the following data frame and I want to know how I can remove all
"NA" values from each string, and also
remove all "|" values from the START of the string. So they should
something like "auto|insurance" or "auto|insurance|quote"

one = data.frame(keyword=c("|auto", "NA|auto|insurance|quote",
"NA|auto|insurance",
                           "NA|insurance", "NA|auto|insurance", "<NA>"))

one


Can anyone point me in the right direction? I'm still not too familiar with
regex or gsub to find a solution, and there doesn't seem
to be anything helpful in the stringr package for this task.


Thanks

-- 
*Abraham Mathew
Statistical Analyst
www.amathew.com
720-648-0108
@abmathewks*

    [[alternative HTML version deleted]]

______________________________________________
R-help at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.




More information about the R-help mailing list