[R] Help with a third ggplot error
Upton, Stephen (Steve) (CIV)
@cupton @end|ng |rom np@@edu
Mon Jun 17 14:25:43 CEST 2019
And slightly differently with strcapture (using Jim Lemon's reprex):
mydf <- strcapture("([a-zA-Z ]+)([0-9]+\\.[0-9]+)",scdf$V1,data.frame(Channel="none",Price=0))
change the regex as needed, following Boris' advice.
steve
On 6/17/19, 5:09 AM, "R-help on behalf of Boris Steipe" <r-help-bounces using r-project.org on behalf of boris.steipe using utoronto.ca> wrote:
(Technically you are now thread-hijacking. But here goes:)
mydf <- data.frame(V11 = c("DD Pack0.002",
"FTA English News0.003",
"FTA Complimentary0.004"),
stringsAsFactors = FALSE)
# regex matching start-of-string(letters or blanks)(numbers, a decimal
# point, more numbers)end-of-string: "^([a-zA-Z ]+)([0-9]+\\.[0-9]+)$"
# first check that all elements are matched by the regex. If not, an assumption
# of how the strings are patterned is not true ...
all(grepl("^([a-zA-Z ]+)([0-9]+\\.[0-9]+)$", V11)) # must be true!
mydf$`Channel name` <- gsub("^([a-zA-Z ]+)([0-9]+\\.[0-9]+)$", "\\1", mydf$V11)
mydf$Price <- gsub("^([a-zA-Z ]+)([0-9]+\\.[0-9]+)$", "\\2", mydf$V11)
mydf
# V11 Channel name Price
# 1 DD Pack0.002 DD Pack 0.002
# 2 FTA English News0.003 FTA English News 0.003
# 3 FTA Complimentary0.004 FTA Complimentary 0.004
Note this _will_ give wrong results if channel names like "ABC4" exist.
B.
> On 2019-06-15, at 15:40, Sam Charya via R-help <r-help using r-project.org> wrote:
>
> Hello All,
> I need help with splitting a string. My data frame is in the following format:
> V11 DD Pack0.002 FTA English News0.003 FTA Complimentary0.004 WB1.185 WION1.186 Al Jazeera0.007 Animal Planet2.368 Asianet Movies17.709 Calcutta News0.0010 Comedy Central5.90
> I read the file from a csv and set header = False, hence it is named V1.
> The data consists of names of Channels and their prices. For example: Row 1: Name of the Channel is 'DD Pack' and the Price is 0.00.similarly for Row 5, the name of the Channel is 'WION' and the price is 1.18.
> similarly for Row 8: The name of the Channel is 'Asianet Movies' and the price is 17.70.
>
> My question is: How would I separate the data into 2 columns: One for the Channel name and one for the Price.
> For example. The Heading should be for Col1: 'Channel Name' and for Col2: 'Price'The data under 'Channel Name' should be 'DD Pack' and for 'Price' should be 0.00 and so on and so forth.
> The letters and the numeric appears together so there is no separator and I am not being able to figure this out. Kindly please help resolve this.
> Many Thanks in advance for your help. This is my first question ever to the community so apologies if I have made a mistake in sending it to the wrong group - kindly direct if that is the case.
> sam.
> ********************************************************
>
>
>
>
> [[alternative HTML version deleted]]
>
> ______________________________________________
> R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
______________________________________________
R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
More information about the R-help
mailing list