[R] Pull Stock Symbol Out of String
wdunlap at tibco.com
Tue Apr 8 20:47:47 CEST 2014
The following extracts the last parenthesized sequence of non-parenthesis characters from each string (the parentheses themselves are kept in the result):
> sub(".*(\\([^()]+\\))([^()]*)$", "\\1",
      c("Aetna Inc. (AET)",
        "American Tower Corp(REIT)(ATC)",
        "No Parens",
        "Qwerty Corp (ASD)(ZXC)(123) extra stuff"))
[1] "(AET)"     "(ATC)"     "No Parens" "(123)"
> -----Original Message-----
> From: r-help-bounces at r-project.org [mailto:r-help-bounces at r-project.org] On Behalf
> Of Sparks, John James
> Sent: Tuesday, April 08, 2014 11:29 AM
> To: r-help at r-project.org
> Subject: [R] Pull Stock Symbol Out of String
> Dear R Helpers,
> My regex skills are beginner to intermediate and banging around the web
> has not resulted in a solution to the problem below so I hope that one of
> you who has mad skills can help me out.
> I want to extract the stock ticker--AMT-- out of the string
> American Tower Corporation (REIT) (AMT)
> The presence of the other parenthetical text (REIT) makes this difficult.
> Please note that the string may or may not have an interfering set of
> characters such as the (REIT) so the solution needs to be generalizable to
> the last set of characters that are contained in parentheses in the larger
> string. So an example of a string without the interfering (REIT) would be
> Aetna Inc. (AET)
> Your assistance would be very much appreciated.
> --John Sparks