[R] misbehavior with extract_numeric() from tidyr
arnaud gaboury
arnaud.gaboury at gmail.com
Mon Apr 20 20:57:20 CEST 2015
On Mon, Apr 20, 2015 at 6:09 PM, William Dunlap <wdunlap at tibco.com> wrote:
> The hyphen without a following digit confuses tidyr::extract_numeric().
> E.g.,
> > extract_numeric("23 ft-lbs")
> Warning message:
> In extract_numeric("23 ft-lbs") : NAs introduced by coercion
> [1] NA
> > extract_numeric("23 ft*lbs")
> [1] 23
>
See[0] for the reason on the minus in the regex. It is not a bug but a wish.
I am honestly very surprised the maintainer decided to go with such a so
simple solution for negative numbers.
[0]https://github.com/hadley/tidyr/issues/20
Contact the BugReports address for the package
> > packageDescription("tidyr")$BugReports
> [1] "https://github.com/hadley/tidyr/issues"
> or package's maintainer
> > maintainer("tidyr")
> [1] "Hadley Wickham <hadley at rstudio.com>"
> to report problems in a user-contributed package.
>
>
>
> Bill Dunlap
> TIBCO Software
> wdunlap tibco.com
>
> On Mon, Apr 20, 2015 at 12:10 AM, arnaud gaboury <arnaud.gaboury at gmail.com
> > wrote:
>
>> R 3.2.0 on Linux
>> --------------------------------
>>
>> library(tidyr)
>>
>> playerStats <- c("LVL 10", "5,671,448 AP l6,000,000 AP", "Unique
>> Portals Visited 1,038",
>> "XM Collected 15,327,123 XM", "Hacks 14,268", "Resonators Deployed
>> 11,126",
>> "Links Created 1,744", "Control Fields Created 294", "Mind Units
>> Captured 2,995,484 MUs",
>> "Longest Link Ever Created 75 km", "Largest Control Field 189,731 MUs",
>> "XM Recharged 3,006,364 XM", "Portals Captured 1,204", "Unique Portals
>> Captured 486",
>> "Resonators Destroyed 12,481", "Portals Neutralized 1,240", "Enemy
>> Links Destroyed 3,169",
>> "Enemy Control Fields Destroyed 1,394", "Distance Walked 230 km",
>> "Max Time Portal Held 240 days", "Max Time Link Maintained 15 days",
>> "Max Link Length x Days 276 km-days", "Max Time Field Held 4days",
>> "Largest Field MUs x Days 83,226 MU-days")
>>
>>
>> -----------------------------------------------------------------------------------------------
>> extract_numeric(playerStats)
>> [1] 10 56714486000000 1038 15327123
>> 14268 11126 1744 294 2995484
>> [10] 75 189731 3006364 1204
>> 486 12481 1240 3169 1394
>> [19] 230 240 15 NA
>> 4 NA
>>
>>
>> ------------------------------------------------------------------------------------------------
>> playerStats[c(22,24)]
>> [1] "Max Link Length x Days 276 km-days" "Largest Field MUs x
>> Days 83,226 MU-days"
>>
>> --------------------------------------------------------------------------------------------
>>
>> I do not understand why these two vectors return NA when the function
>> extract_numeric() works well for others,
>>
>> Any wrong settings in my env?
>>
>> Thank you for hints.
>>
>>
>>
>> --
>>
>> google.com/+arnaudgabourygabx
>>
>> ______________________________________________
>> R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide
>> http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
>>
>
>
--
google.com/+arnaudgabourygabx
<https://plus.google.com/_/notifications/emlink?emr=05814804238976922326&emid=CKiv-v6PvboCFcfoQgod6msAAA&path=%2F116159236040461325607%2Fop%2Fu&dt=1383086841306&ub=50>
[[alternative HTML version deleted]]
More information about the R-help
mailing list