[R-pkg-devel] CRAN rules re. web scraping?
Spencer Graves
spencer.graves using effectivedefense.org
Fri Jan 24 00:50:29 CET 2020
Thanks very much to Iñaki Ucar, Adam H Sparks, and Roy
Mendelssohn for their replies that helped me understand what I needed to
do to fix problems identified in the CRAN Checks. I believe that those
problems are now fixed in the development version of Ecfun available at
"https://github.com/sbgraves237/Ecfun". The package still needs more
work, but I will make Prof. Ripley's Feb. 4 deadline.
Thanks again,
Spencer Graves
On 2020-01-23 01:55, Iñaki Ucar wrote:
> On Thu, 23 Jan 2020 at 02:49, Spencer Graves
> <spencer.graves using effectivedefense.org> wrote:
>> Hello, All:
>>
>>
>> GOOD NEWS AND BAD NEWS:
>>
>>
>> * First the good news: I heard from Brian Ripley; see below.
>> His web site says, "He retired in August 2014 on grounds of ill health."
>> (http://www.stats.ox.ac.uk/~ripley/) I was pleased to see that he seems
>> to be well enough to send me the email below.
>>
>>
>> * BAD NEWS: My Ecfun package is violating current CRAN rules
>> regarding "not writing anywhere in the file space". (See below.)
>>
>>
>> QUESTION:
>>
>>
>> How do you suggest I respond to this?
>>
>>
>> It's hard for me to fix, because I cannot replicate the error and
>> I don't understand the rules Prof. Ripley is trying to enforce. The
>> "CRAN Package Check Results for" this package show an error on 1
>> platform (r-devel-linux-x86_64-fedora-gcc), NOTEs on 3 platforms
>> (Fedora-clang and Debian), and "OK" on 9 others. I can program selected
>> tests not to run on CRAN, e.g., with (!fda::CRAN()).
>>
>>
>> However, I suspect I should be able to do better than that.
>>
>>
>> Suggestions?
> The message from Prof. Ripley is crystal-clear, and exposes two issues
> (Internet access, writing files) that have been discussed many times
> in this list. A quick scan of the CRAN policy [1] yields:
>
> - Packages which use Internet resources should fail gracefully with an
> informative message if the resource is not available (and not give a
> check warning nor error).
>
> - Packages should not write in the user’s home filespace (including
> clipboards), nor anywhere else on the file system apart from the R
> session’s temporary directory.
>
> [1] https://cran.r-project.org/web/packages/policies.html
>
> Iñaki
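
A minimal sketch of how the two policy points quoted above might look in R. This is not code from Ecfun: the helper name `fetch_gracefully` is hypothetical, and the sketch assumes the resource is fetched with `utils::download.file()`. It writes only under the session's temporary directory (via `tempfile()`) and returns `NULL` with an informative message, rather than a warning or error, when the resource is unavailable.

```r
# Hypothetical helper (an assumption for illustration, not part of Ecfun):
# download `url` into the R session's temporary directory and fail
# gracefully if the resource cannot be retrieved.
fetch_gracefully <- function(url,
                             destfile = tempfile(fileext = ".html")) {
  ok <- tryCatch({
    # download.file() returns 0 on success. Errors are caught below;
    # warnings are silenced, and a non-zero status also counts as failure.
    suppressWarnings(
      utils::download.file(url, destfile, quiet = TRUE)
    ) == 0
  }, error = function(e) FALSE)

  if (!isTRUE(ok) || !file.exists(destfile)) {
    # Informative message only: no warning, no error.
    message("Resource not available: ", url)
    return(invisible(NULL))
  }
  destfile
}
```

An example or test could then call the helper and continue only when the result is not `NULL`, so that an unreachable site leaves the check clean instead of raising an error.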
>
>> Thanks,
>> Spencer Graves
>>
>>
>> p.s. The development version of this package is available at
>> "https://github.com/sbgraves237/Ecfun".
>>
>>
>> https://cloud.r-project.org/web/checks/check_results_Ecfun.html
>>
>>
>> -------- Forwarded Message --------
>> Subject: CRAN package Ecfun
>> Date: Tue, 21 Jan 2020 21:26:02 +0000
>> From: Prof Brian Ripley <ripley using stats.ox.ac.uk>
>> Reply-To: CRAN <CRAN using r-project.org>
>> To: Spencer Graves <spencer.graves using effectivedefense.org>
>> CC: CRAN <CRAN using r-project.org>
>>
>>
>>
>> This has been intermittently failing its checks for a week: different
>> check runs failed (in the 24h prior to) the 14th, 15th, 17th and today.
>> The current failure is
>>
>> Check: examples
>> Result: ERROR
>> Running examples in ‘Ecfun-Ex.R’ failed
>> The error most likely occurred in:
>>
>> > ### Name: read.testURLs
>> > ### Title: Read a file produced by testURLs
>> > ### Aliases: read.testURLs
>> > ### Keywords: IO
>> >
>> > ### ** Examples
>> >
>> > # Test only 2 web sites, not the default 4,
>> > # and test only twice, not the default 10 times:
>> > tst <- testURLs(c(
>> + PVI="http://en.wikipedia.org/wiki/Cook_Partisan_Voting_Index",
>> + house="http://house.gov/representatives"),
>> + n=2, maxFail=2)
>> 1
>> 1579634784, PVI, TRUE 0.828
>> 1579634785, house, FALSE 0.051
>> 1579634785, house, FALSE 0.048
>> 2
>> 1579634785, PVI, TRUE 0.043
>> 1579634785, house, FALSE 0.11
>> 1579634785, house, FALSE 0.035
>> >
>> > # The above should have created a file 'testURLresults.csv'
>> > # in the working directory. Read it.
>> >
>> > dat <- read.testURLs()
>> Error in read.table(file = file, header = header, sep = sep, quote =
>> quote, :
>> more columns than column names
>> Calls: read.testURLs -> read.csv -> read.table
>>
>> That does not conform to the policy on Internet access, not least as no
>> attempt is made to check if the file was created, let alone that it has
>> the expected layout. Nor does it conform to the policy on not writing
>> anywhere in the file space (and that shows on its CRAN results page too).
>>
>> Please correct ASAP and before Feb 4 to safely retain the package on CRAN.
>>
>> --
>> Brian D. Ripley, ripley using stats.ox.ac.uk
>> Emeritus Professor of Applied Statistics, University of Oxford
>>
>>
>