[R-pkg-devel] Advice for addressing CRAN rejection

Ben Bolker bbo|ker @end|ng |rom gm@||@com
Wed May 14 03:07:43 CEST 2025


    Hmm. The thread you linked to is specifically about 
non-deterministic linear algebra, for which the solution is to disable 
threaded computations. I don't think CRAN multithreads by default (and I 
don't know whether they test on MKL at all ...?)

   Can you provide more specific/concrete examples of the tests? (Again, 
I apologize if there were examples posted up-thread -- I'm too lazy to 
search for them.) I'm not quite sure I understand your comment about

 > Suppose, for example, that X is a symmetric, positive definite
 > matrix. Then identical() will usually distinguish between (X^1/2)^-1 and
 > (X^-1)^1/2 (the kind of thing I want to be able to check) while
 > all.equal() will generally not


    What is X^(1/2)? (There are infinitely many ways to take a matrix 
square root ...) Interpreting X^(1/2) as chol(X) and X^(-1) as solve(X), 
the two expressions are not even close:

 > set.seed(101); m <- crossprod(matrix(rnorm(9), 3, 3))
 > all.equal(solve(chol(m)), chol(solve(m)))
[1] "Mean relative difference: 0.6655765"
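
   As a rough sketch of the comparison being asked about, assume instead that X^(1/2) means the symmetric square root built from the eigendecomposition. The helper sym_sqrt() below is hypothetical and is not taken from the package under discussion. Under that reading the two expressions are mathematically equal, so all.equal() should accept them, while identical() would be expected to reject them on most platforms because the floating-point operations are carried out in a different order:

```r
set.seed(101)
m <- crossprod(matrix(rnorm(9), 3, 3))  # symmetric positive definite (almost surely)

# Hypothetical helper: symmetric square root via the eigendecomposition
sym_sqrt <- function(x) {
  e <- eigen(x, symmetric = TRUE)
  e$vectors %*% diag(sqrt(e$values)) %*% t(e$vectors)
}

a <- solve(sym_sqrt(m))  # (X^(1/2))^(-1)
b <- sym_sqrt(solve(m))  # (X^(-1))^(1/2)

isTRUE(all.equal(a, b))  # agreement up to the default tolerance
identical(a, b)          # bitwise agreement; typically FALSE, not guaranteed either way
```

   Whether identical() returns FALSE here depends on the platform and BLAS/LAPACK in use, which is exactly why it is fragile as a test.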


   In general, "convenience shortcuts" that rearrange a floating-point 
computation in any way **cannot** be guaranteed to give identical 
results; this is a corollary of

https://cran.r-project.org/doc/FAQ/R-FAQ.html#Why-doesn_0027t-R-think-these-numbers-are-equal_003f

See also 
https://stackoverflow.com/questions/9508518/why-are-these-numbers-not-equal/9508558#9508558

(e.g., floating-point addition is not associative)
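
   A minimal illustration of the non-associativity point, using only base R on standard IEEE double-precision arithmetic (the variable names are just for illustration):

```r
# Same three numbers, same mathematical sum, different grouping
x <- (0.1 + 0.2) + 0.3
y <- 0.1 + (0.2 + 0.3)

identical(x, y)          # FALSE: the two results differ in the last bit
isTRUE(all.equal(x, y))  # TRUE: equal up to the default tolerance
```

   Any shortcut that regroups or reorders the arithmetic can hit the same effect, even when the underlying logic is correct.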

   I apologize if this sounds basic/is telling you something you already 
know, but from what I can understand of your questions so far, you are 
asking for something that is not possible in general.

   Can you clarify further please?

   cheers
    Ben Bolker



On 5/13/25 15:08, smallepsilon wrote:
> Ben,
> 
> The thread to which I alluded is here: https://stat.ethz.ch/pipermail/r-help/2025-May/480866.html
> 
> Further clarification: The package provides some convenience shortcuts for the user which should run the same calculations as their longer counterparts. I want to use identical() to provide strong evidence that this is happening. Suppose, for example, that X is a symmetric, positive definite matrix. Then identical() will usually distinguish between (X^1/2)^-1 and (X^-1)^1/2 (the kind of thing I want to be able to check) while all.equal() will generally not (unless I set the tolerance sufficiently low, but that is just making all.equal() behave more like identical()). Using all.equal() helps detect catastrophic errors, but those would be detected in other tests already.
> 
> Thanks,
> 
> Jesse
> 
> 
> On Tuesday, May 13th, 2025 at 1:41 PM, Ben Bolker <bbolker using gmail.com> wrote:
> 
>> Can you please clarify (maybe by linking back to an earlier thread, don't remember if you discussed this previously) what you mean by "I realized that because all.equal() does not test (even as a proxy) that the same calculations were done"?
>>
>>
>> On Tue, May 13, 2025, 1:05 PM smallepsilon <smallepsilon using proton.me> wrote:
>>
>>> I have been trying to fix some issues with my package's testing on CRAN, which culminated in a rejection email from a CRAN administrator that I am not sure how to address.
>>>
>>> The first issues arose with MKL. (I got helpful information about that recently on r-help.) In many package tests, I want to verify that two ways of specifying something lead to the execution of exactly the same calculations. I use identical() as a proxy for this, but because numeric results are not necessarily reproducible when using parallel processing, this does not work on all platforms.
>>>
>>> My initial attempts to address this involved replacing the offending identical() calls with all.equal() calls. After two or three such attempts, I realized that because all.equal() does not test (even as a proxy) that the same calculations were done, it is impractical and unnecessary to run these tests on all of the CRAN platforms. I moved the original test files to a separate folder on my computer so I can run them all locally. (My assumption is that if the logic is correct on my computer, then it is correct on all of them, and identical() helps verify this.) In the newest package version uploaded to CRAN, I included the tests that verify the essential functionality of the package so that the crucial output values are the same on all platforms, up to a reasonable number of significant digits. These are the tests that are clearly important to run on all platforms.
>>>
>>> My submission was rejected, not because of test failures, but because I had "removed the failing tests which is not the idea of tests." No errors/warnings/notes were reported to me. The only option I have been given is to replace identical() with all.equal(), which defeats the purpose of these particular tests.
>>>
>>> I replied to the administrator's email with a brief version of all of this, but have not gotten a response. Any advice on what else I could do would be appreciated.
>>>
>>> Thanks,
>>>
>>> Jesse
>>>
>>> ______________________________________________
>>> R-package-devel using r-project.org mailing list
>>> https://stat.ethz.ch/mailman/listinfo/r-package-devel

-- 
Dr. Benjamin Bolker
Professor, Mathematics & Statistics and Biology, McMaster University
Director, School of Computational Science and Engineering
* E-mail is sent at my convenience; I don't expect replies outside of 
working hours.


