[Rd] Memory leakage/violation?
Henrik Bengtsson
hb at maths.lth.se
Sat Aug 27 12:09:09 CEST 2005
Thank you Thomas and Uwe for this.
This is really odd, because today I can neither reproduce it myself (I
rebooted my computer this morning). I try to recall what I did
yesterday: I did _not_ reboot my machine, but I did install the latest
binaries of R 2.1.1 and 2.2.0dev from CRAN. I did close all R sessions
and restarted them by R --vanilla (only one at the time) and got the
same errors over and over for hours (trust me, I was really frustrated
with my analysis). Maybe a reboot would have solved it - I should know
better and try that first! A worse scenario is that the hardware was
overheated or starts to fall apart.
To answer you question Uwe, I don't know about the compiler settings
since I got the pre-build binaries from CRAN.
Best wishes
Henrik
Uwe Ligges wrote:
> Thomas Lumley wrote:
>
>> I can't reproduce this on R2.2.0dev on Windows XP (in a few hundred
>> tries), or running under Valgrind on AMD64 Linux (in four or five tries).
>
>
> Cannot reproduce either (using R-2.1.1 and an older version of R-devel,
> though). Maybe a compiler issue?
> Henrik, do you use exactly the compiler set up mentioned in the manuals?
> Which version of gcc? Did your emember to replace the f771.exe?
>
> Uwe
>
>
>
>> -thomas
>>
>>
>> On Fri, 26 Aug 2005, Henrik Bengtsson wrote:
>>
>>
>>> Hi,
>>>
>>> I've spotted a possible memory leakage/violation in the latest R v2.1.1
>>> patched and R v2.2.0dev on Windows XP Pro SP2 Eng.
>>>
>>> I first caught it deep down in a nested svd algorithm when subtracting a
>>> double 'c' from a integer vector 'a' where both had finite values but
>>> when assigning 'a <- a - c' would report NaNs whereas (a-c) alone would
>>> not. Different runs with the identical data would introduce NaNs at
>>> random positions, but not all the time.
>>>
>>> Troubleshooting is after a couple of hours still at v0.5, but here is a
>>> script that generates the strange behavior on the above R setups. I let
>>> the script speak for itself. Note that both the script 'strange.R' and
>>> the data 'strange.RData' is online too, see code below.
>>>
>>> People on other systems (but also on Windows), could you please try it
>>> and see if you can reproduce what I get.
>>>
>>> Cheers
>>>
>>> Henrik
>>>
>>>
>>> # The following was tested on: Windows XP Pro SP2 Eng with
>>> # i) R Version 2.1.1 Patched (2005-08-25)
>>> # ii) R 2.2.0 Under development (unstable) (2005-08-25 r35394M)
>>>
>>> # Start 'R --vanilla' and source() this script, i.e.
>>> # source("http://www.maths.lth.se/help/R/strange.R")
>>> # If you do not get any errors, retry a few times.
>>>
>>> foo <- function(x) {
>>> print(list(
>>> name=as.character(substitute(x)),
>>> storage.mode=storage.mode(x),
>>> na=any(is.na(x)),
>>> nan=any(is.nan(x)),
>>> inf=any(is.infinite(x)),
>>> ok=all(is.finite(a))
>>> ))
>>> print(length(x))
>>> print(summary(x))
>>> }
>>>
>>> # Load data from a complicated "non-reproducible" algorithm.
>>> # The below errors occur also when data is not
>>> # saved and then reloaded from file. Data was generated in
>>> # R v2.1.1 patched (see above).
>>> if (file.exists("strange.RData")) {
>>> load("strange.RData")
>>> } else {
>>> load(url("http://www.maths.lth.se/help/R/strange.RData"))
>>> }
>>>
>>> # First glance at data...
>>> foo(a)
>>> foo(c)
>>>
>>> ## $name
>>> ## [1] "a"
>>> ##
>>> ## $storage.mode
>>> ## [1] "integer"
>>> ##
>>> ## $na
>>> ## [1] FALSE
>>> ##
>>> ## $nan
>>> ## [1] FALSE
>>> ##
>>> ## $inf
>>> ## [1] FALSE
>>> ##
>>> ## $ok
>>> ## [1] TRUE
>>> ##
>>> ## [1] 15000
>>> ## Min. 1st Qu. Median Mean 3rd Qu. Max.
>>> ## 41.0 51.0 63.0 292.2 111.0 65170.0
>>> ## $name
>>> ## [1] "c"
>>> ##
>>> ## $storage.mode
>>> ## [1] "double"
>>> ##
>>> ## $na
>>> ## [1] FALSE
>>> ##
>>> ## $nan
>>> ## [1] FALSE
>>> ##
>>> ## $inf
>>> ## [1] FALSE
>>> ##
>>> ## $ok
>>> ## [1] TRUE
>>> ##
>>> ## [1] 1
>>> ## Min. 1st Qu. Median Mean 3rd Qu. Max.
>>> ## 53.43 53.43 53.43 53.43 53.43 53.43
>>> ##
>>>
>>> # But, trying the following, will result in
>>> # no-reproducible error messages. Sometimes
>>> # it errors at kk==1, sometimes at kk >> 1.
>>> # Also, look at the different output for
>>> # different kk:s.
>>> for (kk in 1:100) {
>>> cat("kk=",kk, "\n")
>>> print(summary(a-c))
>>> }
>>>
>>> ## kk= 1
>>> ## Min. 1st Qu. Median Mean 3rd Qu.
>>> Max.
>>> ## -7.741e+307 -2.431e+00 9.569e+00 5.757e+01
>>> ## kk= 2
>>> ## Min. 1st Qu. Median Mean 3rd Qu. Max.
>>> ## -12.430 -2.431 9.569 238.700 57.570 65120.000
>>> ## kk= 3
>>> ## Min. 1st Qu. Median Mean 3rd Qu. Max.
>>> ## -12.430 -2.431 9.569 57.570 65120.000
>>> ## kk= 4
>>> ## Min. 1st Qu. Median Mean 3rd Qu. Max.
>>> ## -12.430 -2.431 9.569 238.700 57.570 65120.000
>>> ## kk= 5
>>> ## Min. 1st Qu. Median Mean 3rd Qu. Max.
>>> ## -12.430 -2.431 9.569 238.700 57.570 65120.000
>>> ## kk= 6
>>> ## Error in quantile.default(object) : missing values and NaN's
>>> ## not allowed if 'na.rm' is FALSE
>>>
>>>
>>> ## Comments: If you shorten down 'a', the bug occurs less frequently.
>>>
>>> ______________________________________________
>>> R-devel at r-project.org mailing list
>>> https://stat.ethz.ch/mailman/listinfo/r-devel
>>>
>>
>>
>> Thomas Lumley Assoc. Professor, Biostatistics
>> tlumley at u.washington.edu University of Washington, Seattle
>>
>> ______________________________________________
>> R-devel at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-devel
>
>
>
More information about the R-devel
mailing list