[Rd] different results on linux and windows

Paul Gilbert pgilbert at bank-banque-canada.ca
Thu May 14 16:20:20 CEST 2009


I cannot say what the problem is in your code, but in general it is 
possible to get the same random sequences from Linux and Windows with 
R's Mersenne Twister generator.  If you generate long sequences (say 10s 
of thousands) and then start doing comparisons involving remainders, 
like differencing the sums of two mean zero sequences, then differences 
between libraries will show up. I have found bigger differences between 
Linux variants than I have between Windows and most Linuxes.

Paul Gilbert

Klaus Nordhausen wrote:
> Dear R experts,
> 
> we are preparing an R-package to compute the Oja Median which contains
> some C++ code in which random numbers are needed. To generate the random
> numbers we use the following Mersenne-Twister implementation:
> 
> // MersenneTwister.h
> // Mersenne Twister random number generator -- a C++ class MTRand
> // Based on code by Makoto Matsumoto, Takuji Nishimura, and Shawn Cokus
> // Richard J. Wagner  v1.0  15 May 2003  rjwagner at writeme.com
> 
> the random seed for the Mersenne-Twister is provided by our R-function
> which gives an (random) integer to  the C++ function srand() which in
> turn sets the seed in the code.
> 
> Using the set.seed in R makes now the results reproducible, but the
> results differ between windows and linux.
> 
> Does anyone know what the problem there is?
> 
> Our suspicion is that the reason is that some libraries are different
> implemented on linux and windows (XP) compilers.
> 
> After the program start we set the seed in row 447(vkm.cpp) with 
> srand(int);
> 
> When the median will be calculated, an intern seed is set with unsigned
>  int seed = rand();   ( in row 100 (vkm.cpp)). This seed will be used
> to calculate some random subsets and to
> create a Mersenne Twister object with MTRand rr(seed);  (row 156, vkm.cpp).
> 
> The MTRand Object rr is called with an unsigned Integer, so the
> important function in the mersenneTwister.h class is in line 87:
> MTRand( const uint32& oneSeed );
> 
> According to that the Random Number Generator uses the methods
> initialize(oneSeed); and reload();  (inside the method, beginning in
> line 215)
> 
> This both methods (line 283 and line 301) are using beside others
> registers. Could it be that there is a different behavior between
> Windows and Linux?
> 
> We do not want to use only srand() since we might need more then the
> number of pseudo random numbers that algorithm can provide.
> 
> For those interested and which would like to see the code, a first
> version of the package, called OjaMedian, is available as source file
> and windows binary on my homepage:
> http://www.uta.fi/~klaus.nordhausen/down.html
> 
> The problem is in the ojaMedian function when the evolutionary algorithm
> is used. Involved C++-files are mainly vkm.cpp and MersenneTwister.h.
> 
> We would be very grateful for any advice on how to solve this problem.
> (below is also a demonstration)
> 
> Thank you very much in advance,
> 
> Klaus
> 
> Results on windows XP:
> 
> Compiler used: gcc version 4.2.1-sjlj (mingw32-2)
> 
>> library(OjaMedian)
>> set.seed(1)
>> testD <- rmvnorm(20,c(0,0))
>> summary(testD)
>        V1                V2
>  Min.   :-2.2147   Min.   :-1.989352
>  1st Qu.:-0.3844   1st Qu.:-0.399466
>  Median : 0.3597   Median :-0.054967
>  Mean   : 0.1905   Mean   :-0.006472
>  3rd Qu.: 0.7590   3rd Qu.: 0.655663
>  Max.   : 1.5953   Max.   : 1.358680
>> set.seed(1)
>> ojaMedian(testD)
> [1]  0.21423705 -0.05799643
>> sessionInfo()
> R version 2.9.0 (2009-04-17)
> i386-pc-mingw32
> 
> locale:
> LC_COLLATE=Finnish_Finland.1252;LC_CTYPE=Finnish_Finland.1252;LC_MONETARY=Finnish_Finland.1252;LC_NUMERIC=C;LC_TIME=Finnish_Finland.1252 
> 
> 
> attached base packages:
> [1] stats     graphics  grDevices utils     datasets  methods   base
> 
> other attached packages:
> [1] OjaMedian_0.0-14 ICSNP_1.0-3      ICS_1.2-1        survey_3.14
> [5] mvtnorm_0.9-5
> 
> loaded via a namespace (and not attached):
> [1] tools_2.9.0
>>
> 
> Results on Linux Kubuntu 8.10
> result of: cat /proc/version:
> Linux version 2.6.28-11-generic (buildd at palmer) (gcc version 4.3.3
> (Ubuntu 4.3.3-5ubuntu4) ) #42-Ubuntu SMP Fri Apr 17 01:57:59 UTC 2009
> 
>> library(OjaMedian)
>>  set.seed(1)
>>  testD <- rmvnorm(20,c(0,0))
>>  summary(testD)
> 
>        V1                V2
>  Min.   :-2.2147   Min.   :-1.989352
>  1st Qu.:-0.3844   1st Qu.:-0.399466
>  Median : 0.3597   Median :-0.054967
>  Mean   : 0.1905   Mean   :-0.006472
>  3rd Qu.: 0.7590   3rd Qu.: 0.655663
>  Max.   : 1.5953   Max.   : 1.358680
> 
>> set.seed(1)
>>  ojaMedian(testD)
> 
> (-0.501381, 0.193929)[1] 0.119149071 0.002732100
> 
>> sessionInfo()
> 
> R version 2.8.1 (2008-12-22)
> i486-pc-linux-gnu
> 
> locale:
> LC_CTYPE=en_US.UTF-8;LC_NUMERIC=C;LC_TIME=en_US.UTF-8;LC_COLLATE=en_US.UTF-8;LC_MONETARY=C;LC_MESSAGES=en_US.UTF-8;LC_PAPER=en_US.UTF-8;LC_NAME=C;LC_ADDRESS=C;LC_TELEPHONE=C;LC_MEASUREMENT=en_US.UTF-8;LC_IDENTIFICATION=C 
> 
> 
> attached base packages:
> [1] stats     graphics  grDevices utils     datasets  methods   base
> 
> other attached packages:
> [1] OjaMedian_0.0-14 ICSNP_1.0-3      ICS_1.2-1        survey_3.14
> [5] mvtnorm_0.9-5
> 
> 
> 
> 
> 
====================================================================================

La version française suit le texte anglais.

------------------------------------------------------------------------------------

This email may contain privileged and/or confidential in...{{dropped:26}}



More information about the R-devel mailing list