[Rd] Does anybody successfully built latest R on AIX 5.3?
Ei-ji Nakama
nakama at ki.rim.or.jp
Mon Jun 20 05:24:21 CEST 2011
Hi,
> This is strange; we were hoping to improve R performance with a high-clock-speed Power CPU (above 4.0 GHz).
> Now I think we should weigh more factors; RAM is also cheaper for x86 than for Power :)
POWER is not a good CPU.
$ lsdev -C | grep proc
proc0 Available 00-00 Processor
proc2 Available 00-02 Processor
$ lsattr -El proc0
frequency   4204000000     Processor Speed       False
smt_enabled true           Processor SMT enabled False
smt_threads 2              Processor SMT threads False
state       enable         Processor state       False
type        PowerPC_POWER6 Processor type        False
$ lsattr -El proc2
frequency   4204000000     Processor Speed       False
smt_enabled true           Processor SMT enabled False
smt_threads 2              Processor SMT threads False
state       enable         Processor state       False
type        PowerPC_POWER6 Processor type        False
Here are the DGEMM results with GotoBLAS (http://prs.ism.ac.jp/~nakama/SurviveGotoBLAS2):
$ GOTO_NUM_THREADS=1 ./bm 2000
12.954 GFLOPS (N x N : N=2000 1.23517sec)
12.719 GFLOPS (N x T : N=2000 1.25796sec)
13.118 GFLOPS (T x N : N=2000 1.21965sec)
12.726 GFLOPS (T x T : N=2000 1.25732sec)
$ GOTO_NUM_THREADS=2 ./bm 2000
25.259 GFLOPS (N x N : N=2000 0.633444sec)
24.050 GFLOPS (N x T : N=2000 0.665272sec)
25.710 GFLOPS (T x N : N=2000 0.622316sec)
24.075 GFLOPS (T x T : N=2000 0.664595sec)
$ GOTO_NUM_THREADS=4 ./bm 2000
21.311 GFLOPS (N x N : N=2000 0.750802sec)
25.778 GFLOPS (N x T : N=2000 0.620694sec)
26.398 GFLOPS (T x N : N=2000 0.60611sec)
25.826 GFLOPS (T x T : N=2000 0.619536sec)
It is fast up to 2 CPUs because of the CPU's structure (shared L2 cache).
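
The GFLOPS figures above can be checked from the timings. This is only a rough sketch: it assumes that bm counts 2*N^3 floating-point operations per N x N multiply (the usual DGEMM convention, not confirmed from the bm source), and the function name is made up for illustration. It also prints the speedup relative to one thread for the "N x N" case.

```python
# Sketch: recompute GFLOPS and thread scaling from the timings above.
# Assumption: one N x N DGEMM is counted as 2*N^3 floating-point operations.

def dgemm_gflops(n: int, seconds: float) -> float:
    """GFLOPS for one n x n by n x n multiply that took `seconds`."""
    return 2.0 * n**3 / seconds / 1e9

# threads -> seconds for the "N x N" case, N=2000, copied from the runs above
timings = {1: 1.23517, 2: 0.633444, 4: 0.750802}

base = timings[1]
for threads, secs in timings.items():
    rate = dgemm_gflops(2000, secs)
    speedup = base / secs  # relative to the single-thread run
    print(f"{threads} thread(s): {rate:.3f} GFLOPS, speedup {speedup:.2f}x")
```

With these numbers the speedup is close to 2x at two threads and does not improve at four.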
Best Regards,
--
EI-JI Nakama <nakama (a) ki.rim.or.jp>
"中間栄治" <nakama (a) ki.rim.or.jp>