[R-SIG-Mac] R run time question
Simon Urbanek
simon.urbanek at r-project.org
Wed Jul 13 22:35:26 CEST 2011
On Jul 13, 2011, at 3:50 PM, Steven McKinney wrote:
>
>
>
>> -----Original Message-----
>> From: r-sig-mac-bounces at r-project.org [mailto:r-sig-mac-bounces at r-project.org] On Behalf Of Wayne Gray
>> Sent: July-13-11 12:25 PM
>> To: r-sig-mac at r-project.org
>> Subject: [R-SIG-Mac] R run time question
>>
>> All,
>>
>> We are doing ANOVAs that take a long time (> 3 hrs) to run in 64-bit mode on an Intel MacPro with 8
>> gbytes and 2.66 GHZ Intel Core i7 machine.
>>
>> We have just tried running these on our server where we don't care as much how long it takes. The
>> server is a Mac "2 x 3 GHz Dual-Core Intel Xeon" with 13 GB 667 MHz DDR2 FB-DIMM.
>>
>> We have installed R "R 2.10.1 GUI 1.31-np Tiger build 32-bit (5538)" on the server.
>
> Why are you installing a 32 bit R? And why an older version of R?
>
> Can you not install a 64 bit version of the latest R?
>
>>
>> The data.frame is composed of the following observations, factors, and numerals:
>>
>>> str(e1jit41blkAll)
>> 'data.frame': 21648 obs. of 6 variables:
>> $ subjectid: Factor w/ 22 levels "1379","1744",..: 1 1 1 1 1 1 1 1 1 1 ...
>> $ cond : Factor w/ 2 levels "Visual","Auditory": 1 1 1 1 1 1 1 1 1 1 ...
>> $ block : Factor w/ 8 levels "2","3","4","5",..: 1 1 1 1 1 1 1 1 1 1 ...
>> $ cbtime : Factor w/ 41 levels "1","2","3","4",..: 1 10 11 12 13 14 15 16 17 18 ...
>> $ dirtime : Ord.factor w/ 3 levels "Early"<"Middle"<..: 1 1 1 1 1 1 1 1 1 1 ...
>> $ jitter : num 7.78 7.33 5.56 5 6.11 ...
>>
>> When we run the following ANOVA on this dataframe:
>>
>>> anova.ALL.e1jit41.blkAll <- with(e1jit41blkAll, aov(jitter ~ cond*block*dirtime*cbtime +
>> error(subjectid/(block*dirtime*cbtime)), data = e1jit41blkAll))
>>
>> We get the following feedback:
>>
>> Error: cannot allocate vector of size 3.5 Gb
>
> Running the ANOVA requires more than 4GB of RAM, which can not be done
> with 32 bit versions of software. So you will need 64 bit R to do this analysis.
>
>>
>>>
>> R(29720,0xa000d000) malloc: *** vm_allocate(size=3749089280) failed (error code=3)
>> R(29720,0xa000d000) malloc: *** error: can't allocate region
>> R(29720,0xa000d000) malloc: *** set a breakpoint in szone_error to debug
>> R(29720,0xa000d000) malloc: *** vm_allocate(size=3749089280) failed (error code=3)
>> R(29720,0xa000d000) malloc: *** error: can't allocate region
>> R(29720,0xa000d000) malloc: *** set a breakpoint in szone_error to debug
>>>
>>
>> So two categories of questions for all of you Mac-R wizards out there.
>>
>> First category of question: can this analysis be run faster (and if so, how) on our laptop Intel Macs?
>>
>> Second category of question: is there anything that can be done to the Apple Server or to our analysis
>> so that the analysis runs on our server?
>
> What OS version is on your server? If it's an old 32 bit version, upgrading the server
> to a recent 64 bit OS will help. Then install a recent 64 bit version of R.
>
FWIW: you can run R in 64-bit on Tiger as well, you just have to disable Quartz and the GUI since Tiger doesn't have 64-bit Cocoa.
Cheers,
Simon
> If you can do this, you can better assess whether this analysis can be done
> faster on the 8GB laptops.
>
> R runs entirely in RAM, so if the analysis needs more than 8GB of RAM, you may
> be experiencing "swapping" on the laptops. The operating system allocates
> as much memory as the job needs - if that amount of memory exceeds the amount
> of RAM you have, then the rest is allocated to "virtual memory" which is
> on the disk drive. Swapping occurs when part of the virtual memory of your job
> that is on disk is needed - the OS has to copy a chunk of what is in RAM out to
> disk, so it can copy the other chunk on disk back into RAM.
>
> Monitor the laptop with a shell command line process such as "top"
> and you will be able to determine how much virtual memory your R job
> is using, and how many "swaps" are involved. Swapping really slows things
> down as reading and writing from the disk is slow. If your job is swapping
> on the laptop, but uses less than 13GB of memory in total, it will run faster
> on the server. If the job is not swapping at all on the laptop, it will not
> run appreciably faster on the server.
>
>
> HTH
>
> Steve McKinney
>
>
>>
>> Many thanks,
>>
>> Wayne Gray
>>
>> [[alternative HTML version deleted]]
>>
>> _______________________________________________
>> R-SIG-Mac mailing list
>> R-SIG-Mac at r-project.org
>> https://stat.ethz.ch/mailman/listinfo/r-sig-mac
>
> _______________________________________________
> R-SIG-Mac mailing list
> R-SIG-Mac at r-project.org
> https://stat.ethz.ch/mailman/listinfo/r-sig-mac
>
>
More information about the R-SIG-Mac
mailing list