[Rd] MacOS parallel::makeCluster fails
Dominik Leutnant
|eutn@nt @end|ng |rom |h-muen@ter@de
Fri Jul 12 11:22:11 CEST 2019
Hi Thomas,
thanks for your reply (and thanks for your patience...).
I am now using the following minimal reprex:
> library(parallel)
> cl <- makeCluster(2L)
I freshly started the machine and did not open any other app. Just R.app (3.6.1).
After executing the second line of code, R seems to hang infinitely and does not respond.
The R process itself uses almost no CPU.
Unfortunately, I do not have any experience with neither "Sock_listen" nor "dtruss".
Is there an example somewhere available?
Best
Dominik
Am 05.06.19, 10:18 schrieb "Tomas Kalibera" <tomas.kalibera using gmail.com>:
Hi Dominik,
from the output, the master process could not "listen" on the port where
it expects a connection from the worker. We need to find out why. I'd
recommend first to create a minimal reproducible example (and one that
does not use future, only parallel, and a minimal number of threads,
ideally just 2). Then I'd recommend to check if the problem still exists
with R-devel. Then I'd check if the problem happens in all invocations,
even after reboots, on a clean system, without many running applications
- if it does, this is good news. Then you could post such example and we
could help more - if we can reproduce on our system indeed we could
debug, if not there could at least be more directed advice on how to
debug on your side. What I'd do myself if I could reproduce on my system
would be instrument R around Sock_listen in internet module to see
exactly what has failed with which error. Maybe dtruss would help too,
but instrumenting may be easier. The earlier problem you mention has
never been diagnosed (it was only intermittent on the reporter's
machine, we could not reproduce on our systems, and despite a lot of
effort on our side and on the reporter's, we could not reliably
diagnose). In principle, it could be some race condition in R (one has
been fixed since the previous report), but especially if it is
deterministic it would more likely be some OS limit on your system. You
could of course try playing with OS limits, on the number of open files,
etc, with changing the port number (port= option), etc, but I would
recommend the systematic approach of debugging the cause.
Best
Tomas
On 6/4/19 10:45 AM, Dominik Leutnant wrote:
> Hi all,
>
> The call parallel::makeCluster(1L) hangs infinitely on my MacOS machine which seems to be already reported by some people (e.g., https://stat.ethz.ch/pipermail/r-devel/2018-February/075565.html).
> However, the solutions posted on SO, GH or R-devel do not work in my case.
>
> So far, I unsuccessfully tested …
>
> 1. Couple of reboots
> 2. Adding 192.0.0.1 to /etc/hosts
> 3. Using R.app instead of RStudio.app
> 4. Turn off the firewall
>
> Following Hendriks advice, “cl <- future::makeClusterPSOCK(1L, verbose = TRUE, timeout = 60)” gives (note: without adding the timeout parameter, R just hangs):
>> Sys.setenv(LANGUAGE='en')
>> cl <- future::makeClusterPSOCK(1L, verbose = TRUE, timeout = 60)
> [local output] Workers: [n = 1] ‘localhost’
> [local output] Base port: 11867
> [local output] Creating node 1 of 1 ...
> [local output] - setting up node
> Testing if worker's PID can be inferred: ‘'/Library/Frameworks/R.framework/Resources/bin/Rscript' -e 'try(cat(Sys.getpid(),file="/var/folders/5s/kgm05t2s0_52gz1s445mnlgw0000gn/T//RtmpZp1RX6/future.parent=835.3434fe0c5c6.pid"), silent = TRUE)' -e "file.exists('/var/folders/5s/kgm05t2s0_52gz1s445mnlgw0000gn/T//RtmpZp1RX6/future.parent=835.3434fe0c5c6.pid')"’
> - Possible to infer worker's PID: TRUE
> [local output] Starting worker #1 on ‘localhost’: '/Library/Frameworks/R.framework/Resources/bin/Rscript' --default-packages=datasets,utils,grDevices,graphics,stats,methods -e 'try(cat(Sys.getpid(),file="/var/folders/5s/kgm05t2s0_52gz1s445mnlgw0000gn/T//RtmpZp1RX6/future.parent=835.3434fe0c5c6.pid"), silent = TRUE)' -e 'parallel:::.slaveRSOCK()' MASTER=localhost PORT=11867 OUT=/dev/null TIMEOUT=60 XDR=TRUE
> [local output] - Exit code of system() call: 0
> [local output] Waiting for worker #1 on ‘localhost’ to connect back
> [local output] Detected a warning from socketConnection(): ‘problem in listening on this socket’
> Killing worker process (PID 903) if still alive
> Worker (PID 903) was successfully killed: TRUE
> Error in socketConnection("localhost", port = port, server = TRUE, blocking = TRUE, :
> Failed to launch and connect to R worker on local machine ‘localhost’ from local machine ‘Dominiks-MBP.local’.
> * The error produced by socketConnection() was: ‘cannot open the connection’
> * In addition, socketConnection() produced 1 warning(s):
> - Warning #1: ‘problem in listening on this socket’
> * The localhost socket connection that failed to connect to the R worker used port 11867 using a communication timeout of 60 seconds and a connection timeout of 120 seconds.
> * Worker launch call: '/Library/Frameworks/R.framework/Resources/bin/Rscript' --default-packages=datasets,utils,grDevices,graphics,stats,methods -e 'try(cat(Sys.getpid(),file="/var/folders/5s/kgm05t2s0_52gz1s445mnlgw0000gn/T//RtmpZp1RX6/future.parent=835.3434fe0c5c6.pid"), silent = TRUE)' -e 'parallel:::.slaveRSOCK()' MASTER=localhost PORT=11867 OUT=/dev/null TIMEOUT=60 XDR=TRUE.
> * Worker (PID 903) was successfully killed: TRUE
> * Troubleshooting suggestions:
> - Suggestion #1: Set 'outfile=NULL' to see output from worker.
> In addition: Warning message:
> In socketConnection("localhost", port = port, server = TRUE, blocking = TRUE, :
> problem in listening on this socket
>
> My session looks like:
>> sessionInfo()
> R version 3.6.0 (2019-04-26)
> Platform: x86_64-apple-darwin15.6.0 (64-bit)
> Running under: macOS Mojave 10.14.5
>
> Matrix products: default
> BLAS: /Library/Frameworks/R.framework/Versions/3.6/Resources/lib/libRblas.0.dylib
> LAPACK: /Library/Frameworks/R.framework/Versions/3.6/Resources/lib/libRlapack.dylib
>
> Random number generation:
> RNG: Mersenne-Twister
> Normal: Inversion
> Sample: Rounding
>
> locale:
> [1] de_DE.UTF-8/de_DE.UTF-8/de_DE.UTF-8/C/de_DE.UTF-8/de_DE.UTF-8
>
> attached base packages:
> [1] stats graphics grDevices utils datasets methods base
>
> loaded via a namespace (and not attached):
> [1] compiler_3.6.0
> Any help is greatly appreciated.
> Best regards
> Dominik
>
> Dr. Dominik Leutnant
>
> Muenster University of Applied Sciences
> Department of Civil Engineering
> Institute for Infrastucture·Water·Resources·Environment (IWARU)
> WG Urban Hydrology and Water Management
> Corrensstr. 25
> FRG-48149 Münster, Germany
>
> Tel.: +49 (0) 251/83-65274
> Fax: +49 (0) 251/83-65915
> Mail: leutnant using fh-muenster.de<mailto:leutnant using fh-muenster.de>
> Web: https://www.fh-muenster.de/
>
> [[alternative HTML version deleted]]
>
> ______________________________________________
> R-devel using r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-devel
More information about the R-devel
mailing list