[Rd] How to efficiently share data (a dataframe) between R and Java

Dirk Eddelbuettel edd at debian.org
Sun Dec 6 19:56:22 CET 2015


On 6 December 2015 at 18:36, Ing. Jaroslav Kuchař wrote:
| In our ongoing project we use Java implementations of several
| algorithms. We also provide a “wrapper” implemented as an R package
| using rJava (https://github.com/jaroslav-kuchar/rCBA). Based on our
| recent experiments, a significant portion of the time is spent copying
| a data frame from R to Java. The Java implementation needs access to the
| source data frame.
| 
| I have tested several approaches: calling a Java method row by row;
| serializing the whole data frame to a temp file and parsing it in Java;
| or row-binding everything into a single vector and calling a single
| Java method. Each approach has its limitations, e.g. time-consuming
| row-by-row copying, serialization and parsing overhead, or the memory
| limits of a single vector.
| 
| Is there an efficient way to copy a data frame from R to Java,
| and another to copy one from Java back to R?
| 
| Thanks for any help you can provide...

Have you looked at the gold standard that is Rserve and its dedicated
clients, starting with the Java one?
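
Whichever transport is used, an R data frame is a list of equal-length
column vectors, so moving one array per column usually means far fewer
cross-language calls than going row by row. Below is a minimal sketch of
what the receiving side could look like. ColumnFrame and its methods are
hypothetical names made up for illustration; they are not part of Rserve,
rJava or rCBA, and the sketch assumes only numeric and character columns.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical holder (not part of any library): one Java array per column.
final class ColumnFrame {
    // Column name -> double[] or String[], kept in insertion order.
    private final Map<String, Object> columns = new LinkedHashMap<>();
    private int rows = -1; // -1 until the first column fixes the row count

    // Each add call stands for one R-to-Java transfer of a whole column.
    void addNumeric(String name, double[] values) {
        add(name, values, values.length);
    }

    void addText(String name, String[] values) {
        add(name, values, values.length);
    }

    private void add(String name, Object values, int length) {
        // All columns of a data frame must have the same number of rows.
        if (rows >= 0 && length != rows) {
            throw new IllegalArgumentException(
                "column " + name + " has " + length + " rows, expected " + rows);
        }
        rows = length;
        columns.put(name, values);
    }

    int rowCount() {
        return Math.max(rows, 0);
    }

    int columnCount() {
        return columns.size();
    }

    // Cell access by column name and zero-based row index.
    double numeric(String name, int row) {
        return ((double[]) columns.get(name))[row];
    }

    String text(String name, int row) {
        return ((String[]) columns.get(name))[row];
    }

    public static void main(String[] args) {
        ColumnFrame frame = new ColumnFrame();
        frame.addNumeric("age", new double[] {31, 45, 27});
        frame.addText("city", new String[] {"Prague", "Brno", "Plzen"});
        System.out.println(frame.rowCount() + " rows, "
            + frame.columnCount() + " columns");
        System.out.println(frame.text("city", 1) + " " + frame.numeric("age", 1));
    }
}
```

With a layout like this, the number of transfers grows with the number of
columns rather than the number of rows. Whether that is fast enough for
your data is something only a measurement on your side can tell.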

Dirk

-- 
http://dirk.eddelbuettel.com | @eddelbuettel | edd at debian.org
