[Bioc-devel] any interest in a BiocMatrix core package?
Michael Lawrence
lawrence.michael at gene.com
Fri Mar 3 18:29:42 CET 2017
This is along the lines of what I suggested on the board phone call. If
there is already a C++ library like Armadillo for doing the heavy lifting,
it would be easy to implement an R-level abstraction on top of it, just as
was done for HDF5, bigMemory, etc.
On Fri, Mar 3, 2017 at 8:32 AM, McDavid, Andrew <
Andrew_Mcdavid at urmc.rochester.edu> wrote:
> On C++, Armadillo can be passed a a pointer to memory for the backing
> store of its objects, so can use memory mapping. On the R side, package
> bigmemory provides R access and initialization of memory-mapped arrays.
> See https://www.r-bloggers.com/using-rcpparmadillo-with-bigmemory/. This
> doesn’t provide language or platform interchange of the backing store, but
> would be an easy-ish solution.
>
> On Mar 3, 2017, at 10:23 AM, bioc-devel-request at r-project.org<mailto:
> bioc-devel-request at r-project.org> wrote:
>
> Some comment on Aaron's stuff
>
> One possibility for doing things like this is if your code can be done in
> C++ using a subset of rows or columns. That can sometimes give the
> necessary speed up. What I mean is this
>
> Say you can safely process 1000 cells (not matrix cells, but biological
> cells, aka columns) at a time in RAM
>
> iterate in R:
> get chunk i containing 1000 cells from the backend data storage
> do something on this sub matrix where everything is in a normal matrix
> and you just use C++
> write results out to whatever backend you're using
>
> Then, with a million cells you iterate over 1000 chunks in R. And you
> don't need to "touch" the full dataset which can be stored on an arbitrary
> backend. And this approach could be run even (potentially) with different
> chunks on different nodes.
>
>
> [[alternative HTML version deleted]]
>
> _______________________________________________
> Bioc-devel at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/bioc-devel
>
[[alternative HTML version deleted]]
More information about the Bioc-devel
mailing list