[Bioc-devel] SummarizedExperiment: structure loss, when mixing matrix and data.frame data

Vincent Carey stvjc at channing.harvard.edu
Sun Nov 26 13:56:31 CET 2017


Confirmed with the following sessionInfo(), satisfying biocValid()==TRUE

> sessionInfo()

R Under development (unstable) (2017-11-22 r73776)

Platform: x86_64-pc-linux-gnu (64-bit)

Running under: Linux Mint 18.1


Matrix products: default

BLAS: /home/stvjc/R-35-dist/lib/R/lib/libRblas.so

LAPACK: /home/stvjc/R-35-dist/lib/R/lib/libRlapack.so


locale:

 [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C

 [3] LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8

 [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8

 [7] LC_PAPER=en_US.UTF-8       LC_NAME=C

 [9] LC_ADDRESS=C               LC_TELEPHONE=C

[11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C


attached base packages:

[1] parallel  stats4    stats     graphics  grDevices utils     datasets

[8] methods   base


other attached packages:

[1] SummarizedExperiment_1.9.2 DelayedArray_0.5.5

[3] matrixStats_0.52.2         Biobase_2.39.0

[5] GenomicRanges_1.31.1       GenomeInfoDb_1.15.1

[7] IRanges_2.13.4             S4Vectors_0.17.10

[9] BiocGenerics_0.25.0


loaded via a namespace (and not attached):

 [1] lattice_0.20-35         bitops_1.0-6            grid_3.5.0

 [4] zlibbioc_1.25.0         XVector_0.19.1          Matrix_1.2-12

 [7] tools_3.5.0             RCurl_1.95-4.8          compiler_3.5.0

[10] GenomeInfoDbData_0.99.2

On Sun, Nov 26, 2017 at 7:09 AM, Felix Ernst <felix.ernst at ulb.ac.be> wrote:

> Hi all,
>
> I got different results constructing a SummarizedExperiment in 3.6 and
> 3.7. My question is, whether this is intentional or a bug.
>
> library(GenomicRanges)
> library(SummarizedExperiment)
>
> nrows <- 200; ncols <- 6
> counts <- matrix(runif(nrows * ncols, 1, 1e4), nrows)
> colnames(counts) <- LETTERS[1:6]
> rownames(counts) <- 1:nrows
> counts2 <- counts-floor(counts)
> rowRanges <- GRanges(rep(c("chr1", "chr2"), c(50, 150)),
>                      IRanges(floor(runif(200, 1e5, 1e6)), width=100),
>                      strand=sample(c("+", "-"), 200, TRUE),
>                      feature_id=sprintf("ID%03d", 1:200))
> colData <- DataFrame(Treatment=rep(c("ChIP", "Input"), 3),
>                      row.names=LETTERS[1:6])
>
> se <- SummarizedExperiment(assays=list(counts=counts),
>                            rowRanges=rowRanges,
>                            colData=colData)
>
> str(assays(se)$counts)
> assays(se)$counts2 <- as.data.frame(counts2)
> str(assays(se)$counts)
>
> On a Windows 10 R3.4.2 Bioc 3.6 this produces:
> num [1:200, 1:6] 8815 6314 1945 6185 5935 ...
>  - attr(*, "dimnames")=List of 2
>   ..$ : chr [1:200] "1" "2" "3" "4" ...
>   ..$ : chr [1:6] "A" "B" "C" "D" ...
>  num [1:200, 1:6] 8815 6314 1945 6185 5935 ...
>  - attr(*, "dimnames")=List of 2
>   ..$ : chr [1:200] "1" "2" "3" "4" ...
>   ..$ : chr [1:6] "A" "B" "C" "D" ...
>
> On Ubuntu 17.10 R-devel r73779 Bioc3.7  this produces
> num [1:200, 1:6] 8636 7040 9275 4821 2475 ...
>  - attr(*, "dimnames")=List of 2
>   ..$ : chr [1:200] "1" "2" "3" "4" ...
>   ..$ : chr [1:6] "A" "B" "C" "D" ...
>  num [1:1200] 8636 7040 9275 4821 2475 ...
>
> Somehow the structure is lost.
>
> This happens, if I mix matrix and data.frame data, and doesn’t, if I use
> only matrices. The man page defines matrix-like objects,
> which a data.frame is (isn’t it?) and the behavior is different from
> Bioc3.6 to Bioc3.7.
>
> I can rule out that this is a Windows/Linux thing, because the Travis
> build error, which pointed to an difference in the first place,
> didn‘t occur with bioc-release, just with bioc-devel.
>
> Thanks for any advice and suggestions.
>
> Felix
>
>         [[alternative HTML version deleted]]
>
> _______________________________________________
> Bioc-devel at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/bioc-devel

	[[alternative HTML version deleted]]



More information about the Bioc-devel mailing list