[Bioc-devel] transitioning scater/scran to SingleCellExperiment

Tue Aug 8 11:14:21 CEST 2017

On 08/08/2017 03:59 AM, Angerer, Philipp wrote:
> 
> 
> Hi Aaron,
> 
> 
> 
> I guess this would be a question for the SummarizedExperiment developers, though personally, I never liked ExpressionSet's inclination to slap names on everything.
> 
> 
> 
> Too bad we’re bound to SummarizedExperiment’s “rows” and “cols”. Since they always refer to features and samples, respectively: Why not name them that?
> 
> There’s already too many APIs in too many programming languages that confusingly have one or the other convention – if whe know which is which, why not name them after that knowledge?
> BQ_BEGIN
> 
> 
> It probably wouldn't be a good idea to store distances as expression matrices. However, if there is a need for it, we can add a new slot for distance matrices. I think SC3 has a similar requirement, so perhaps this would be more generally useful than I first thought. You can post an issue on the github repository to remind Davide or me to do it.
> BQ_END
> 
> 
> Distance matrices (cell×cell) can’t only come from cell×gene matrices. You can e.g. use dynamic time warping to create them from cell×gene×time arrays.
> BQ_BEGIN
> 
> 
> Finally, I'm not sure what advantages those ergonomics provide. Indeed, if every package defines its own plot() S4 method for SingleCellExperiment, they will clobber each other in the dispatch table, resulting in some interesting results dependent on package loading order. If you have destiny-specific data and methods, best to keep them separate rather than stuffing them into the SCE object.
> BQ_END
> 
> 
> I wrote that I could e.g. create a plot_dm method, which plots a diffusion map stored in a SCE.
> 
> Also I didn’t mean the plot method with ergonomics. I meant fortify , names , $ , and [[ . Those would be very useful, as you could just do things like the following, and have autocompletion:
> sce$Predicate1 <- sce$SampleMeta1 > 40 # `$` accesses counts (by gene) and rowData. `$<-` sets rowData
> qplot(Gene1, Gene2, colour = Predicate1, data = sce) # fortify creates a data.frame containing cbind(t(counts), rowData)

FWIW, SummarizedExperiment (hence SingleCellExperimet) uses $ (and $<-; 
also [[ and [[<-) to access (with auto-completion) colData. And it 
wouldn't be a good idea to have $ access one element (counts) and $<- 
modify another (rowData), or to mix what $ and [[ access.

Martin

> 
> 
> Just as you can do now with DiffusionMap objects.
> 
> Also I’m not sure if i got rowData and the “t” right in the above code ;) I meant cbind(counts as cell×gene, sampleMeta as cell×n_meta)
> 
> Best,
> Phil
> 
>   
> 
> 
> Helmholtz Zentrum Muenchen
> 
> Deutsches Forschungszentrum fuer Gesundheit und Umwelt (GmbH)
> 
> Ingolstaedter Landstr. 1
> 
> 85764 Neuherberg
> 
> www.helmholtz-muenchen.de
> 
> Aufsichtsratsvorsitzende: MinDir'in Baerbel Brumme-Bothe
> 
> Geschaeftsfuehrer: Prof. Dr. Guenther Wess, Heinrich Bassler, Dr. Alfons Enhsen
> 
> Registergericht: Amtsgericht Muenchen HRB 6466
> 
> USt-IdNr: DE 129521671
> 
> 
> 	[[alternative HTML version deleted]]
> 
> _______________________________________________
> Bioc-devel at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/bioc-devel
> 

This email message may contain legally privileged and/or...{{dropped:2}}