[Bioc-devel] ShortRead: optional custom labeling of samples in QA report
Julian Gehring
julian.gehring at embl.de
Tue Feb 12 15:23:15 CET 2013
Hi,
In the QA report of the 'ShortRead' package, a short sequential integer
labeling for referencing the samples/files throughout the report is
created by default. Would it be reasonable/possible to allow for other
optional names to label the samples to make the results of the report
easier to understand?
In general, I have three ideas what would be handy to have:
1. Derive a label from the file names. This is probably hard to
generalize and implement in a way that it actually helps.
2. In case the 'dirPath' argument in the 'qa' function call is a named
vector, such as
qa(dirPath=c(p1="bam_file1.bam", p2="bam_file2.bam"))
use the names ["p1", "p2"] for the labeling later on. This would
require storing the names in the object returned by 'qa', but should not
be too hard to implement.
3. Optionally, pass a named vector to the 'report' method, matching file
names to sample labels. In case the file names do not match or
'samples' is missing, default to the sequential labeling.
For option 3, I have created a simple example patch to illustrate how
this could be implemented (see attached). So, later this may look like
this:
library(ShortRead)
files = c(p1="bam_file1.bam", p2="bam_file2.bam")
qa = qa(files, type="BAM")
## default sequential labeling ##
ShortRead:::.report_html_BAMQA(qa, dest="report_normal")
## samples named according to names(files) ##
ShortRead:::.report_html_BAMQA(qa, dest="report_named", samples=files)
I'm happy about any inputs or thoughts regarding this.
Best wishes
Julian
More information about the Bioc-devel
mailing list