[Bioc-sig-seq] Question when running ShortRead to generate a QA report
Martin Morgan
mtmorgan at fhcrc.org
Wed Sep 16 02:20:37 CEST 2009
"pterry at huskers.unl.edu" <pterry at huskers.unl.edu> writes:
> Dear Bioc-sig-sequencing,
>
> My question: Referring to the ShortRead tutorial (IOQA.pdf) at
> Bioc2009, page 15, how to generate a 'narrative description QA file'
> like 'qa_080828_081110.pdf' for my data?
>
> That is,
>
> Using the developer version of R,
>
> mterry at system76-pc:~/R-DEVEL/R-49517$ ./bin/R
>
> R version 2.10.0 Under development (unstable) (2009-09-01 r49517)
> Copyright (C) 2009 The R Foundation for Statistical Computing
> ISBN 3-900051-07-0
>
> ...
>
>> library("ShortRead")
> Loading required package: IRanges
>
> ...
>
> Loading required package: Biostrings
> Loading required package: BSgenome
> Loading required package: lattice
>> packageDescription("ShortRead")$Version
> [1] "1.3.32"
>>
>> sp <- SolexaPath("extdata/ELAND/080828_HWI-EAS88_0003")
>> rpt <- report(qa(sp), dest="reports/report_090709.pdf")
>
> Note: p-15, of IOQA.pdf, ShortRead tutorial at BioC2009, refers to sample report 'qa_080828_081110.pdf'. Near bottom of page 15, it states: "The QA report is self-documenting, providing a narrative description of each section."
>
> In the new "reports/report_090709.pdf/image" directory, there are 8 pairs of files plus a logo file:
>
> -rw-r--r-- 1 mterry mterry 22143 2009-09-07 12:23 alignmentQuality.jpg
> -rw-r--r-- 1 mterry mterry 15710 2009-09-07 12:23 alignmentQuality.pdf
> -rw-r--r-- 1 mterry mterry 8473 2009-09-07 12:23 bioclogo-small.jpg
> -rw-r--r-- 1 mterry mterry 23130 2009-09-07 12:23 perCycleBaseCall.jpg
> -rw-r--r-- 1 mterry mterry 23011 2009-09-07 12:23 perCycleBaseCall.pdf
> -rw-r--r-- 1 mterry mterry 20257 2009-09-07 12:23 perCycleQuality.jpg
> -rw-r--r-- 1 mterry mterry 21864 2009-09-07 12:23 perCycleQuality.pdf
> -rw-r--r-- 1 mterry mterry 29271 2009-09-07 12:23 perTileCount.jpg
> -rw-r--r-- 1 mterry mterry 74448 2009-09-07 12:23 perTileCount.pdf
> -rw-r--r-- 1 mterry mterry 17217 2009-09-07 12:23 perTileHistogram.jpg
> -rw-r--r-- 1 mterry mterry 3785 2009-09-07 12:23 perTileHistogram.pdf
> -rw-r--r-- 1 mterry mterry 27000 2009-09-07 12:23 perTileQuality.jpg
> -rw-r--r-- 1 mterry mterry 69548 2009-09-07 12:23 perTileQuality.pdf
> -rw-r--r-- 1 mterry mterry 21976 2009-09-07 12:23 readOccurences.jpg
> -rw-r--r-- 1 mterry mterry 31303 2009-09-07 12:23 readOccurences.pdf
> -rw-r--r-- 1 mterry mterry 19937 2009-09-07 12:23 readQuality.jpg
> -rw-r--r-- 1 mterry mterry 56130 2009-09-07 12:23 readQuality.pdf
>
> but I don't find "a self-documenting pdf file with narrative
> description of each section" as in the example file:
> qa_080828_081110.pdf? Before running this r49517 example, I had
> just installed 'texlive', (a TeX system) using Synaptic Package
> Manager onto my Ubuntu 9.04 linux laptop. Should this satisfy the
> requirement for 'LATEX installation' mentioned on page 15 of
> IOQA.PDF?
Hi --
The vignette is a little out of date with reality; pdf-based reports
have largely been replaced by html reports, and the way you produce
your report
rpt <- report(qa(sp), dest="reports/report_090709.pdf")
makes an html report (in a directory that happens to end in
'.pdf'). You can either enjoy the html, e.g.,
rpt <- report(qa(sp), dest=file.path("reports", "report_090709"))
browseURL(rpt)
(probably there is an index.html file in the folder that you created?)
or create a pdf version
rpt <- report(qa(sp), dest=file.path("reports", "report_090709"),
type="pdf")
pdf reports are only available for some return types of qa(sp), so you
might end up with an HTML report as the only option.
Since generating the result of qa can take some time, it might make
some sense to
qa_090709 <- qa(sp)
save(qa_090709, file="qa_090709.Rda")
i.e., separate out the qa and report generating portions.
I will bring the vignette up to date.
Martin
>
>> sessionInfo()
> R version 2.10.0 Under development (unstable) (2009-09-01 r49517)
> x86_64-unknown-linux-gnu
>
> locale:
> [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
> [3] LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8
> [5] LC_MONETARY=C LC_MESSAGES=en_US.UTF-8
> [7] LC_PAPER=en_US.UTF-8 LC_NAME=C
> [9] LC_ADDRESS=C LC_TELEPHONE=C
> [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
>
> attached base packages:
> [1] stats graphics grDevices utils datasets methods base
>
> other attached packages:
> [1] ShortRead_1.3.32 lattice_0.17-25 BSgenome_1.13.11 Biostrings_2.13.36
> [5] IRanges_1.3.66
>
> loaded via a namespace (and not attached):
> [1] Biobase_2.5.6 grid_2.10.0 hwriter_1.1 tools_2.10.0
>>
>
> P. Terry
> pterry at huskers.unl.edu
>
> _______________________________________________
> Bioc-sig-sequencing mailing list
> Bioc-sig-sequencing at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioc-sig-sequencing
--
Martin Morgan
Computational Biology / Fred Hutchinson Cancer Research Center
1100 Fairview Ave. N.
PO Box 19024 Seattle, WA 98109
Location: Arnold Building M1 B861
Phone: (206) 667-2793
More information about the Bioc-sig-sequencing
mailing list