[BioC] affyPara: background correction & normalization only?

Markus Schmidberger schmidb at ibe.med.uni-muenchen.de
Sat Mar 6 20:09:21 CET 2010


Leo Lahti wrote:
> Dear Markus, 
>
> Thanks for a quick reply.
>
>
> I tested affyPara 1.7.1. To get the summarization method 'none' to 
> work, I had to slightly modify the summary.method check in preproPara 
> R code ("if ((any( express.summary.stat.methods()==summary.method) || 
> summary.method == "none") == 0) stop("Unknown 
> Summarization-Method")"). After this the summary.method "none" seems 
> to be functional in preproPara function.
>
>
> However, I bumped into another problem which seems to be related to 
> the affyPara warnings in build/check at  
> http://bioconductor.org/checkResults/2.6/bioc-LATEST/
> I could not so far solve this issue so far. Any 
> solutions/updates/feedback would be appreciated.
Do not use the unstable 2.11 version. Please use version 2.10.
I will check for these warnings as soon there is a code freeze for 2.11.
I will add the none to the summary.method list. Thanks!

Best
Markus
>
>
> You can find the output of my running example below.
>
>
> with kind regards
> Leo Lahti 
> leo.lahti at iki.fi <mailto:leo.lahti at iki.fi>
> http://www.cis.hut.fi/lmlahti
>
>
> > require(affyPara)
> > require(hgu133plus2cdf)
> Loading required package: hgu133plus2cdf
> > require(hgu133plus2hsentrezgcdf)
> Loading required package: hgu133plus2hsentrezgcdf
> > cl <- makeCluster(3, type = "SOCK")
> > cels <- list.celfiles("/my/CEL/path", full.names = TRUE)
> >
> > eset <- preproPara(cels,
> +            bgcorrect = TRUE, bgcorrect.method = "rma",
> +            normalize = TRUE, normalize.method = "quantiles",
> +            pmcorrect.method = "pmonly",
> +            summary.method = "none",
> +            cdfname = "HGU133Plus2_Hs_ENTREZG", cluster = cl, verbose 
> = TRUE)
> Partition of object Error in gsub("^/?([^/]*/)*", "", unlist(object), 
> extended = TRUE) :
>   unused argument(s) (extended = TRUE)
> > sessionInfo()
> R version 2.11.0 Under development (unstable) (2010-02-15 r51142)
> x86_64-unknown-linux-gnu
>
> locale:
>  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C
>  [3] LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8
>  [5] LC_MONETARY=C              LC_MESSAGES=en_US.UTF-8
>  [7] LC_PAPER=en_US.UTF-8       LC_NAME=C
>  [9] LC_ADDRESS=C               LC_TELEPHONE=C
> [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
>
> attached base packages:
> [1] tcltk     stats     graphics  grDevices utils     datasets  methods
> [8] base
>
> other attached packages:
> [1] hgu133plus2hsentrezgcdf_12.1.0 hgu133plus2cdf_2.5.0
> [3] affyPara_1.7.1                 aplpack_1.2.2
> [5] vsn_3.15.1                     snow_0.3-3
> [7] affy_1.25.2                    Biobase_2.7.4
>
> loaded via a namespace (and not attached):
> [1] affyio_1.15.2        grid_2.11.0          lattice_0.18-3
> [4] limma_3.3.9          preprocessCore_1.9.0 tools_2.11.0
>
> On Fri, Mar 5, 2010 at 2:02 PM, Markus Schmidberger 
> <schmidb at ibe.med.uni-muenchen.de 
> <mailto:schmidb at ibe.med.uni-muenchen.de>> wrote:
>
>     Dear Leo,
>
>     good point.
>
>     I did some changes to preproPara(). No there is a summarization
>     method 'none' (summary.method='none') available. There will be no
>     summarization step and the results object eset is NULL.
>     I submitted these changes and documentation to the svn. Version
>     1.7.1. should be available ad midnight:
>     http://www.bioconductor.org/packages/devel/bioc/html/affyPara.html
>
>     Now at all nodes the bgc and normalized affyBatches are available
>     in the GlobalEnvironment. It is very simple to run some code on that.
>
>     res <- clusterCall(cluster, FUN)
>     FUN<- function()
>     {
>       require(affy)
>       if (exists("AffyBatch", envir = .GlobalEnv))
>              AffyBatch <- get("AffyBatch", envir = .GlobalEnv)
>
>       # do anything you want on the AffyBatch.  
>     }
>
>     res will be a list of results from all nodes. You have to find a
>     way to combine these results.
>
>     Best
>     Markus
>
>
>
>
>     Leo Lahti wrote:
>
>         Dear affyPara package maintainers and BioC developer community,
>
>
>         The parallelizations of Affy preprocessing in the affyPara
>         package provide essential tools to handle large array collections.
>
>         Before starting to hack on this myself, I would like to ask if
>         there are workarounds in affyPara to obtain a preprocessed &
>         normalized (but not summarized) PM intensity matrix from CEL
>         files? Alternatively, having an access to virtual affybatch
>         (i.e. keeping it in the nodes without 'rebuild') would solve
>         the problem. Is such functionality available?
>
>         I would need the probe-level values (for PM probes) after
>         preprocessing and normalization (but without probeset
>         summarization step)  - in an ideal case a probes x arrays
>         matrix, and not necessarily other information from the
>         affybatch object. It seems that both background correction and
>         normalization can be done in affyPara by reading in the CEL
>         files directly. However, the output of these methods in itself
>         an affybatch which will cause the memory problems the package
>         is trying to solve.
>
>         Thanks once more for relevant work.
>
>         with kind regards
>
>         Leo Lahti
>         Department of Information and Computer Science
>         Aalto University School of Science and Technology
>         Finland
>
>
>
>     -- 
>     Dr. rer. nat. Markus Schmidberger
>
>     Ludwig-Maximilians-Universität München
>     IBE - Institut für medizinische Informationsverarbeitung,
>     Biometrie und Epidemiologie
>     Lehrstuhl für Biometrie und Bioinformatik
>     Marchioninistr. 15, D-81377 Muenchen
>     URL: http://www.ibe.med.uni-muenchen.de Mail: Markus.Schmidberger
>     [at] ibe.med.uni-muenchen.de <http://ibe.med.uni-muenchen.de>
>
>


-- 
Dr. rer. nat. Markus Schmidberger

Ludwig-Maximilians-Universität München
IBE - Institut für medizinische Informationsverarbeitung,
Biometrie und Epidemiologie
Lehrstuhl für Biometrie und Bioinformatik
Marchioninistr. 15, D-81377 Muenchen
URL: http://www.ibe.med.uni-muenchen.de 
Mail: Markus.Schmidberger [at] ibe.med.uni-muenchen.de



More information about the Bioconductor mailing list