[BioC] Problem with NChannelSet objects using Agilent Data with arrayQualityMetrics package

Wolfgang Huber whuber at embl.de
Thu Feb 27 20:50:24 CET 2014


Dear Laurent
try with this script:

----------------------
library("ArrayExpress")
library("arrayQualityMetrics")
library("geneplotter")
library("vsn")

AEset = ArrayExpress("E-GEOD-32915")

## explore raw data distributions
par(mfrow=c(2,1))
for(w in c("E", "Eb")) multidensity(log2(assayData(AEset)[[w]]), main=w)

## normalize
v = vsn2(assayData(AEset)$E - assayData(AEset)$Eb)

x = ExpressionSet(assayData = exprs(v),
    phenoData = phenoData(AEset),
    featureData = featureData(AEset),
    experimentData = experimentData(AEset))

arrayQualityMetrics ( x, intgroup = "Comment..Sample_source_name." )
-------------------------

Usually, as is here, some data wrangling and “normalization” is required between downloading a raw data set from ArrayExpress and processing it further with Bioconductor tools. Although the microarray field has been around for a while, these steps are still not fully automated. (Contributions are welcome.) The code above implies the (subjective) decision to subtract Eb from E, and then call vsn2 for between-array “normalization” and log-like transformation. I then stick the result back into ExpressionSet ‘x’ and so we go.

Hope this helps
	Wolfgang

> print(sessionInfo())
R Under development (unstable) (2014-02-27 r65088)
Platform: x86_64-apple-darwin13.0.0/x86_64 (64-bit)

locale:
[1] C

attached base packages:
[1] parallel  stats     graphics  grDevices utils     datasets  methods  
[8] base     

other attached packages:
 [1] vsn_3.31.2                 geneplotter_1.41.1        
 [3] annotate_1.41.2            AnnotationDbi_1.25.9      
 [5] lattice_0.20-27            arrayQualityMetrics_3.19.6
 [7] ArrayExpress_1.23.1        Biobase_2.23.5            
 [9] BiocGenerics_0.9.3         fortunes_1.5-2            

loaded via a namespace (and not attached):
 [1] BeadDataPackR_1.15.1  BiocInstaller_1.13.3  Biostrings_2.31.14   
 [4] Cairo_1.5-5           DBI_0.2-7             Formula_1.1-1        
 [7] Hmisc_3.14-1          IRanges_1.21.32       KernSmooth_2.23-10   
[10] RColorBrewer_1.0-5    RJSONIO_1.0-3         RSQLite_0.11.4       
[13] Rcpp_0.11.0           SVGAnnotation_0.93-1  XML_3.98-1.1         
[16] XVector_0.3.7         affy_1.41.3           affyPLM_1.39.2       
[19] affyio_1.31.0         beadarray_2.13.1      cluster_1.14.4       
[22] colorspace_1.2-4      gcrma_2.35.1          genefilter_1.45.1    
[25] grid_3.1.0            gridSVG_1.4-0         hwriter_1.3          
[28] latticeExtra_0.6-26   limma_3.19.21         plyr_1.8.1           
[31] preprocessCore_1.25.5 reshape2_1.2.2        setRNG_2011.11-2     
[34] splines_3.1.0         stats4_3.1.0          stringr_0.6.2        
[37] survival_2.37-7       xtable_1.7-1          zlibbioc_1.9.0 


Il giorno 26 Feb 2014, alle ore 17:43, Laurent.Pays <Laurent.Pays at univ-lyon1.fr> ha scritto:

> Hi to all,
> 
> After downloading Agilent Data  using the ArrayExpress package:
> 
>>> AEset <-ArrayExpress("E-GEOD-32915”)
> 
> I verified that AEset was indeed a NChannelSet object:
>>> class(AEset)
>> [1] "NChannelSet"
>> attr(,"package")
>> [1] “Biobase"
> 
> Then I tried to use the arrayQualityMetrics package which should work on NChannelSet objects:
> 
>> arrayQualityMetrics ( AEset )
> 
> But I get this error message:
> 
>> The report will be written into directory 'arrayQualityMetrics report for AEset'. 
>> Erreur dans platformspecific(expressionset, intgroup, do.logtransform) : 
>>  'assayData(expressionset)' must contain either 'exprs', or 'R' and 'G', but not both.
> 
> My question is how do I get pass this error, or at least or can I remove either ‘exprs’, or ‘R’ and ‘G’ from the NChannelSet object knowing that when I try:
>>> channelNames(AEset)
> I only get:
>> [1] "E"  “Eb”
> and I don’t know what to analyse with arrayQualityMetrics: “E” or “Eb”?
> 
> Thanks in advance to your answers
> 
> Laurent
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor



More information about the Bioconductor mailing list