[BioC] Empty featureData in ExpressionSet generated by affy::justRMA

David Shih djh.shih at gmail.com
Wed May 18 20:04:12 CEST 2011


Hi Jim,

Thank you for clarifying the purpose of the featureData slot.
The assayData slot also appears empty in my ExpressionSet.

The rownames are missing in my expression matrix extracted using.
> exprs(eset)

I was hoping to fill the rownames of my matrix using keys obtained
from
> keys <- mappedkeys(hgu133a2ACCUM)

I checked that all the keys are mapped, and I am under the impression
that they should be for hgu133a2ACCUM.

My main concern is that hgu133a2ACCUM may not return the keys
(probe ids) in the same order as they appear in the expression matrix,
which would cause the probes to be incorrectly mapped.

I have no problem with normalizing older platforms like U133. All the
probe ids are automatically assigned upon extraction with exprs().
However, the hthgu133a platform does not appear to be as
straightforward.

I had a similar problems with the HT Human Gene 1.0 ST platform.
I ended up using the oligo package, whicih provides a getNetAffx()
function to retrieve probe annotation data.
However, the existing normalization code precludes from switching to
the oligo package.

Best regards,
David

On Wed, May 18, 2011 at 01:51:01PM -0400, James W. MacDonald wrote:
> Hi David,
> 
> On 5/18/2011 1:31 PM, David Shih wrote:
> >Dear list,
> >
> >I am using the affy package to normalize expression data on the HT
> >Human Genome U133A platform. According to earlier posting on the list,
> >this is equivalent to U133Av2.
> >
> >I used the justRMA() function to normalize the data, and obtained an
> >ExpressionSet object with an empty featureData slot.
> 
> You may be misunderstanding the purpose of the featureData slot.
> According to ?eSet-class:
> 
> 'featureData': Contains variables describing features (i.e., rows
>           in 'assayData') unique to this experiment. Use the
>           'annotation' slot to efficiently reference feature data
>           common to the annotation package used in the experiment.
>           Class: 'AnnotatedDataFrame-class'
> 
> The probeset IDs are not unique to any particular experiment, so are
> stored as part of the assayData, which can be extracted using e.g.,
> exprs():
> 
> > data(sample.ExpressionSet)
> > head(exprs(sample.ExpressionSet))
>                        A         B        C        D        E
> F       G
> AFFX-MurIL2_at  192.7420  85.75330 176.7570 135.5750 64.49390
> 76.3569 160.5050
> AFFX-MurIL10_at  97.1370 126.19600  77.9216  93.3713 24.39860
> 85.5088 98.9086
> AFFX-MurIL4_at   45.8192   8.83135  33.0632  28.7072  5.94492
> 28.2925 30.9694
> AFFX-MurFAS_at   22.5445   3.60093  14.6883  12.3397 36.86630
> 11.2568 23.0034
> AFFX-BioB-5_at   96.7875  30.43800  46.1271  70.9319 56.17440
> 42.6756 86.5156
> AFFX-BioB-M_at   89.0730  25.84610  57.2033  69.9766 49.58220
> 26.1262 75.0083
> 
> Best,
> 
> Jim
> 
> 
> >
> >I understand that I can retrive the annotation information from:
> >library(hgu133a2.db)
> >
> >However, I am uncertain whether the probes in the annotation library
> >are in the same order as the ExpressionSet object.
> >
> >Can I add the probe IDs using the following code?
> >
> >library(hgu133a2.db)
> >keys<- mappedkeys(hgu133ai2ACCNUM);
> >expr<- exprs(eset);
> >rownames(expr)<- keys;
> >
> >
> >Best regards,
> >
> >David Shih, Graduate Student
> >The Hospital for Sick Children
> >Brain Tumour Research Centre
> >101 College Street, TMDT-11-401M
> >Toronto, ON  M5G1L7
> >Canada
> >Tel:  416-813-7654 x4327
> >
> >
> >>sessionInfo()
> >R version 2.12.2 (2011-02-25)
> >Platform: x86_64-unknown-linux-gnu (64-bit)
> >
> >locale:
> >  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C
> >  [3] LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8
> >  [5] LC_MONETARY=C              LC_MESSAGES=en_US.UTF-8
> >  [7] LC_PAPER=en_US.UTF-8       LC_NAME=C
> >  [9] LC_ADDRESS=C               LC_TELEPHONE=C
> >[11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
> >
> >attached base packages:
> >[1] stats     graphics  grDevices utils     datasets  methods   base
> >
> >other attached packages:
> >[1] hgu133a2.db_2.4.5    org.Hs.eg.db_2.4.6   RSQLite_0.9-4
> >[4] DBI_0.2-5            AnnotationDbi_1.12.0 hthgu133acdf_2.7.0
> >[7] affy_1.28.0          Biobase_2.10.0
> >
> >loaded via a namespace (and not attached):
> >[1] affyio_1.18.0         preprocessCore_1.12.0 tools_2.12.2
> >
> >_______________________________________________
> >Bioconductor mailing list
> >Bioconductor at r-project.org
> >https://stat.ethz.ch/mailman/listinfo/bioconductor
> >Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
> 
> -- 
> James W. MacDonald, M.S.
> Biostatistician
> Douglas Lab
> University of Michigan
> Department of Human Genetics
> 5912 Buhl
> 1241 E. Catherine St.
> Ann Arbor MI 48109-5618
> 734-615-7826
> **********************************************************
> Electronic Mail is not secure, may not be read every day, and should
> not be used for urgent or sensitive issues
>



More information about the Bioconductor mailing list