[BioC] rma for tiling arrays (oligo package)
Ben Bolstad
bmb at bmbolstad.com
Mon Jul 21 16:56:05 CEST 2008
Actually, you can avoid "copying from affy" altogether by using
preprocessCore directly where both normalize.quantiles() and
rma.background.correct() are actually defined.
I think oligo also loads preprocessCore, so those functions should
already be exposed.
Ben
On Mon, 2008-07-21 at 10:48 -0400, James W. MacDonald wrote:
> Hi Ann,
>
> I don't think you want to use rma() directly, as it is going to try to
> do a medianpolish on probesets but such a thing doesn't exist for the
> tiling arrays.
>
> If you want to use the background correction and normalization that are
> used by rma() then I think it will take some work on your part. The
> functions you will want to use are part of the affy package, but you
> don't really want to load affy and oligo at the same time because there
> are so many identically named functions (they both have namespaces, so
> this isn't the end of the world, but it is easier if you don't have to
> deal with name collisions).
>
> I would personally just copy the functions normalize.quantiles() and
> rma.background.correct() from affy into a file (say, affysources.R) and
> then source that into R. Both of these functions want you to pass a
> matrix, so you would want to extract the pm data from your AllArrays
> object, run rma.background.correct() and then normalize.quantiles() on
> the matrix, and then put that back into AllArrays.
>
> Best,
>
> Jim
>
>
>
> Ann Hess wrote:
> > After creating an appropriate library using the makePDpackage, I am
> > using the oligo package to open and work with Affymetrix Arabidopsis
> > Tiling 1.0R Arrays. I am interested in using the rma function to
> > background correct and normalize the data, but I am not sure how to map
> > the processed data back to probes or directly to chromosome and position.
> >
> > What do the rownames of the expression matrix created by rma correspond
> > to? My best guess is that they correspond to chromosome position (which
> > can be found using pmChr, but not for an ExpressionSet object).
> > However, these positions are relative to a particular chromosome and
> > therefore not unique. For example, there are probes corresponding to
> > position 417 on both Chromosome 3 and Chromosome 5, but only a single
> > row in the ExpressionSet object corresponding to 417.
> >
> > Is there a way to background correct and normalize the data without the
> > rma function? Perhaps this would allow for easier mapping to probes.
> >
> > Any suggestions would be appreciated.
> >
> > Ann
> >
> > Code and session info is here:
> >
> >> library(oligo)
> >> library(pd.at35b.mr.v04.2.tigrv5)
> >> AllArrays<-read.celfiles(list.celfiles(),pk="pd.at35b.mr.v04.2.tigrv5")
> >> dim(pm(AllArrays))
> > [1] 3092374 12
> >> dim(mm(AllArrays))
> > [1] 3092338 12
> >
> >> Pos<-pmPosition(AllArrays)
> >> length(Pos)
> > [1] 3092374
> >> length(unique(Pos))
> > [1] 2921991
> >
> >> RMAout<-rma(AllArrays)
> >
> >> dim(exprs(RMAout))
> > [1] 2921991 12
> >
> >> exprs(RMAout)[1:10,1:2]
> > Comp5-1_1006.CEL Comp5-2_1006.CEL
> > 0 3.344400 3.295634
> > 1 1.988137 1.708682
> > 1000 6.315857 7.297425
> > 10000009 9.053133 8.754469
> > 10000014 2.106050 2.137780
> > 10000024 10.392988 9.385502
> > 10000026 2.242264 5.487639
> > 10000034 1.830658 5.239400
> > 1000004 3.097441 5.825040
> > 10000046 6.839724 7.221181
> >
> >> sessionInfo()
> > R version 2.6.0 (2007-10-03)
> > x86_64-redhat-linux-gnu
> >
> > attached base packages:
> > [1] splines tools stats graphics grDevices utils datasets
> > [8] methods base
> >
> > other attached packages:
> > [1] pd.at35b.mr.v04.2.tigrv5_1.2.0 oligo_1.2.2
> > [3] oligoClasses_1.0.3 affxparser_1.10.2
> > [5] AnnotationDbi_1.0.6 preprocessCore_1.0.0
> > [7] RSQLite_0.6-9 DBI_0.2-4
> > [9] Biobase_1.16.3
> >
> > loaded via a namespace (and not attached):
> > [1] rcompgen_0.1-17
> >
> > _______________________________________________
> > Bioconductor mailing list
> > Bioconductor at stat.math.ethz.ch
> > https://stat.ethz.ch/mailman/listinfo/bioconductor
> > Search the archives:
> > http://news.gmane.org/gmane.science.biology.informatics.conductor
>
> --
> James W. MacDonald, M.S.
> Biostatistician
> Hildebrandt Lab
> 8220D MSRB III
> 1150 W. Medical Center Drive
> Ann Arbor MI 48109-0646
> 734-936-8662
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at stat.math.ethz.ch
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
More information about the Bioconductor
mailing list