[BioC] rma for tiling arrays (oligo package)

James W. MacDonald jmacdon at med.umich.edu
Mon Jul 21 17:15:24 CEST 2008


Thanks Ben, I forgot that you moved everything into preprocessCore.

Best,

Jim



Ben Bolstad wrote:
> Actually, you can avoid "copying from affy" altogether by using
> preprocessCore directly where both normalize.quantiles() and 
> rma.background.correct() are actually defined. 
> 
> I think oligo also loads preprocessCore, so those functions should
> already be exposed.
> 
> Ben
> 
> 
> 
> 
> 
> On Mon, 2008-07-21 at 10:48 -0400, James W. MacDonald wrote:
>> Hi Ann,
>>
>> I don't think you want to use rma() directly, as it is going to try to 
>> do a medianpolish on probesets but such a thing doesn't exist for the 
>> tiling arrays.
>>
>> If you want to use the background correction and normalization that are 
>> used by rma() then I think it will take some work on your part. The 
>> functions you will want to use are part of the affy package, but you 
>> don't really want to load affy and oligo at the same time because there 
>> are so many identically named functions (they both have namespaces, so 
>> this isn't the end of the world, but it is easier if you don't have to 
>> deal with name collisions).
>>
>> I would personally just copy the functions normalize.quantiles() and 
>> rma.background.correct() from affy into a file (say, affysources.R) and 
>> then source that into R. Both of these functions want you to pass a 
>> matrix, so you would want to extract the pm data from your AllArrays 
>> object, run rma.background.correct() and then normalize.quantiles() on 
>> the matrix, and then put that back into AllArrays.
>>
>> Best,
>>
>> Jim
>>
>>
>>
>> Ann Hess wrote:
>>> After creating an appropriate library using the makePDpackage, I am 
>>> using the oligo package to open and work with Affymetrix Arabidopsis 
>>> Tiling 1.0R Arrays.  I am interested in using the rma function to 
>>> background correct and normalize the data, but I am not sure how to map 
>>> the processed data back to probes or directly to chromosome and position.
>>>
>>> What do the rownames of the expression matrix created by rma correspond 
>>> to?  My best guess is that they correspond to chromosome position (which 
>>> can be found using pmChr, but not for an ExpressionSet object).  
>>> However, these positions are relative to a particular chromosome and 
>>> therefore not unique.  For example, there are probes corresponding to 
>>> position 417 on both Chromosome 3 and Chromosome 5, but only a single 
>>> row in the ExpressionSet object corresponding to 417.
>>>
>>> Is there a way to background correct and normalize the data without the 
>>> rma function?  Perhaps this would allow for easier mapping to probes.
>>>
>>> Any suggestions would be appreciated.
>>>
>>> Ann
>>>
>>> Code and session info is here:
>>>
>>>> library(oligo)
>>>> library(pd.at35b.mr.v04.2.tigrv5)
>>>> AllArrays<-read.celfiles(list.celfiles(),pk="pd.at35b.mr.v04.2.tigrv5")
>>>> dim(pm(AllArrays))
>>> [1] 3092374      12
>>>> dim(mm(AllArrays))
>>> [1] 3092338      12
>>>
>>>> Pos<-pmPosition(AllArrays)
>>>> length(Pos)
>>> [1] 3092374
>>>> length(unique(Pos))
>>> [1] 2921991
>>>
>>>> RMAout<-rma(AllArrays)
>>>> dim(exprs(RMAout))
>>> [1] 2921991      12
>>>
>>>> exprs(RMAout)[1:10,1:2]
>>>          Comp5-1_1006.CEL Comp5-2_1006.CEL
>>> 0                3.344400         3.295634
>>> 1                1.988137         1.708682
>>> 1000             6.315857         7.297425
>>> 10000009         9.053133         8.754469
>>> 10000014         2.106050         2.137780
>>> 10000024        10.392988         9.385502
>>> 10000026         2.242264         5.487639
>>> 10000034         1.830658         5.239400
>>> 1000004          3.097441         5.825040
>>> 10000046         6.839724         7.221181
>>>
>>>> sessionInfo()
>>> R version 2.6.0 (2007-10-03)
>>> x86_64-redhat-linux-gnu
>>>
>>> attached base packages:
>>> [1] splines   tools     stats     graphics  grDevices utils     datasets
>>> [8] methods   base
>>>
>>> other attached packages:
>>> [1] pd.at35b.mr.v04.2.tigrv5_1.2.0 oligo_1.2.2
>>> [3] oligoClasses_1.0.3             affxparser_1.10.2
>>> [5] AnnotationDbi_1.0.6            preprocessCore_1.0.0
>>> [7] RSQLite_0.6-9                  DBI_0.2-4
>>> [9] Biobase_1.16.3
>>>
>>> loaded via a namespace (and not attached):
>>> [1] rcompgen_0.1-17
>>>
>>> _______________________________________________
>>> Bioconductor mailing list
>>> Bioconductor at stat.math.ethz.ch
>>> https://stat.ethz.ch/mailman/listinfo/bioconductor
>>> Search the archives: 
>>> http://news.gmane.org/gmane.science.biology.informatics.conductor
>> -- 
>> James W. MacDonald, M.S.
>> Biostatistician
>> Hildebrandt Lab
>> 8220D MSRB III
>> 1150 W. Medical Center Drive
>> Ann Arbor MI 48109-0646
>> 734-936-8662
>>
>> _______________________________________________
>> Bioconductor mailing list
>> Bioconductor at stat.math.ethz.ch
>> https://stat.ethz.ch/mailman/listinfo/bioconductor
>> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
> 

-- 
James W. MacDonald, M.S.
Biostatistician
Hildebrandt Lab
8220D MSRB III
1150 W. Medical Center Drive
Ann Arbor MI 48109-0646
734-936-8662



More information about the Bioconductor mailing list