[BioC] normalize.AffyBatch.quantiles and NA values
odlc at uchicago.edu
odlc at uchicago.edu
Thu Jul 6 18:27:44 CEST 2006
Ben,
Thank you very much for the answer.
By the way, I got your name wrong in my posting. For
the record, for those who might read this: the correct
reference for the quantile normalization procedure for
microarrays is:
Bolstad et al., Bioinformatics, 19(2):185–193, Jan 2003
Sorry for the confusion,
Omar.
---- Original message ----
>Date: Wed, 05 Jul 2006 09:16:08 -0700
>From: Ben Bolstad <bmb at bmbolstad.com>
>Subject: Re: [BioC] normalize.AffyBatch.quantiles and NA values
>To: odlc at uchicago.edu
>Cc: bioconductor at stat.math.ethz.ch
>
>Your understanding of what the code is doing is correct. eg
>
>> data(affybatch.example)
>> pm(affybatch.example)[1,]
>20A 20B 10A
>149 118 124
>> pm(affybatch.example)[1,1] <- NA
>> affybatch.example.norm <- normalize(affybatch.example)
>> pm(affybatch.example)[1:3,]
> 20A 20B 10A
>[1,] NA 118.0 124.0
>[2,] 143.5 124.8 116.5
>[3,] 132.0 111.0 105.0
>> pm(affybatch.example.norm)[1:3,]
> 20A 20B 10A
>[1,] NA 118.0000 124.0000
>[2,] 127.2667 137.3667 120.1667
>[3,] 115.8333 122.6000 107.3333
>>
>
>
>Note that normalize calls normalize.AffyBatch.quantiles by
default in
>this case.
>
>Why does it do this? Because the underlying routine which
does the
>normalization normalize.quantiles() does not handle NA
values. In some
>sense this was done by design, the thinking being that if you
start off
>with raw CEL file data there should not be any missing data.
>
>
>Ben
>
>
>On Fri, 2006-06-30 at 09:33 -0500, odlc at uchicago.edu wrote:
>> Hello,
>>
>> I am trying to understand how normalize.AffyBatch.quantiles
>> works. From what I understand of the code, it seems that
>> rows (corresponding to a probe) which contain even a single
>> NA are dropped; then, the quantile-normalization method
>> described in Botstein et al. is applied to the remaining rows,
>> and these normalized rows are put back into the original
>> batch.
>>
>> In other words, values in a row that contains NA's
>> remain unchanged.
>>
>> Questions:
>>
>> - Is this really how it works?
>>
>> - If yes, is this the intended behavior?
>>
>> Thank you,
>>
>> Omar.
>>
>> _______________________________________________
>> Bioconductor mailing list
>> Bioconductor at stat.math.ethz.ch
>> https://stat.ethz.ch/mailman/listinfo/bioconductor
>> Search the archives:
http://news.gmane.org/gmane.science.biology.informatics.conductor
>--
>Ben Bolstad <bmb at bmbolstad.com>
>http://bmbolstad.com
>
More information about the Bioconductor
mailing list