[BioC] Normalization quality

alex lam (RI) alex.lam at bbsrc.ac.uk
Thu Oct 5 15:16:12 CEST 2006


Hi Sean,
Thanks. I wasn't aware of the quality metrics. I googled it and will
have a go with simpleaffy to see how I get on.

How would you decide if GCRMA is better than MAS5 or vice-versa if you
have more info? The arrays came from GSE1485 in GEO. 

Cheers,
Alex

-----Original Message-----
From: Sean Davis [mailto:sdavis2 at mail.nih.gov] 
Sent: 05 October 2006 11:47
To: bioconductor at stat.math.ethz.ch
Cc: alex lam (RI)
Subject: Re: [BioC] Normalization quality

On Thursday 05 October 2006 06:22, alex lam (RI) wrote:
> Dear BioCers,
>
> Hi! I don't have much of experience in handling affy chips and I hope 
> someone can help me here.
>
> I loaded 276 affy CEL files using justGCRMA - my computer couldn't 
> cope going the readAffy route.
> Quantile normalization was done.
>
> I have noticed that for some probesets there are some strange results.
>
> For example:
> > summary(exprs(eset.norm.quantile)["203329_at",])
>
>    Min. 1st Qu.  Median    Mean 3rd Qu.    Max.
>   2.486   2.513   2.523   2.545   2.534   5.201
>
> This gene seems to be very lowly expressed for everyone except for 1 
> chip. I did a boxplot of that chip against a few others after 
> normalization and the overall distributions are similar.

I would look at the quality metrics for the arrays as a separate issue
from normalization.  If you have 276 arrays, there will likely be an
"outlier" for at least one array FOR MANY GENES.

> (1) As I don't have an AffyBatch object, is there a way to make image 
> plots? And what other methods are there to catch these odd ones?
>
> (2) I was provided summarised expression values from an external group

> and they used MAS5 for pre-processing. For the same gene, the 
> expression
>
> summary is:
> > summary(mas.expr[,"X203329_at"]) # I received a csv file, hence it's

> > a
>
> data frame
>    Min. 1st Qu.  Median    Mean 3rd Qu.    Max.
>   2.926   5.337   5.950   5.880   6.403   9.183
> The variation seems to be much greater, and it looks much more 
> interesting.

I think that for most purposes, most people would agree that MAS5 is
inferior to RMA or GCRMA, but such a statement is a dangerous one to
make without knowing the details of the experiment.

Sean



More information about the Bioconductor mailing list