[BioC] Affy: gene filtering before or after normalization??

w.huber at dkfz-heidelberg.de w.huber at dkfz-heidelberg.de
Fri Oct 10 18:31:51 MEST 2003


Hi Jing,

in my experience (and it seems to be that of others) quality control,
including gene filtering, should be done while and after normalization,
and there is not too much to be done before. I.e. if you use a model-based
normalization, you can use the residuals for QC, you can use the
reproducibility of the per-probe intensities across chips for QC, and
since with Affy genechips you have multiple probes per gene you should use
that, too. I think Francois Collins has a nice method for QC based on the
residuals of the probe-set-summary model.

All this assuming that there are no obvious scratches, fingerprints,
gradients etc on the chip image itself, in which case you should probably
send them back to the lab...

Best wishes
  Wolfgang


-------------------------------------
Wolfgang Huber
Division of Molecular Genome Analysis
German Cancer Research Center
Heidelberg, Germany
Phone: +49 6221 424709
Fax:   +49 6221 42524709
Http:  www.dkfz.de/abt0840/whuber
-------------------------------------

On Thu, 9 Oct 2003, Jing Shen wrote:

> Hi,
>
> I am going to work on affymetrix data analysis using Bioconductor Affy
> package. In my understanding, the procedure for data analysis should be:
> (1) import data (CEL files)
> (2) data filtering ?? --- get rid of bad or false intensities (e.g.,sth
> like filtering on flags, present or absent or expression values in
> GeneSpring)
> (3) data normalization --- several different methods based on probe cell
> level or probe set level...
> (4) now you have the data for statistical analysis...
> I am wondering if anybody can give me some suggestions on data filtering
> before (or after??) my data normalization if I use RMA() or expresso()?
> Or what kind of gene filtering criteria do you guys use? or I don't need
> to do that at all?
>
> Thanks,
> Jing
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at stat.math.ethz.ch
> https://www.stat.math.ethz.ch/mailman/listinfo/bioconductor
>



More information about the Bioconductor mailing list