[BioC] must one test all that is included in normalization?

Naomi Altman naomi at stat.psu.edu
Thu Jun 7 19:23:55 CEST 2007


Once the data are normalized, one uses only the genes that you are 
interested in - whether because the others are from a different 
species, or for any other reason that can be stated a priori without 
reference to the data at hand.

--Naomi

At 04:45 PM 6/6/2007, Maurice Melancon wrote:
>Hello,
>
>I worked with a single-colour multi-species array.  The dataset had limited
>options for transformation but disparately needed it, and vsn fit the bill
>nicely.  I was advised to include all non-empty spots in the normalization,
>even those that I am certain show hybridization due to non-target binding,
>because all spots contain useful information.  To qualify for a spot being
>'present' or 'absent', the creators of the array used the criteria of
>intensity above 2 SD of empty spots as their criteria.  This has proven to
>be far too liberal for my purposes, considering the nature of the array, and
>considering that while the dataset was in desparate need of scaling, the
>empty spots have little to no hybridization and thus the criteria for
>dropping spots prior to normalization or anything else is lax (imho).
>
>With vsn dataset in hand, the consensus seems to be among my collaborators
>to count only those spots that meet a certain criteria based upon the vsn
>data as present, and to test only those for differential expression.  One
>then calculates FDR based upon this subset of the data.  Is this fair?  Can
>you ignore spots used in normalization because you know them to be
>unreliable probes?  Or must this subset be eliminated prior to
>transformation?
>
>This is along a similar line to something posted earlier, where someone
>asked if it was allowable to normalize only a subset of the spots on an
>array, and to me the bigger question is how to fairly exclude spots that you
>are not interested in.
>
>With thanks,
>
>Maurice
>
>         [[alternative HTML version deleted]]
>
>_______________________________________________
>Bioconductor mailing list
>Bioconductor at stat.math.ethz.ch
>https://stat.ethz.ch/mailman/listinfo/bioconductor
>Search the archives: 
>http://news.gmane.org/gmane.science.biology.informatics.conductor

Naomi S. Altman                                814-865-3791 (voice)
Associate Professor
Dept. of Statistics                              814-863-7114 (fax)
Penn State University                         814-865-1348 (Statistics)
University Park, PA 16802-2111



More information about the Bioconductor mailing list