[BioC] filter on Human Gene U133 Plus 2

Wolfgang Huber whuber at embl.de
Sat Nov 6 09:52:10 CET 2010


Il Nov/2/10 2:54 PM, wxu ha scritto:
>    Use MAS5, RMA, or other algorithms would be fine. The point is whether
> the gene filtering is proper. The raw p-values are not changed after
> filtering. The gene filtering only makes the FDR looking nicer.
> Different person may report a different FDR for the same gene list even
> with the same data set because of the different number filtered. If you
> want to get lower and nicer FDR, filter and reduce the total gene number
> to a small number. Do you think that is kind of cheating :)

Hi Wayne, Naima

in a nutshell, filtering is able to increase detection power, while 
maintaining type 1 error, if the filtering criterion is correlated with 
your test statistic (e.g. t-statistic) under the test's alternative, but 
independent from it under the null.

There are some important technical points on the application of this 
statement to different concrete test statistics, filtering criteria and 
type 1 error measures, which are discussed in the paper "Independent 
filtering increases detection power for high-throughput experiments", 
http://www.pnas.org/content/107/21/9546.long


	Best wishes
	Wolfgang


>
>
> Wayne
> --
>
> On 11/2/2010 4:27 AM, Naïma Oumouhou wrote:
>> Wayne,
>> Thank you for your response.
>> I thought it was a good idea to use the Detection MAS5 algorithm
>> because I remove genes not expressed or expressed a little.
>> Best regards
>>
>>   Le 29/10/2010 19:25, wxu a écrit :
>>> Naima,
>>> Why do you need to filter genes? For bad quality or some other
>>> reasons in your mind? Read my recent paper you may have an idea:
>>> http://www.ncbi.nlm.nih.gov/pubmed/20846437
>>>
>>> Wayne
>>> --
>>> Naïma Oumouhou wrote:
>>>> Dear Christian,
>>>>
>>>> I'm sorry to bother you again : I've got a question about filter on
>>>> Affymetrix Human gene U133 Plus 2 array.
>>>> I would like to find differentiallty expressed genes between 2
>>>> groups of patients (n1=7 and n2=6).
>>>> I have no experience in microarray analyses.
>>>> I read several publications and your xps vignettes but I don't know
>>>> what I have to do.
>>>> Some people filtered probesets using Detection MAS5 call:probesets
>>>> that aren't expressed in at least one sample using the  Detection
>>>> MAS5 algorithm are discarded.
>>>> What do you think about this filter?not tight enough?
>>>> After this filter and with my dataset,I still have 26 495
>>>> probesets?Is it too much?
>>>> Furthermore, in these remaining probesets, there are affymetrixx
>>>> control probesets. These probesets have to be removed?At which step?
>>>> After, I use the moderated t-statistics with BH correction. I find
>>>> no differentiallty expressed genes.
>>>> I wonder if I have to reduce the number of probesets with another
>>>> filter or with a "Detection filter" tighter?
>>>> Thanks for any help.
>>>> Naïma
>>>> Saisissez du texte, l'adresse d'un site Web ou importez un document
>>>> à traduire.<http://translate.google.fr/?tr=f&hl=fr>
>>>> <http://translate.google.fr/?tr=f&hl=fr>
>>>> Annuler<http://translate.google.fr/?tr=t&hl=fr>
>>>> <http://translate.google.fr/?tr=t&hl=fr>
>>>>
>>>>
>>>>      [[alternative HTML version deleted]]
>>>>
>>>>
>>>> ------------------------------------------------------------------------
>>>>
>>>>
>>>> _______________________________________________
>>>> Bioconductor mailing list
>>>> Bioconductor at stat.math.ethz.ch<mailto:Bioconductor at stat.math.ethz.ch>
>>>> https://stat.ethz.ch/mailman/listinfo/bioconductor
>>>> Search the archives:
>>>> http://news.gmane.org/gmane.science.biology.informatics.conductor
>>>
>>>
>>>
>>>
>>
>
>
> 	[[alternative HTML version deleted]]
>
>
>
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at stat.math.ethz.ch
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor



More information about the Bioconductor mailing list