[BioC] RMA normalization and MAS5.0 detection calls
Sarwar, Rizwan
rizwan.sarwar at csc.mrc.ac.uk
Fri Dec 8 13:06:36 CET 2006
Hi all
Regardless of whichever approach is being used, a cut-off needs to be
applied. What sort of numbers do you use as your thresholds for variance
or CV? What %age (roughly) of probesets does it exclude?
I have to admit using simple RMA signal of 150 (again another arbitrary
figure) for the higher of the 2 means in the group, and have found that
this eliminates a lot of noise
Regards
Rizwan
Dr Rizwan Sarwar
Physiological Genomics & Medicine
MRC-Clinical Sciences Centre
Imperial College London (Hammersmith Campus)
Du Cane Rd W12 0NN
-----Original Message-----
From: James W. MacDonald [mailto:jmacdon at med.umich.edu]
Sent: 07 December 2006 16:19
To: Jenny Drnevich
Cc: bioconductor at stat.math.ethz.ch
Subject: Re: [BioC] RMA normalization and MAS5.0 detection calls
Hi Jenny,
Jenny Drnevich wrote:
> Hi all,
>
> Actually, the problem of the statistical results not always making
sense
> with the P/A calls also happens with MAS5 values, not RMA or GCRMA
> specifically. Many years ago I as using Affy's two sample comparison
in
> the MAS 5.0 software, and I noticed that a probeset would be called
"A"
> in sample 1 and "P" in sample 2, but supposedly sample 2 had higher
> expression than sample 1!! The calls and the expression level
> comparisons sometimes don't correspond, but this is because they use
> different algorithms and values in their computation, as Jim
explained.
> I tend to like and use the calls in a conservative matter, but they
may
> only be about 85% accurate (Choe et al. Genome Biology 2005, 6:R16).
>
>> Another way to approach filtering probesets is based on the
variability
>> of the probesets over all samples. If the variance is low (below some
>> constant c), then you might assume that the gene is not
differentially
>> expressed in any samples (which is different than saying it is
expressed
>> or not). These genes are uninteresting by definition, and can be
removed
>> from the dataset.
>
>
> I still haven't convinced myself that I like this approach. And
wouldn't
> it be better to filter on CV, which takes into account expression
level,
> rather than variance? I know there was a recent exchange on what sort
of
> cutoff value to use... I really need to find the time to play around
> with filtering on some aspect of variability - unless something has
been
> published on it?
I think you want to use CV for data that show a mean/variance
dependence. With RMA (and I suppose GCRMA) values, most of the
dependence has been decoupled by taking logs. For instance, a plot of
mean expression vs variance usually shows nearly constant variance
except at the tails, where the variance appears to go down precipitously
(which CV won't affect anyway).
Best,
Jim
>
> Cheers,
> Jenny
>
>
>
>
>> HTH,
>>
>> Jim
>>
>>
>> >
>> >
>> > Regards!
>> >
>> >
>> > haiyan
>> >
>> > [[alternative HTML version deleted]]
>> >
>> > _______________________________________________
>> > Bioconductor mailing list
>> > Bioconductor at stat.math.ethz.ch
>> > https://stat.ethz.ch/mailman/listinfo/bioconductor
>> > Search the archives:
>> http://news.gmane.org/gmane.science.biology.informatics.conductor
>>
>>
>> --
>> James W. MacDonald, M.S.
>> Biostatistician
>> Affymetrix and cDNA Microarray Core
>> University of Michigan Cancer Center
>> 1500 E. Medical Center Drive
>> 7410 CCGC
>> Ann Arbor MI 48109
>> 734-647-5623
>>
>>
>> **********************************************************
>> Electronic Mail is not secure, may not be read every day, and should
>> not be used for urgent or sensitive issues.
>>
>> _______________________________________________
>> Bioconductor mailing list
>> Bioconductor at stat.math.ethz.ch
>> https://stat.ethz.ch/mailman/listinfo/bioconductor
>> Search the archives:
>> http://news.gmane.org/gmane.science.biology.informatics.conductor
>
>
> Jenny Drnevich, Ph.D.
>
> Functional Genomics Bioinformatics Specialist
> W.M. Keck Center for Comparative and Functional Genomics
> Roy J. Carver Biotechnology Center
> University of Illinois, Urbana-Champaign
>
> 330 ERML
> 1201 W. Gregory Dr.
> Urbana, IL 61801
> USA
>
> ph: 217-244-7355
> fax: 217-265-5066
> e-mail: drnevich at uiuc.edu
--
James W. MacDonald, M.S.
Biostatistician
Affymetrix and cDNA Microarray Core
University of Michigan Cancer Center
1500 E. Medical Center Drive
7410 CCGC
Ann Arbor MI 48109
734-647-5623
**********************************************************
Electronic Mail is not secure, may not be read every day, and should not
be used for urgent or sensitive issues.
More information about the Bioconductor
mailing list