[BioC] negative values in raw data
Wolfgang Huber
huber at ebi.ac.uk
Thu Jun 2 18:25:13 CEST 2005
Hi Guillaume,
some image processing / background correction methods produce negative
estimates. In many cases, the "vsn" method from the package of the same
name can deal with this. It uses the "glog" transformation, and glog(x)
is similar to log(x+x0) with appropriate choice of x0. Please read the
vignette carefully and use the diagnostic plot "meanSdPlot" to see
whether it makes sense on your data.
While true gene abundances are obviously always >=0, it can make sense
to have negative estimates if you have an error model that includes a
Normally (or similar) distributed additive noise term. Because if Y is
the observation, x is the true value, and eps the noise,
Y = x + eps
will be negative half of the times if x=0.
Bw
Wolfgang
Deplaine wrote:
> Hi,
>
> I downloaded data on the GEO web site and now, I try to analyze them
> with R 2.0.1 for windows XP but in this dataset, there is a lot of
> negative values. So when I normalyze with log2 transformation, R
> produced NaNs. I understand why but after, when I want to calculate
> p-value with the parametric t test, R can't. I tried to write 0 where
> there is a negative value : R calculate p-value and q-value but all of
> q-value are at 1.
>
> Is there someone to help me.
> Thanks a lot
>
-------------------------------------
Wolfgang Huber
European Bioinformatics Institute
European Molecular Biology Laboratory
Cambridge CB10 1SD
England
Phone: +44 1223 494642
Fax: +44 1223 494486
Http: www.ebi.ac.uk/huber
More information about the Bioconductor
mailing list