[BioC] Vsn transformation repeatability
Paolo Innocenti
paolo.innocenti at ebc.uu.se
Wed Oct 29 11:20:31 CET 2008
Hi all,
We are attempting to identify significantly differentially expressed
genes between two treatments of female Drosophila melanogaster, using
results from Affymetrix Drosophila2 microarray chip hybridization (8
chips, 4 per treatment). The end result is that the number of
significantly differentially expressed genes for a particular FDR
changes with each run, despite the input data and the program code
remaining unchanged.
We are using ”vsnrma” for pre-processing, which utilizes a search
algorithm to find the maximum likelihood transformed values. The
procedure produces slightly different calibrated GLOG2-transformed
expression values on each run, despite the untransformed expression
values and the R code being identical each time. Our first guess is that
the search algorithm is unable to locate the likelihood peak and settles
on a slightly different set of transformed values each time. Is this a
possibility? If so, is there anything we can do to fix it? E.g. increase
the number of iterations, or modify the search algorithm.
Many thanks in advance.
Paolo Innocenti
> library(affy)
Loading required package: Biobase
Loading required package: tools
Welcome to Bioconductor
Vignettes contain introductory material. To view, type
'openVignette()'. To cite Bioconductor, see
'citation("Biobase")' and for packages 'citation(pkgname)'.
> library(vsn)
Loading required package: lattice
Loading required package: limma
> data <- ReadAffy(celfile.path="./celfiles")
> sampleNames(data) <- c("z1","z2","z3","z4","m1","m2","m3","m4")
> eset1 <- vsnrma(data)
100% done.4 x 8 matrix (1 stratum). 0% done.
Please use 'meanSdPlot' to verify the fit.
Calculating Expression
> eset2 <- vsnrma(data)
100% done.4 x 8 matrix (1 stratum). 0% done.
Please use 'meanSdPlot' to verify the fit.
Calculating Expression
> exprs(eset1)[1,]
z1 z2 z3 z4 m1 m2 m3 m4
12.52630 12.23441 12.54579 12.35582 12.45190 12.35984 12.33933 12.35031
> exprs(eset2)[1,]
z1 z2 z3 z4 m1 m2 m3 m4
12.52422 12.24059 12.53899 12.35949 12.44828 12.35460 12.34465 12.35277
> head(exprs(eset1))==head(exprs(eset2))
z1 z2 z3 z4 m1 m2 m3 m4
1616608_a_at FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
1622892_s_at FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
1622893_at FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
1622894_at FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
1622895_at FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
1622896_at FALSE FALSE FALSE FALSE FALSE FALSE FALSE FALSE
> sessionInfo()
R version 2.8.0 (2008-10-20)
i686-pc-linux-gnu
locale:
LC_CTYPE=en_US.UTF-8;LC_NUMERIC=C;LC_TIME=en_US.UTF-8;LC_COLLATE=en_US.UTF-8;LC_MONETARY=C;LC_MESSAGES=en_US.UTF-8;LC_PAPER=en_US.UTF-8;LC_NAME=C;LC_ADDRESS=C;LC_TELEPHONE=C;LC_MEASUREMENT=en_US.UTF-8;LC_IDENTIFICATION=C
attached base packages:
[1] tools stats graphics grDevices utils datasets methods
[8] base
other attached packages:
[1] drosophila2cdf_2.3.0 vsn_3.8.0 limma_2.16.2
[4] lattice_0.17-15 affy_1.20.0 Biobase_2.2.0
loaded via a namespace (and not attached):
[1] affyio_1.10.0 grid_2.8.0 preprocessCore_1.4.0
--
Paolo Innocenti
Department of Animal Ecology, EBC
Uppsala University
Norbyvägen 18D
75236 Uppsala, Sweden
More information about the Bioconductor
mailing list