[BioC] Changes in siggenes?

Torbjörn Klatt torbjoern.k at googlemail.com
Thu Jan 7 17:12:16 CET 2010


Hello Bioconductor followers,

I'm quite new to R and Bioconductor and as a student in biomathematics and 
attending a paper of computational biology I'm asked to identify significantly 
expressed genes by a SAM analysis. We are working on the raw data set provided 
by Wang [1] at GEO [2] and try do reproduce their procedure and analysis for 
practical training.

For our first attempts we worked on one of the university's computers running 
MS Windows XP Professional 32bit with SP3 and R 2.9.2 (2009-08-24). By reading 
the help file for the sam() command we experimented a little bit and used the 
attributes "delta" and "p0" amongst others. It worked well and we were rather 
satisfied with the results.

Afterwards I installed R and Bioconductor on my own computer at home running 
openSUSE 11.2 (64bit) with kernel 2.6.31.5-0.1 and R 2.10.1 (2009-12-14) with 
additional Bioconductor packages installed directly from the repository on 
2010/01/05. I configured and compiled R by myself using gcc 4.4.1 [3] and it's 
complementary f77 [4] with no additional parameters.

I tried to run the same script we wrote in the university's computers and it 
broke with some error messages.
One of the errors was caused by the rma() command. This one could be fixed by 
my professor. It seemed that there have been some changes in the affy package. 
The fixed command (see file sam_analysis.r further down) is working on the old 
university computers as well.

But the sam() command still did not work with the parameters "delta" and "p0" 
and I got a error message saying that there are "unused parameters 'delta', 
'p0'.
Although we could find a way without using these I would like to know since 
when and why these parameters are not longer supported by sam(). I could not 
find a very detailed change log of the siggenes package except of this one 
(http://fgc.lsi.umich.edu/cgi-bin/blosxom.cgi/siggenes), so I'm writing this 
email.

Cheers,
Torbjoern Klatt

PS: If this email is getting through to the list, it seems not possible to 
send PGP-signed mails. Is that right?


--- file: sam_analysis.r ---
#################################################
# R-Script
# author: Torbjoern Klatt
# subject: Bioinformatik
# project: SAM analysis 
# date created: 2010-01-05
# date edited last: 2010-01-06
#################################################

# set the working directory
setwd("/home/myself/Documents/Wissen/Uni/rac/0910_ws/Bioinformatik/Praktikum/wd")

# load required libraries
library(affy)           # for Affymetrix chips
library(siggenes)       # identifying significant genes
library(hgu133a2.db)    # to map the affy probe names to gene names
library(hopach)         # clustering

# read data about the phenotype (here dasatinib sensitivity)
cell.lines <- read.csv("sensitivity.csv",row.names=1,header=TRUE,sep=",")
pheno <- as.data.frame(cell.lines[,2],row.names=row.names(cell.lines))
names(pheno) <- c("sensitivity")

# read the cell files and assign phenotype information
wangData <- ReadAffy(phenoData=pheno)

## old version of previous line
### wangData <- ReadAffy()
### wangData at phenoData<-as(pheno, "AnnotatedDataFrame")

# background correction, generation of expression values and normalization
wangExpr <- rma(wangData)

# look at the expression data
sampleNames(wangExpr)
featureNames(wangExpr)
description(wangExpr)
pData(phenoData(wangExpr))
dim(exprs(wangExpr))
head(exprs(wangExpr))

# here one should do some more quality control, but this is omitted for now

# add the analysis here
# with a 'rand' value of '123' the p0 in the SAM analysis will be 0.5
sam.out <- sam(exprs(wangExpr), pData(phenoData(wangExpr))[,1], method=d.stat, 
B=500, rand=123)
## old version of previous line
### sam.out <- sam(exprs(wangExpr), pData(phenoData(wangExpr))[,1], 
method=d.stat, delta=seq(from=1.0 to=2.0 by=0.1), p0=0.5,B=500)
delta <- findDelta(sam.out,fdr=0.05)
genes <- list.siggenes(sam.out,delta[1,1])
--- END: file---

--- sessionInfo() on my linux machine ---
R version 2.10.1 (2009-12-14)
x86_64-unknown-linux-gnu

locale:
 [1] LC_CTYPE=de_DE.UTF-8       LC_NUMERIC=C
 [3] LC_TIME=de_DE.UTF-8        LC_COLLATE=de_DE.UTF-8
 [5] LC_MONETARY=C              LC_MESSAGES=de_DE.UTF-8
 [7] LC_PAPER=de_DE.UTF-8       LC_NAME=C
 [9] LC_ADDRESS=C               LC_TELEPHONE=C
[11] LC_MEASUREMENT=de_DE.UTF-8 LC_IDENTIFICATION=C

attached base packages:
[1] splines   stats     graphics  grDevices utils     datasets  methods
[8] base

other attached packages:
 [1] hgu133a2cdf_2.5.0   hopach_2.6.0        cluster_1.12.1
 [4] hgu133a2.db_2.3.5   org.Hs.eg.db_2.3.6  RSQLite_0.8-0
 [7] DBI_0.2-5           AnnotationDbi_1.8.1 siggenes_1.20.0
[10] multtest_2.2.0      affy_1.24.2         Biobase_2.6.1

loaded via a namespace (and not attached):
[1] affyio_1.14.0        MASS_7.3-5           preprocessCore_1.8.0
[4] survival_2.35-8      tools_2.10.1
--- END: sessionInfo() ---

--- sessionInfo() on the WinXP machine ---
R version 2.9.2 (2009-08-24) 
i386-pc-mingw32 

locale:
LC_COLLATE=German_Germany.1252;LC_CTYPE=German_Germany.1252;LC_MONETARY=German_Germany.1252;LC_NUMERIC=C;LC_TIME=German_Germany.1252

attached base packages:
[1] splines   stats     graphics  grDevices utils     datasets  methods  
[8] base     

other attached packages:
 [1] hopach_2.6.0        cluster_1.12.0      hgu133a2.db_2.2.11 
 [4] RSQLite_0.8-0       DBI_0.2-5           AnnotationDbi_1.6.1
 [7] siggenes_1.18.0     multtest_2.2.0      affy_1.22.1        
[10] Biobase_2.4.1      

loaded via a namespace (and not attached):
[1] affyio_1.12.0        MASS_7.2-48          preprocessCore_1.6.0
[4] survival_2.35-4
--- END: sessionInfo()

--- references ---
[1] Xi-De Wang, Karen Reeves, Feng R Luo, Li-An Xu, Francis Lee, Edwin Clark, 
Fei Huang. (2007). Identification of candidate predictive and surrogate 
molecular markers for dasatinib in prostate cancer: rationale for patient 
selection and efficacy monitoring. Genome biology 8 (11) p. R255 
http://www.ncbi.nlm.nih.gov/pubmed/18047674
[2] http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE9633
[3] extracted from config.log: gcc version 4.4.1 [gcc-4_4-branch revision 
150839] (SUSE Linux)
[4] extracted from config.log: GNU Fortran (SUSE Linux) 4.4.1 [gcc-4_4-branch 
revision 150839]



More information about the Bioconductor mailing list