[BioC] siggenes and permutations
Louisa A Rispoli/AS/EXP/UTIA
larispoli at mail.ag.utk.edu
Thu Jul 10 22:19:53 CEST 2008
Hi all-
We are investigating SAM and EBAM analysis for our data and run into a
problem that reading the archives has not helped any. We performed SAM on a
data set and noticed the permutations randomly selected 100 to generate the
results. We were curious what that meant and if had anything similar to
iterations in SAS (more usually is better). So we tried more and discovered
that it was not better necessarly to us (increased FDR values for same
deltas). And since we do not exactly understand what permutations exactly
are doing in this case we wanted to check with the experts on their opinion
if it should be changed are left at the default of 100.
Thanks for the time and energy
Louisa
"If we knew what we were doing, it wouldn't be called Research." - Albert
Einstein
Louisa Rispoli, Ph.D. Reproductive Physiology
Department of Animal Science
University of Tennessee, Knoxville
A105 Johnson Animal Research and Teaching Unit
1750 Alcoa Highway
Knoxville, TN 37920
phone:(865) 946-1874
fax:(865) 946-1010
email: lrispoli at utk.edu
> library(siggenes)
Loading required package: multtest
Loading required package: survival
Loading required package: splines
> trt <- pData(Poly.rma.filter)$Treatment
> trt
[1] "Ctrl" "Ctrl" "Ctrl" "Ctrl" "Ctrl" "Ctrl" "Ctrl" "Ctrl" "HS" "HS"
"HS" "HS" "HS" "HS" "HS" "HS"
> clPoly <- ifelse(trt=="Ctrl", 0, 1)
> clPoly
[1] 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1
> dataPoly.rma <- exprs(Poly.rma.filter)
> sam.out <-sam(dataPoly.rma, clPoly, rand=123456)
We're doing 12870 complete permutations
and randomly select 100 of them.
> sam.out
SAM Analysis for the Two-Class Unpaired Case Assuming Unequal Variances
Delta p0 False Called FDR
1 0.1 0.666 7190.38 8620 0.556
2 0.2 0.666 6265.49 7932 0.526
3 0.3 0.666 2303.43 3891 0.394
4 0.4 0.666 299.36 660 0.302
5 0.5 0.666 90.83 246 0.246
6 0.6 0.666 34.53 117 0.197
7 0.7 0.666 10.92 44 0.165
8 0.8 0.666 1.06 6 0.118
9 0.9 0.666 0 0 0
10 1.0 0.666 0 0 0
> sam.out1 <- sam(dataPoly.rma, clPoly, rand=123456, B=1000)
We're doing 12870 complete permutations
and randomly select 1000 of them.
> sam.out1
SAM Analysis for the Two-Class Unpaired Case Assuming Unequal Variances
Delta p0 False Called FDR
1 0.1 0.675 7371.715 8705 0.571
2 0.2 0.675 5928.818 7584 0.527
3 0.3 0.675 1275.681 2233 0.385
4 0.4 0.675 255.636 549 0.314
5 0.5 0.675 87.618 228 0.259
6 0.6 0.675 37.511 116 0.218
7 0.7 0.675 12.686 44 0.195
8 0.8 0.675 2.287 8 0.193
9 0.9 0.675 0 0 0
10 1.0 0.675 0 0 0
> sessionInfo()
R version 2.7.1 (2008-06-23)
i386-pc-mingw32
locale:
LC_COLLATE=English_United States.1252;LC_CTYPE=English_United
States.1252;LC_MONETARY=English_United
States.1252;LC_NUMERIC=C;LC_TIME=English_United States.1252
attached base packages:
[1] splines tools stats graphics grDevices utils datasets
methods base
other attached packages:
[1] siggenes_1.14.0 multtest_1.20.0 affycoretools_1.12.0
annaffy_1.12.1 KEGG.db_2.2.0 biomaRt_1.14.0
[7] RCurl_0.9-3 GOstats_2.6.0 Category_2.6.0
RBGL_1.16.0 annotate_1.18.0 xtable_1.5-2
[13] GO.db_2.2.0 AnnotationDbi_1.2.2 RSQLite_0.6-9
DBI_0.2-4 graph_1.18.1 limma_2.14.5
[19] bovinecdf_2.2.0 simpleaffy_2.16.0 gcrma_2.12.1
matchprobes_1.12.0 genefilter_1.20.0 survival_2.34-1
[25] affy_1.18.2 preprocessCore_1.2.0 affyio_1.8.0
Biobase_2.0.1
loaded via a namespace (and not attached):
[1] cluster_1.11.11 XML_1.95-3
More information about the Bioconductor
mailing list