[BioC] Multifactorial analysis with RMA and LIMMA of Affymetrix microarrays

Wed Mar 17 01:32:16 MET 2004

At 07:55 AM 17/03/2004, Jordi Altirriba Gutiérrez wrote:
>(Sorry, but I've had some problems with the HTML)
>Hello all!
>I am a beginner user of R and Bioconductor, sorry if my questions have 
>already been discussed previously.
>I am studying the effects of a new hypoglycaemic drug for the treatment of 
>diabetes and I have done this classical study:
>4 different groups:
>1.- Healthy untreated
>2.- Healthy treated
>3.- Diabetic untreated
>4.- Diabetic treated
>With 3 biological replicates of each group, therefore I have done 12 
>arrays (Affymetrix).
>I have treated the raw data with the package RMA of Bioconductor according 
>to the article ”Exploration, normalization and summaries of high density 
>oligonucleotide array probe level data” (Background=RMA, 
>Normalization=quantiles, PM=PMonly, Summarization=medianpolish).
>I am currently trying to analyse the object eset with the package LIMMA of 
>Bioconductor. I want to know what genes are differentially expressed due 
>to diabetes,  to the treatment and to the combination of both (diabetes + 
>treatment), being therefore an statistic analysis similar to a two-ways ANOVA).
>So, my questions are:
>1.- I have created a PhenoData in RMA, will the covariates of the 
>PhenoData have any influence in the analysis of LIMMA?

Not automatically.

>2.- Are these commands correct to get these results? (see below) In the 
>command TopTable, the output of coef=1 are the genes characteristics of 
>diabetes?

No.

>3.- If I do not see any effect of the treatment in the healthy untreated 
>rats should I design the matrix differently? Something similar to a 
>one-way-ANOVA, considering differently the four groups:
>( > design<-model.matrix(~ -1+factor(c(1,1,1,2,2,2,3,3,3,4,4,4)))  ).

This design matrix would be very much better, i.e., it would be a correct 
matrix. You could then use contrasts to test for differences and 
interaction terms between your four groups, and that would do the job.

If you tell us what's in your phenoData slot, i.e., type pData(eset), then 
we might be able to suggest another approach analogous to the classical 
two-way anova approach.

Gordon

>5.- Any other idea?
>Thank you very much for your time and your suggestions.
>Yours sincerely,
>
>Jordi Altirriba (PhD student, Hospital Clínic  IDIBAPS, Barcelona, Spain)
>
>>design<-cbind("disease"=c(1,1,1,1,1,1,0,0,0,0,0,0),"treatment"=c(0,0,0,1,1,1,0,0,0,1,1,1))
>>fit<-lmFit(eset,design)
>>contrast.matrix<-cbind("diabetes"=c(1,0),"drug"=c(0,1),"diabetes-drug"=c(1,1))
>>rownames(contrast.matrix)<-colnames(design)
>>design
>      disease treatment
>[1,]        1           0
>[2,]        1           0
>[3,]        1           0
>[4,]        1           1
>[5,]        1           1
>[6,]        1           1
>[7,]        0           0
>[8,]        0           0
>[9,]        0           0
>[10,]        0           1
>[11,]        0           1
>[12,]        0           1
>>contrast.matrix
>            diabetes drug diabetes-drug
>diabetes      1         0             1
>tratamiento   0         1             1
>>fit2<-contrasts.fit(fit,contrast.matrix)
>>fit2<-eBayes(fit2)
>>clas<-classifyTests(fit2)
>>vennDiagram(clas)
>>topTable(fit2,number=100,genelist=geneNames(eset),coef=1,adjust="fdr")
>                     Name        M        t      P.Value        B
>11590          1387471_at 19.32575 9.442926 2.743122e-17 29.95299
>2500           1369951_at 19.24748 9.404683 2.743122e-17 29.66737
>10652          1384778_at 19.22968 9.395981 2.743122e-17 29.60254
>

>>sink("limma-diabetes.txt")
>>topTable(fit2,number=1000,genelist=geneNames(eset),coef=1,adjust="fdr")
>>sink()