[BioC] Problem with limma for two-color factorial design - Coefficients not estimable
Stephen Turner
vustephen at gmail.com
Wed Oct 19 19:32:12 CEST 2011
Hello. This is my first time posting to the mailing list - usually
I've been able to find answers in the archives here or elsewhere, but
I have a problem that I haven't been able to solve, so here goes.
I have microarray data I downloaded from Array Express using the
ArrayExpress function in the ArrayExpress package. I managed to
convert the NChannelSet object into an RGList object that I can use in
limma using a tip I found in the archives
(https://stat.ethz.ch/pipermail/bioconductor/2009-September/029705.html).
After some bg subtraction and normalization, I have an MAList object
with 32 arrays (red and green) by 34,944 probes.
The experiment consists of two cell types (WT or KO), by 4 treatments
(control, CD70, CD80, CD70+CD80), at four different time points (2, 4,
8, 14 hours), with a dye-swap. I'm mostly interested in genes that are
differentially expressed when WT cells are hit with CD70+CD80 versus
CD80 alone.
I'm trying to create a targets file so I can use modelMatrix to create
my design matrix. But I have a multifactorial design using a two color
dye-swap design, and I'm not sure how to specify this in the targets
file. The limma manual has information about factorial designs, but no
examples for two-color experiments. I have the following factors:
1: Celltype: WT or KO
2: Treatment: control (x), CD70, CD80, or CD70+CD80 (CD7080).
3: Timepoints: 2, 4, 8, and 14 hours.
I tried collapsing all of factors and levels into a single string in
my targets file:
> targets
array Cy3 Cy5
1 array3329 WT.CD70.2 WT.x.2
2 array2675 KO.CD7080.8 WT.CD7080.8
3 array2242 WT.CD7080.2 KO.CD7080.2
4 array3328 WT.x.2 WT.CD70.2
5 array3310 WT.x.8 WT.CD70.8
6 array2246 KO.CD7080.2 WT.CD7080.2
7 array3337 WT.CD7080.4 WT.CD80.4
8 array3323 WT.CD7080.14 WT.CD80.14
9 array2673 KO.CD70.8 WT.CD70.8
10 array1938 WT.CD7080.14 KO.CD7080.14
11 array2240 WT.CD70.2 KO.CD70.2
12 array3336 WT.CD80.4 WT.CD7080.4
13 array3322 WT.CD80.14 WT.CD7080.14.2
14 array2674 WT.CD7080.8 KO.CD7080.8
15 array3321 WT.CD70.14 WT.x.14
16 array2241 KO.CD70.2 WT.CD70.2
17 array2597 KO.CD7080.4 WT.CD7080.4
18 array3313 WT.CD7080.8 WT.CD80.8
19 array1939 KO.CD7080.14 WT.CD7080.14.2
20 array3335 WT.CD70.4 WT.x.4
21 array2672 WT.CD70.8 KO.CD70.8
22 array3320 WT.x.14 WT.CD70.14
23 array3334 WT.x.4 WT.CD70.4
24 array3311 WT.CD70.8 WT.x.8
25 array3331 WT.CD7080.2 WT.CD80.2
26 array1941 KO.CD70.14 WT.CD70.14.2
27 array2588 WT.CD70.4 KO.CD70.4
28 array2596 WT.CD7080.4 KO.CD7080.4
29 array3330 WT.CD80.2 WT.CD7080.2
30 array3312 WT.CD80.8 WT.CD7080.8
31 array1940 WT.CD70.14.2 KO.CD70.14
32 array2593 KO.CD70.4 WT.CD70.4
I then created a design matrix, where the reference group is WT cells,
untreated, 2-hour timepoint.
design <- modelMatrix(targets, ref="WT.x.2")
This creates a 32x25 design matrix, but when I fit the model, I get a
"Coefficients not estimable" error/warning:
> fit <- lmFit(d, design)
Coefficients not estimable: WT.CD70.14.2 WT.CD80.14 WT.CD80.2
WT.CD80.4 WT.CD80.8 WT.x.14 WT.x.4 WT.x.8
Warning message:
Partial NA coefficients for 34944 probe(s)
Any help with how I can specify this multifactorial time-course design
in a two-channel dye-swap experiment would be greatly appreciated. As
I said, I'm most interested in the conditions on arrays 8-9, 13-14,
19, 26, and 30-31, where I'm looking at WT cells treated with CD70 and
CD80 versus CD80 alone.
Thanks in advance for any help!
More information about the Bioconductor
mailing list