[BioC] marray layout
Vanessa Vermeirssen
vanessa.vermeirssen at psb.ugent.be
Wed Jul 4 10:16:32 CEST 2007
Dear,
I have some questions about the marray Layout object. I try to upload
public microarray
data in R in order to create a microarray compendium.
I have used an SMD database file to create the targets, probes and data
objects for marray.
The array has 21632 spots, however in the database file I only have
information for 21504 spots.
Moreover, some of the probes are not matchable to a gene and therefore
for those I would like to
neglect their expression values. I did this now by parsing the database
file and setting the expression values
at NA for those "non-matchable" probes.
Hence I have 21504 probes, of which 801 are controls and 784 non-matchable
(NAs in their expression values).
I tried to declare this with maSub that I use only part of the array
(21504 spots and not 21632).
My maControls only considers the 21504 spots.
I read in the whole thing and then look at the graphs created by one
color "image". There seems to be a lot
of white spots in my arrays and they are not consistent in the same
place on the array ( although all the NAs
refer to the same 784 probes in the 3 arrays, see code below).
I was wondering if this has something to do with this maSub... and that
it put these NAs (21632-21504, only about 100 ones) randomly on the array?
*How can I correctly read in my marrayLayout?*
I only want to use image to see if there is some spatial gradient in my
arrays. If htere is, I need to use
background substraction, otherwise I won't.
This is the code I use...I am reading in 3 arrays of the format Chalfie379.
In blue the output I get from R.
The pdf file with the M images can be found at
http://www.psb.ugent.be/~vamei/Docs/Chalfie379_image.pdf
library(limma)
library(marray)
library("Biobase")
Chalfie379targets=read.marrayInfo("file_A-SMDB-379", info.id=c(2,4,5),
labels=c(2))
summary(Chalfie379targets)
Object of class marrayInfo.
maLabels SMDfile green Cy3 CH1 red Cy5 CH2
1 new17040.xls new17040.xls wt sorted touch cell mutant sorted touch cell
2 new17041.xls new17041.xls wt sorted touch cell mutant sorted touch cell
3 new17042.xls new17042.xls wt sorted touch cell mutant sorted touch cell
Number of labels: 3 Dimensions of maInfo matrix: 3 rows by 3 columns
Notes:
file_A-SMDB-379
file379 <- as.vector(Chalfie379targets at maInfo[,1])
Chalfie379.gnames <- read.marrayInfo(file379[1],info.id=c(1,2,3), labels=2)
summary(Chalfie379.gnames)
Object of class marrayInfo.
maLabels Spot Name ID
1 AC3.5 1 AC3.5 AC3.5
2 AC3.8 2 AC3.8 AC3.8
3 AH6.6 3 AH6.6 AH6.6
4 B0001.3 4 B0001.3 B0001.3
5 B0024.1 5 B0024.1 B0024.1
6 B0024.4 6 B0024.4 B0024.4
7 C06H5.1 7 C06H5.1 C06H5.1
8 C07A9.5 8 C07A9.5 C07A9.5
9 C07G2.1 9 C07G2.1 C07G2.1
10 C08B6.4 10 C08B6.4 C08B6.4
...
Number of labels: 21504 Dimensions of maInfo matrix: 21504 rows by
3 columns
Notes:
new17040.xls
Chalfie379.L <- new("marrayLayout", maNgr=8, maNgc=4, maNsc=26, maNsr=26)
maControls(Chalfie379.L) <- Chalfie379.gnames at maInfo[,3]
length(Chalfie379.L at maControls)
S <- c(1:maNspots(Chalfie379.L))
T <- Chalfie379.gnames at maInfo[,1]
Z <- S %in% T
maSub(Chalfie379.L) <- Z
ctl <- rep("control", length(T))
ctl[maControls(Chalfie379.L) != "control"] <- "probe"
maControls(Chalfie379.L) <- factor(ctl)
summary(Chalfie379.L)
Array layout: Object of class marrayLayout.
Total number of spots: 21632
Dimensions of grid matrix: 8 rows by 4 cols
Dimensions of spot matrices: 26 rows by 26 cols
Currently working with a subset of 21504spots.
Control spots:
There are 2 types of controls :
control probe
801 20703
Notes on layout:
Chalfie379.raw <- read.marrayRaw(file379,name.Gf="CH1_I",
name.Gb="CH1_B", name.Rf="CH2_I",
name.Rb="CH2_B", gnames=Chalfie379.gnames, layout=Chalfie379.L,
targets=Chalfie379targets, DEBUG=TRUE)
pdf("Chalfie379_image.pdf")
image(Chalfie379.raw, col=1, main="1")
image(Chalfie379.raw[,2], col=1, main="2")
image(Chalfie379.raw[,3], col=1, main="3")
dev.off()
myM <- Chalfie379.raw[,2]@maGf /Chalfie379.raw[,2]@maRf
sum(is.na(myM))
[1] 784
> summary(which(is.na( Chalfie379.raw[,3]@maGf )))
Min. 1st Qu. Median Mean 3rd Qu. Max.
22 5564 10690 10770 15920 21450
> summary(which(is.na( Chalfie379.raw[,2]@maGf )))
Min. 1st Qu. Median Mean 3rd Qu. Max.
22 5564 10690 10770 15920 21450
> summary(which(is.na( Chalfie379.raw[,1]@maGf )))
Min. 1st Qu. Median Mean 3rd Qu. Max.
22 5564 10690 10770 15920 21450
Thank you so much for your help!
Best regards,
Vanessa Vermeirssen
--
--
==================================================================
Vanessa Vermeirssen, PhD
Tel:+32 (0)9 331 38 23 fax:+32 (0)9 3313809
VIB Department of Plant Systems Biology, Ghent University
Technologiepark 927, 9052 Gent, BELGIUM
vamei at psb.ugent.be http://www.psb.ugent.be
More information about the Bioconductor
mailing list