[BioC] Single channel (numerical intensities) data import into BioConductor

Kort, Eric Eric.Kort at vai.org
Fri May 9 05:01:17 CEST 2008


2008/5/8 Dozmorov, Mikhail G. (HSC) <Mikhail-Dozmorov at ouhsc.edu>:

> We receive microarray data from a processing facility in a simple format of
> background-subtracted intensities. Here's an example, tab delimited:
>
> Gene ID C1860gu C1775gu C1777gu
> AA278251        100.825641      82.30144928     222.4144928
> AA401404        383.6702703     374.1666667     342.3405797
> AA454191        100.7403727     175.2884199     135.4404762
> AA460836        343.9164875     366.684058      351.8411658
> AA723761        445.059587      451.999359      355.5185897
> AA902654        400.1301282     431.4055556     367.1327381
> AA905415        200.9855072     208.0207576     218.5183983
>
> Thus, for each array (C1860gu,...) we have corresponding gene expression
> data. How do we import this data into an ESET object for further handling?

?ExpressionSet is helpful; it will lead you in the following direction. (Try to ensure your file, here named data.txt, is free of extraneous white space, since this may cause read.delim to interpret numerical data as character.)

library(Biobase)
exp <- new("ExpressionSet", exprs=as.matrix(read.delim("c:/data.txt", row.names=1)))

# that's all that is required to accomplish what you wanted, but here I will create
# some fictitious and random pheno data to illustrate adding pheno data:
pheno <- matrix(runif(12, 1, 10), nrow=3)

# note that nrow(pheno) == ncol(exprs(exp)), as it must.
rownames(pheno) <- colnames(exprs(exp));  colnames(pheno) <- c("WBC", "HgB", "AFP", "Age")
pheno <- new("AnnotatedDataFrame", data=as.data.frame(pheno))
exp <- new("ExpressionSet", exprs=as.matrix(read.delim("c:/data.txt", row.names=1)), phenoData=pheno)
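With real pheno data the rows often arrive in a different order than the array columns, so it may be worth aligning them by name before building the object. The following is only a sketch: the small matrix `m` and the pheno table `pd` are made up for illustration (they are not the poster's file), and the Biobase step is skipped if the package is not installed.

```r
# Made-up expression matrix (2 genes x 3 arrays); values are illustrative only.
m <- matrix(c(100.8, 383.7, 82.3, 374.2, 222.4, 342.3), nrow = 2,
            dimnames = list(c("AA278251", "AA401404"),
                            c("C1860gu", "C1775gu", "C1777gu")))

# Made-up pheno table whose rows are deliberately in a different order.
pd <- data.frame(WBC = c(6.9, 8.7, 2.3), Age = c(8.8, 7.9, 6.0),
                 row.names = c("C1777gu", "C1860gu", "C1775gu"))

# Fail early if the two sets of sample names differ, then reorder pheno rows
# so that they line up with the expression columns.
stopifnot(setequal(rownames(pd), colnames(m)))
pd <- pd[colnames(m), , drop = FALSE]
aligned <- identical(rownames(pd), colnames(m))

# Build the ExpressionSet only when Biobase is available.
if (requireNamespace("Biobase", quietly = TRUE)) {
  eset <- Biobase::ExpressionSet(assayData = m,
                                 phenoData = Biobase::AnnotatedDataFrame(pd))
  print(Biobase::sampleNames(eset))
}
```

Reordering by name rather than by position means a shuffled pheno file cannot silently attach the wrong covariates to an array.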

# let's check our work
pData(exp)
             WBC      HgB      AFP      Age
C1860gu 6.934886 7.284751 2.676714 8.815719
C1775gu 8.744655 7.067929 4.415327 7.904734
C1777gu 2.271367 4.686928 3.211371 6.035077

exprs(exp)
          C1860gu   C1775gu  C1777gu
AA278251 100.8256  82.30145 222.4145
AA401404 383.6703 374.16667 342.3406
AA454191 100.7404 175.28842 135.4405
AA460836 343.9165 366.68406 351.8412
AA723761 445.0596 451.99936 355.5186
AA902654 400.1301 431.40556 367.1327
AA905415 200.9855 208.02076 218.5184
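On the earlier caveat about numerical data being read as character: one way to catch that before building the ExpressionSet is to check the column types right after the import. This is a minimal sketch using base R only; the inline string `txt` is a stand-in for data.txt (first three rows of the posted table), and the variable names are my own.

```r
# Inline stand-in for data.txt: the first three rows of the posted table.
txt <- "Gene ID\tC1860gu\tC1775gu\tC1777gu
AA278251\t100.825641\t82.30144928\t222.4144928
AA401404\t383.6702703\t374.1666667\t342.3405797
AA454191\t100.7403727\t175.2884199\t135.4404762"

# Same call as for a file, but reading from the string above.
dat <- read.delim(text = txt, row.names = 1)

# Every remaining column should be numeric; a character (or factor) column
# means something in the file kept read.delim from parsing it as numbers.
bad <- names(dat)[!vapply(dat, is.numeric, logical(1))]
if (length(bad) > 0) stop("non-numeric columns: ", paste(bad, collapse = ", "))

# Safe to convert now; as.matrix on all-numeric columns gives a numeric matrix.
m <- as.matrix(dat)
```

If the check stops with an error, inspecting the named columns in the raw file is usually quicker than debugging the ExpressionSet call afterwards.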

Cheers,
Eric

> Thank you!
>
>
>





