[BioC] Biobase geneData, geneRecommender data object
R.T. Sweeney
rtswee2 at stanford.edu
Wed Jul 13 22:09:52 CEST 2011
Hi,
I am trying to use the package geneRecommender (along with Biobase) to find genes that co-vary with a query set of genes.
I am struggling just to load my own data and the documentation is somewhat brief.
The example uses the geneData prepackaged in Biobase. I have read Biobase documentation and geneRecommender
documentation and searched extensively online for help.
The code appears straightforward:
> library(geneRecommender)
> data(geneData)
> my.query <- c("31613_at", "31712_at", "31497_at")
> normalized.data <- gr.normalize(geneData)
> gr.main(normalized.data, my.query, ngenes = 10
I do not understand how to get my data into a form that will load like data(geneData).
When you load the Biobase library the object geneData cannot by found with typeof or attributes.
> typeof(geneData)
Error in typeof(geneData) : object 'geneData' not found
> class(geneData)
Error: object 'geneData' not found
However, once you run the command data(geneData) then typeof(geneData) and class(geneData) are informative.
> data(geneData)
> typeof(geneData)
[1] "double"
> class(geneData)
[1] "matrix"
Same is true with trying to write.table geneData...object not found until data(geneData) performed.
> write.table(geneData, file="geneData.txt", sep="\t", col.names = NA)
Error in inherits(x, "data.frame") : object 'geneData' not found
> data(geneData)
> write.table(geneData, file="geneData.txt", sep="\t", col.names = NA)
geneData looks like this:
A B C D
AFFX-MurIL2_at 192.742000 85.753300 176.757000 1.35575e+02
AFFX-MurIL10_at 97.137000 126.196000 77.921600 9.33713e+01
AFFX-MurIL4_at 45.819200 8.831350 33.063200 2.87072e+01
AFFX-MurFAS_at 22.544500 3.600930 14.688300 1.23397e+01
I have a test tab delimited file (my_data.txt) that looks like this(cDNA array data, not affy):
S1 S2 S3 S4 S5 S6 S7 S8
1 NA NA NA NA -0.77 -1.02 -1.18 1.51
With the dimensions of 186 rows and 8 columns.
I have built an expressionSet exprs from my_data.txt, coerced into a data.frame, tried it as a matrix, and
cannot get data(exprs) to load like data(geneData).
> dataDirectory <- system.file("extdata", package = "Biobase")
> exprsFile <- "path/to/my_data.txt"
> exprs <- as.matrix(read.table(exprsFile, header = TRUE, sep = "\t", row.names = 1, as.is = TRUE))
> class(exprs)
[1] "matrix"
> typeof(exprs)
[1] "double"
So, exprs appears to match geneData class and typeof (shown above)
> data(exprs)
Warning message:
In data(exprs) : data set 'exprs' not found
Then, when I check exprs, I find the appropriate data which seems to match geneData attributes:
head(exprs[, 1:2])
S1 S2
1 0.58 -0.79
2 1.98 -0.67
3 NA NA
4 -1.37 -1.83
5 NA NA
6 1.60 0.55
> attributes(exprs)
$dim
[1] 186 8
$dimnames[[1]]
[1] "1" "2" "3" "4" "5" "6" "7" "8" "9"
[10] "10" "11" "12" "13" "14" "15" "16" "17" "18"
[19] "19" "20" "21" "22" "23" "24" "25" "26" "27"
[28] "28" "29" "30" "31" "32" "33" "34" "35" "36"
etc
$dimnames[[2]]
[1] "S1"
[2] "S2"
[3] "S3"
etc
Similar to what is found looking at attributes(geneData) (but only after data(geneData) )
> attributes(geneData)
$dim
[1] 500 26
$dimnames
$dimnames[[1]]
[1] "AFFX-MurIL2_at" "AFFX-MurIL10_at"
[3] "AFFX-MurIL4_at" "AFFX-MurFAS_at"
[5] "AFFX-BioB-5_at" "AFFX-BioB-M_at"
$dimnames[[2]]
[1] "A" "B" "C" "D" "E" "F" "G" "H" "I" "J" "K" "L" "M" "N" "O" "P" "Q" "R" "S"
[20] "T" "U" "V" "W" "X" "Y" "Z"
I am aware that this is likely a basic R issue but I have just struggled quite a few hours trying to troubleshoot
and figure this out, but can't. I need help getting my matrix input file loaded like data(geneData) in the geneRecommender example.
Any advice would be greatly appreciated.
Thanks,
Robert Sweeney
> sessionInfo()
R version 2.13.0 (2011-04-13)
Platform: x86_64-redhat-linux-gnu (64-bit)
locale:
[1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
[3] LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8
[5] LC_MONETARY=C LC_MESSAGES=en_US.UTF-8
[7] LC_PAPER=en_US.UTF-8 LC_NAME=C
[9] LC_ADDRESS=C LC_TELEPHONE=C
[11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] geneRecommender_1.24.0 Biobase_2.12.2
loaded via a namespace (and not attached):
[1] tools_2.13.0
>
More information about the Bioconductor
mailing list