[BioC] GEOquery on rawdata and processed data ?
Sean Davis
sdavis2 at mail.nih.gov
Tue Jul 3 22:06:24 CEST 2007
Hi, Alex. The typical process would be to use getGEO to get a GSE or
GSEMatrix file and parse it into R. The data in these files are taken
directly from submitters to GEO and so could be processed by RMA, MAS5,
or any of several other methods. One will often need to refer to the
protocol information in GEO or to the associated paper to determine the
exact methods. As Saroj pointed out, in many cases, there is a link in
the GSE file or online on the summary page to supplementary files. This
link will, for Affy, usually contain at least .CEL files. One can then
use the getGEO function to get the processed data and annotation, then
get the raw .CEL files and process them however necessary, and replace
the values that come from GEO with the ones derived locally.
Sean
More information about the Bioconductor
mailing list