[BioC] GEOquery-Problem related to importing data set
James W. MacDonald
jmacdon at uw.edu
Mon Sep 10 23:04:31 CEST 2012
Hi Roopa,
On 9/10/2012 4:50 PM, Roopa Subbaiaih wrote:
> Thank you very much. I tried to follow what you did.This is what I get.
> getGEO("GDS2478")
> Error in download.file(myurl, destfile, mode = mode, quiet = TRUE,
> method = getOption("download.file.method.GEOquery")) :
> cannot open URL
> 'ftp://ftp.ncbi.nlm.nih.gov/pub/geo/DATA/SOFT/GDS/GDS2478.soft.gz'
> Am I missing something?
Well, there is the obvious - you are connected to the internet, yes? If
so, does the following code download some data?
read.table("ftp://ftp.stats.ox.ac.uk/pub/datasets/csb/ch11b.dat")
if not, does this help (here I am assuming you are on Windows)?
setInternet2(NA)
read.table("ftp://ftp.stats.ox.ac.uk/pub/datasets/csb/ch11b.dat")
If so, you should then be able to use getGEO().
Best,
Jim
> Thank you for the quick response. Roopa
>
>
> On Mon, Sep 10, 2012 at 4:33 PM, James W. MacDonald <jmacdon at uw.edu
> <mailto:jmacdon at uw.edu>> wrote:
>
> Hi Roopa,
>
>
> On 9/10/2012 4:08 PM, Roopa Subbaiaih wrote:
>
> Hi All,
>
> I am trying to import few affy datasets into bioconductor.When
> I try the
> example given in R script it works fine but not the case with
> the dataset I
> need to import.
>
> gds<-
> getGEO(filename=system.file("extdata/GDS2478.soft.gz",package="GEOquery"))
>
>
> Unless you put the file there, I doubt you are pointing to the
> right place. Note that the vignette uses the extdata directory for
> data that come with the GEOquery package - that isn't where you
> should put things.
>
> Here is what I got:
>
> > getGEO("GDS2478")
> File stored at:
> C:\Users\BIOINF~1\AppData\Local\Temp\RtmpYDVgJB/GDS2478.soft.gz
> > dat <-
> getGEO(filename="C:/Users/BIOINF~1/AppData/Local/Temp/RtmpYDVgJB/GDS2478.soft.gz")
> > class(dat)
> [1] "GDS"
> attr(,"package")
> [1] "GEOquery"
>
> And to see what is in a GDS object,
>
> ?GDS-class
> and
> ?GEOData-class
>
> and from there I get
>
> > head(Table(dat))
> ID_REF IDENTIFIER GSM148887 GSM148888 GSM148889 GSM148890
> GSM148892
> 1 1007_s_at DDR1 760.999 754.827 758.415 746.281
> 776.563
> 2 1053_at RFC2 93.978 113.67 110.74 114.423
> 140.641
> 3 117_at HSPA6 39.268 40.784 42.345 37.779
> 44.122
> 4 121_at PAX8 189.36 194.713 208.084 180.112
> 223.06
> 5 1255_g_at GUCA1A 11.864 8.857 10.754 9.224
> 10.396
> 6 1294_at UBA7 70.985 78.527 79.646 70.705
> 77.107
>
> Best,
>
> Jim
>
>
>
> Error in read.table(con, sep = "\t", header = FALSE, nrows =
> nseries) :
> invalid 'nlines' argument
> In addition: Warning messages:
> 1: In file(fname, "r") :
> file("") only supports open = "w+" and open = "w+b": using
> the former
> 2: In file(con, "r") :
> file("") only supports open = "w+" and open = "w+b": using
> the former
> 3: In file(fname, "r") :
> file("") only supports open = "w+" and open = "w+b": using
> the former
> This what i get.
>
> Please let me know how to rectify this. Thanks, Roopa
>
> [[alternative HTML version deleted]]
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at r-project.org <mailto:Bioconductor at r-project.org>
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives:
> http://news.gmane.org/gmane.science.biology.informatics.conductor
>
>
> --
> James W. MacDonald, M.S.
> Biostatistician
> University of Washington
> Environmental and Occupational Health Sciences
> 4225 Roosevelt Way NE, # 100
> Seattle WA 98105-6099
>
>
>
>
--
James W. MacDonald, M.S.
Biostatistician
University of Washington
Environmental and Occupational Health Sciences
4225 Roosevelt Way NE, # 100
Seattle WA 98105-6099
More information about the Bioconductor
mailing list