[BioC] GEOquery
Cei Abreu-Goodger
cei at sanger.ac.uk
Fri Jul 11 17:30:41 CEST 2008
Hi Sean,
I'm trying to help Harpreet get the GEOquery library working properly
over here. Thanks to what you pointed out, we were able to track the
problem down to curl using our HTTP proxy, which is not required for FTP
transfers. One problem remains: I can't figure out how to turn off the
"ftp.use.epsv" option in RCurl. From a Linux terminal, I can use:
curl --disable-epsv "ftp://ftp.ncbi.nih.gov/pub/geo/DATA/SeriesMatrix/GSE4201/"
-r--r--r-- 1 ftp anonymous 930471 Apr 13 05:32 GSE4201_series_matrix.txt.gz
(Without --disable-epsv it times out unless I set ftp_proxy, but then I
get the HTML index instead of the file listing.)
Inside R, I imagine I have to turn the "ftp.use.epsv" option off, and
I've tried doing something like this:
myCurl <- getCurlOptionsConstants()
myCurl[["ftp.use.epsv"]] <- 0
getURL("ftp://ftp.ncbi.nih.gov/pub/geo/DATA/SeriesMatrix/GSE4201/",
.opts=list(myCurl))
but it keeps timing out...
I also tried:
curlSetOpt("ftp.use.epsv"=0)
but that doesn't seem to have any effect on what
getCurlOptionsConstants() returns; it just creates a CURLOptions object,
which I can't figure out how to use.
Do you have any suggestions, or should I ask the RCurl developers
directly?
Many thanks,
Cei
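For reference, a minimal sketch of how a curl option is usually passed to
RCurl, assuming the installed RCurl accepts the option under the name
ftp.use.epsv (the counterpart of curl's --disable-epsv). The helper
list_ftp_dir is a hypothetical name, not part of RCurl or GEOquery, and
none of this has been tested behind the proxy described above.

```r
library(RCurl)

# Directory from the message above.
url <- "ftp://ftp.ncbi.nih.gov/pub/geo/DATA/SeriesMatrix/GSE4201/"

# Build an options object by name, rather than editing the table
# returned by getCurlOptionsConstants().
opts <- curlOptions(ftp.use.epsv = FALSE)

# Hypothetical helper: fetch an FTP directory listing with EPSV disabled.
list_ftp_dir <- function(u, curl_opts = opts) {
  getURL(u, .opts = curl_opts)
}

# Alternative: set the option on a reusable handle and pass that handle.
h <- getCurlHandle(ftp.use.epsv = FALSE)
list_ftp_dir_handle <- function(u, handle = h) {
  getURL(u, curl = handle)
}

# Usage (needs network access, so left commented out):
# list_ftp_dir(url)
# list_ftp_dir_handle(url)
```

Options can also be given directly as named arguments, as in
getURL(url, ftp.use.epsv = FALSE). If the call still times out with EPSV
disabled, the proxy settings would be the next thing to check.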
> So, this appears to be the problem. It looks like your proxy is
> intercepting the ftp directory listing and converting it to HTML. I
> do not know how to solve this problem, as it appears to be a proxy
> configuration issue at your institution. However, I can't say for
> sure. The output of the getURL() command should look like:
>
>
>> getURL("ftp://ftp.ncbi.nih.gov/pub/geo/DATA/SeriesMatrix/GSE4201/")
> [1] "-r--r--r-- 1 ftp anonymous 930471 Apr 13 05:32 GSE4201_series_matrix.txt.gz\n"
>
> Notice how yours is much longer and is HTML, not plain text.
>
> Sean