[BioC] Ensembl IDs 2 GO term enrichment --> simple solution withoutbiomart? ( I can't install it because of RCurl)

michael watson (IAH-C) michael.watson at bbsrc.ac.uk
Wed Sep 17 12:21:37 CEST 2008


If you are stuck on using R, then biomart is simply a web service.

To link ens genes to affy chicken, the url is:

http://www.biomart.org/biomart/martservice?query=%3C?xml%20version=%221.0%22%20encoding=%22UTF-8%22?%3E%3C!DOCTYPE%20Query%3E%3CQuery%20%20virtualSchemaName%20=%20%22default%22%20formatter%20=%20%22TSV%22%20header%20=%20%220%22%20uniqueRows%20=%20%220%22%20count%20=%20%22%22%20datasetConfigVersion%20=%20%220.6%22%20%3E%3CDataset%20name%20=%20%22ggallus_gene_ensembl%22%20interface%20=%20%22default%22%20%3E%3CAttribute%20name%20=%20%22ensembl_gene_id%22%20/%3E%3CAttribute%20name%20=%20%22affy_chicken%22%20/%3E%3C/Dataset%3E%3C/Query%3E

thats how my browser formatted it, but I pasted:

http://www.biomart.org/biomart/martservice?query=<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE Query><Query  virtualSchemaName = "default" formatter = "TSV" header = "0" uniqueRows = "0" count = "" datasetConfigVersion = "0.6" ><Dataset name = "ggallus_gene_ensembl" interface = "default" ><Attribute name = "ensembl_gene_id" /><Attribute name = "affy_chicken" /></Dataset></Query>

In R, I can do this:

url <- 'http://www.biomart.org/biomart/martservice?query=%3C?xml%20version=%221.0%22%20encoding=%22UTF-8%22?%3E%3C!DOCTYPE%20Query%3E%3CQuery%20%20virtualSchemaName%20=%20%22default%22%20formatter%20=%20%22TSV%22%20header%20=%20%220%22%20uniqueRows%20=%20%220%22%20count%20=%20%22%22%20datasetConfigVersion%20=%20%220.6%22%20%3E%3CDataset%20name%20=%20%22ggallus_gene_ensembl%22%20interface%20=%20%22default%22%20%3E%3CAttribute%20name%20=%20%22ensembl_gene_id%22%20/%3E%3CAttribute%20name%20=%20%22affy_chicken%22%20/%3E%3C/Dataset%3E%3C/Query%3E'


d <- read.table(url, sep="\t")
d[1:5,]
                  V1                  V2
1 ENSGALG00000014600                    
2 ENSGALG00000011014  Gga.9239.1.S1_s_at
3 ENSGALG00000011014   Gga.16267.1.S1_at
4 ENSGALG00000011014  Gga.9239.1.S1_s_at
5 ENSGALG00000011014 Gga.16267.1.S1_s_at

Mick

-----Original Message-----
From: bioconductor-bounces at stat.math.ethz.ch on behalf of Emmanuel Levy
Sent: Wed 17/09/2008 7:13 AM
To: bioconductor at stat.math.ethz.ch
Subject: [BioC] Ensembl IDs 2 GO term enrichment --> simple solution withoutbiomart? ( I can't install it because of RCurl)
 
Dear list members,

I'd like to carry out a GO TERM enrichment analysis based on ENSEMBL
IDs and the GOstats package.

Though I have problems installing biomaRt, which I reported to the R
mailing list, I was wondering how to use GOstats with biomaRt, i.e.,
what should be specified for the "annotation package"?

Also, I was wondering if by any chance, somebody could think of a way
around biomaRt if I can't manage to install it.

Many thanks for your help,

Best wishes,

Emmanuel


==========================================
Email sent to the R mailing list:

Subject: RCurl compilation error on ubuntu hardy

Dear list members,

I encountered this problem and the solution pointed out in a previous
thread did not work for me.
(e.g.  install.packages("RCurl", repos = "http://www.omegahat.org/R")

I work with Ubuntu Hardy, and installed R 2.6.2 via apt-get.

I really need RCurl in order to use biomaRt ... any help would be
greatly appreciated.

Best wishes,

Emmanuel

=================================================

> sessionInfo()
R version 2.6.2 (2008-02-08)
x86_64-pc-linux-gnu

locale:
LC_CTYPE=en_CA.UTF-8;LC_NUMERIC=C;LC_TIME=en_CA.UTF-8;LC_COLLATE=en_CA.UTF-8;LC_MONETARY=en_CA.UTF-8;LC_MESSAGES=en_CA.UTF-8;LC_PAPER=en_CA.UTF-8;LC_NAME=C;LC_ADDRESS=C;LC_TELEPHONE=C;LC_MEASUREMENT=en_CA.UTF-8;LC_IDENTIFICATION=C

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base

loaded via a namespace (and not attached):
[1] rcompgen_0.1-17 tools_2.6.2
>

=================================================


> install.packages("RCurl", repos = "http://www.omegahat.org/R")
Warning in install.packages("RCurl", repos = "http://www.omegahat.org/R") :
 argument 'lib' is missing: using '/usr/local/lib/R/site-library'
trying URL 'http://www.omegahat.org/R/src/contrib/RCurl_0.9-4.tar.gz'
Content type 'application/x-gzip' length 150884 bytes (147 Kb)
opened URL
==================================================
downloaded 147 Kb

* Installing *source* package 'RCurl' ...
checking for curl-config... /usr/bin/curl-config
checking for gcc... gcc
checking for C compiler default output file name... a.out
checking whether the C compiler works... yes
checking whether we are cross compiling... no
checking for suffix of executables...
checking for suffix of object files... o
checking whether we are using the GNU C compiler... yes
checking whether gcc accepts -g... yes
checking for gcc option to accept ANSI C... none needed
checking how to run the C preprocessor... gcc -E
Version has a libidn field
configure: creating ./config.status
config.status: creating src/Makevars
** libs
gcc -std=gnu99 -I/usr/share/R/include -I/usr/share/R/include
-DHAVE_LIBIDN_FIELD=1     -fpic  -g -O2 -c base64.c -o base64.o
In file included from base64.c:1:
Rcurl.h:52: error: expected specifier-qualifier-list before 'cetype_t'
make: *** [base64.o] Error 1
chmod: cannot access `/usr/local/lib/R/site-library/RCurl/libs/*': No
such file or directory
ERROR: compilation failed for package 'RCurl'
** Removing '/usr/local/lib/R/site-library/RCurl'

The downloaded packages are in
       /tmp/RtmpQ8FMBZ/downloaded_packages
Warning message:
In install.packages("RCurl", repos = "http://www.omegahat.org/R") :
 installation of package 'RCurl' had non-zero exit status

================================================================================

Hi Martin,
are you working on a 64-bit linux distribution and which version of
RCurl are you trying to install? There has been a problem with a
recent version of RCurl and the "R_base64_decode", search the archives
of the Bioconductor mailing list for a thread called
"RCurl loading problem with 64 bit linux distribution".
Please try using the newest versions of R (R-2.7.0 has been released a
few weeks ago) and RCurl, which you can obtain from within R by
typing:
install.packages("RCurl", repos = "http://www.omegahat.org/R")
The new version of RCurl (>= 0.9.2) worked fine for me, while 0.9.0
and 0.9.1 did not.
Hope this helps.
Joern


martin sikora wrote:

   dear list members,

   i'm having a problem installing the biomaRt package on my linux
machine, due to the fact of a compilation error with RCurl. i am using
R 2.6.2 on fedora 7, and this is the output i get:

   gcc -m32 -std=gnu99 -I/usr/include/R -I/usr/include/R
-DHAVE_LIBIDN_FIELD=1 -I/usr/local/include    -fpic  -O2 -g -pipe
-Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector
--param=ssp-buffer-size=4 -m32 -march=i386 -mtune=generic
-fasynchronous-unwind-tables -c base64.c -o base64.o
   In file included from base64.c:1:
   Rcurl.h:52: error: expected specifier-qualifier-list before ?cetype_t?
   base64.c: In function ?R_base64_decode?:
   base64.c:25: warning: pointer targets in assignment differ in signedness
   base64.c:39: warning: pointer targets in passing argument 1 of
?Rf_mkString? differ in signedness
   base64.c: In function ?R_base64_encode?:
   base64.c:60: warning: pointer targets in assignment differ in signedness
   make: *** [base64.o] Error 1

   as far as i know i have all the necessary libraries installed:

   $ yum list installed | grep libxml
   libxml2.i386                             2.6.31-1.fc7
installed      libxml2-devel.i386                       2.6.31-1.fc7
       installed      libxml2-python.i386
2.6.31-1.fc7           installed      perl-libxml-perl.noarch
       0.08-1.2.1             installed
   $ yum list installed | grep curl
   curl.i386                                7.16.4-1.fc7
installed      curl-devel.i386                          7.16.4-1.fc7
       installed      python-pycurl.i386
7.16.0-0.1.20061207.fc installed

   as i am not an expert in linux stuff, i was wondering if there
could be any other missing libraries? any other ideas?

   cheers
   martin

_______________________________________________
Bioconductor mailing list
Bioconductor at stat.math.ethz.ch
https://stat.ethz.ch/mailman/listinfo/bioconductor
Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor



More information about the Bioconductor mailing list