[BioC] biomaRt error
Rhoda Kinsella
rhoda at ebi.ac.uk
Thu Jan 14 10:46:03 CET 2010
Hi Iain
Yes there seems to have been problems connecting to the
www.biomart.org registry which have now been resolved by the Biomart
team at the OICR. As an alternative, you can also access the Ensembl
databases from the Ensembl biomart installation at www.ensembl.org
using the following biomaRt commands:
> library(biomaRt)
> listMarts(host = "www.ensembl.org");
biomart version
1 ENSEMBL_MART_ENSEMBL Ensembl 56
2 ENSEMBL_MART_SNP Ensembl variation 56
3 ENSEMBL_MART_FUNCGEN Ensembl functional genomics 56
4 ENSEMBL_MART_VEGA Vega 36
5 REACTOME Reactome
6 wormbase_current WormBase (CSHL US)
7 pride PRIDE (EBI UK)
> mart = useMart("ENSEMBL_MART_ENSEMBL", host = "www.ensembl.org")
etc...
Regards
Rhoda
On 14 Jan 2010, at 09:37, Iain Gallagher wrote:
> Seems to be ok this morning. Ran the same query with no problems.
> Perhaps some transient changes at the server end.
>
> Cheers
>
> Iain
>
> --- On Wed, 13/1/10, Paul Leo <p.leo at uq.edu.au> wrote:
>
>> From: Paul Leo <p.leo at uq.edu.au>
>> Subject: Re: [BioC] biomaRt error
>> To: "Iain Gallagher" <iaingallagher at btopenworld.com>
>> Cc: bioconductor at stat.math.ethz.ch
>> Date: Wednesday, 13 January, 2010, 23:45
>> Dito.
>> Actually 3 days ago some of my old queries started
>> returning different
>> numbers of rows
>>
>>
>> length(unique(ann.all[ann.all[,"gene_biotype"]=="protein_coding"
>> ,
>> "ensembl_gene_id"]))
>> [1] 19940
>>
>>>
>> length
>> (unique
>> (ann.all.entrez[ann.all.entrez[,"gene_biotype"]=="protein_coding"
>> , "ensembl_gene_id"])) # 22836
>> [1] 22836
>>
>> vs for months before hand and 4 days previous
>>
>> length(unique(ann.all[ann.all[,"gene_biotype"]=="protein_coding"
>> ,
>> "ensembl_gene_id"]))
>> [1] 22836
>>
>>>
>> length
>> (unique
>> (ann.all.entrez[ann.all.entrez[,"gene_biotype"]=="protein_coding"
>> , "ensembl_gene_id"])) # 22836
>> [1] 22836
>>
>> FOR
>> a.filter<-c( "chromosome_name", "start" , "end")
>> fil.vals<-list(chrom, low.cut, high.cut)
>>
>> ann.all<-getBM(attributes =
>> c
>> ("ensembl_gene_id
>> ","external_gene_id
>> ","chromosome_name
>> ","start_position
>> ","end_position","strand","gene_biotype","mgi_symbol","description"),
>> filters = a.filter, values=fil.vals, mart = mart)
>>
>> ann.all.entrez<-getBM(attributes =
>> c(
>> "ensembl_gene_id
>> ","external_gene_id
>> ","ensembl_transcript_id
>> ","chromosome_name
>> ","start_position
>> ","end_position
>> ","strand","gene_biotype","entrezgene","mgi_symbol","description"),
>> filters = a.filter, values=fil.vals, mart = mart)
>>
>> No idea why the first query was affected and not the
>> second.... This did
>> coincide with a bioC update I did two days ago... but that
>> may be a
>> coincidence....
>>
>>
>>
>>
>> Today I get the same error
>>> listMarts()
>> Error: non-BioMart die():
>> not well-formed (invalid token) at line 1, column 11797,
>> byte 11797
>> at /usr/lib/perl5/XML/Parser.pm line 187
>>
>> does not seem to be XML, nor to identify a file
>> name
>>
>>
>>
>>
>>> sessionInfo()
>> R version 2.10.1 (2009-12-14)
>> x86_64-pc-linux-gnu
>>
>> locale:
>> [1] LC_CTYPE=en_AU.UTF-8
>> LC_NUMERIC=C
>>
>> [3] LC_TIME=en_AU.UTF-8
>> LC_COLLATE=en_AU.UTF-8
>> [5] LC_MONETARY=C
>> LC_MESSAGES=en_AU.UTF-8
>> [7] LC_PAPER=en_AU.UTF-8
>> LC_NAME=C
>>
>> [9] LC_ADDRESS=C
>> LC_TELEPHONE=C
>>
>> [11] LC_MEASUREMENT=en_AU.UTF-8 LC_IDENTIFICATION=C
>>
>>
>> attached base packages:
>> [1] stats graphics grDevices
>> utils datasets methods
>> base
>>
>> other attached packages:
>> [1] org.Mm.eg.db_2.3.6 RSQLite_0.8-1
>> DBI_0.2-5
>>
>> [4] AnnotationDbi_1.8.1 Biobase_2.6.1
>> biomaRt_2.2.0
>>
>> loaded via a namespace (and not attached):
>> [1] RCurl_1.3-1 tcltk_2.10.1 tools_2.10.1
>> XML_2.6-0
>>
>>
>>
>> -----Original Message-----
>> From: Iain Gallagher <iaingallagher at btopenworld.com>
>> To: bioconductor at stat.math.ethz.ch
>> Subject: [BioC] biomaRt error
>> Date: Wed, 13 Jan 2010 19:00:14 +0000 (GMT)
>>
>> Hello List
>>
>> Can anyone shed any light on the following biomaRt error?
>> Has something changed at Ensembl?
>>
>>> library(biomaRt)
>>>
>>> mart <- useMart("ensembl",
>> dataset="hsapiens_gene_ensembl")
>> Error: non-BioMart die():
>> not well-formed (invalid token) at line 1, column 11797,
>> byte 11797 at /usr/lib/perl5/XML/Parser.pm line 187
>>
>> does not seem to be XML, nor to identify a file
>> name
>>> sessionInfo()
>> R version 2.10.1 (2009-12-14)
>> x86_64-pc-linux-gnu
>>
>> locale:
>> [1] LC_CTYPE=en_GB.UTF-8
>> LC_NUMERIC=C
>>
>> [3] LC_TIME=en_GB.UTF-8
>> LC_COLLATE=en_GB.UTF-8
>> [5] LC_MONETARY=C
>> LC_MESSAGES=en_GB.UTF-8
>> [7] LC_PAPER=en_GB.UTF-8
>> LC_NAME=C
>>
>> [9] LC_ADDRESS=C
>> LC_TELEPHONE=C
>>
>> [11] LC_MEASUREMENT=en_GB.UTF-8 LC_IDENTIFICATION=C
>>
>>
>> attached base packages:
>> [1] stats graphics grDevices
>> utils datasets
>> methods base
>>
>> other attached packages:
>> [1] biomaRt_2.2.0
>>
>> loaded via a namespace (and not attached):
>> [1] RCurl_1.3-1 XML_2.6-0
>>>
>>
>> Thanks
>>
>> Iain
>>
>>
>> _______________________________________________
>> Bioconductor mailing list
>> Bioconductor at stat.math.ethz.ch
>> https://stat.ethz.ch/mailman/listinfo/bioconductor
>> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
>>
>>
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at stat.math.ethz.ch
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
Rhoda Kinsella Ph.D.
Ensembl Bioinformatician,
European Bioinformatics Institute (EMBL-EBI),
Wellcome Trust Genome Campus,
Hinxton
Cambridge CB10 1SD,
UK.
More information about the Bioconductor
mailing list