[BioC] BioMaRt query

James W. MacDonald jmacdon at med.umich.edu
Mon Oct 4 17:25:29 CEST 2010


Hi Rene,

On 10/4/2010 4:44 AM, René Dreos wrote:
> Dear BioC mailing list,
>
> I am trying to annotate Arabidopsis ATH1 genome array results using biomaRt,
> but it looks like some of the probesets are not annotated in biomaRt
> database. Here is one example:
>
>> library(biomaRt)
>> AT.db<- useMart(biomart="plant_mart_6", dataset="athaliana_eg_gene")
>> getBM(attributes = c("affy_ath1_121501","ensembl_gene_id","description"),
> filters = "affy_ath1_121501", values = "254998_at", mart = AT.db)
> [1] affy_ath1_121501 ensembl_gene_id  description
> <0 rows>  (or 0-length row.names)
>
> But if I use ath1121501.db library to annotate the same probeset:
>
>> library(annotate)
>> library(ath1121501.db)
>
>> mget("254998_at", env=ath1121501GENENAME)
> $`254998_at`
> [1] "encodes a choline synthase whose gene expression is induced by high
> salt and mannitol."
>
>> mget("254998_at", env=ath1121501ACCNUM)
> $`254998_at`
> [1] "AT4G09760"
>
> Why is this happening?

Because you are querying two different data sources and have found an 
instance in which they are not consistent. This is a pretty common 
occurrence, given how fluid gene definitions are (and likely will be for 
some time).

Best,

Jim


>
> Thank you for any advice,
> best regards
> r
>
>> sessionInfo()
> R version 2.11.1 (2010-05-31)
> x86_64-apple-darwin9.8.0
>
> locale:
> [1] C
>
> attached base packages:
> [1] grid      stats     graphics  grDevices utils     datasets  methods
> [8] base
>
> other attached packages:
>   [1] ath1121501.db_2.4.1       org.At.tair.db_2.4.3
>   [3] RSQLite_0.9-2             annotate_1.26.1
>   [5] ath1121501cdf_2.6.0       biomaRt_2.4.0
>   [7] genefilter_1.30.0         marray_1.26.0
>   [9] gplots_2.8.0              caTools_1.10
> [11] bitops_1.0-4.1            gdata_2.7.2
> [13] gtools_2.6.2              bradiar1b520742cdf_1.24.0
> [15] arrayQualityMetrics_2.6.0 affyPLM_1.24.1
> [17] gcrma_2.20.0              preprocessCore_1.10.0
> [19] matchprobes_1.20.0        Biostrings_2.16.9
> [21] IRanges_1.6.15            AnnotationDbi_1.10.2
> [23] affxparser_1.20.0         makecdfenv_1.26.0
> [25] lattice_0.18-8            RMySQL_0.7-5
> [27] DBI_0.2-5                 affy_1.26.1
> [29] Biobase_2.8.0             limma_3.4.4
>
> loaded via a namespace (and not attached):
>   [1] RColorBrewer_1.0-2  RCurl_1.4-2         XML_3.1-1
>   [4] affyio_1.16.0       beadarray_1.16.0    hwriter_1.2
>   [7] latticeExtra_0.6-14 simpleaffy_2.24.0   splines_2.11.1
> [10] stats4_2.11.1       survival_2.35-8     tools_2.11.1
> [13] vsn_3.16.0          xtable_1.5-6
>
> 	[[alternative HTML version deleted]]
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at stat.math.ethz.ch
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor

-- 
James W. MacDonald, M.S.
Biostatistician
Douglas Lab
University of Michigan
Department of Human Genetics
5912 Buhl
1241 E. Catherine St.
Ann Arbor MI 48109-5618
734-615-7826
**********************************************************
Electronic Mail is not secure, may not be read every day, and should not be used for urgent or sensitive issues 



More information about the Bioconductor mailing list