[BioC] annotation on mouse4302

Marc Carlson mcarlson at fhcrc.org
Mon Sep 24 21:03:23 CEST 2007


Steffen wrote:
> Hi Mayte,
>
> Adding a third source, Ensembl (which does an independent mapping of the 
> affy probes) gives:
>
>  >library(biomaRt)
> Loading required package: RCurl
>  > ensembl=useMart("ensembl", dataset="mmusculus_gene_ensembl")
> Checking attributes and filters ... ok
>  > getBM(c("affy_mouse430_2","entrezgene","markersymbol","refseq_dna"), 
> filters="affy_mouse430_2", values="1442021_at", mart=ensembl)
>  
>   affy_mouse430_2 entrezgene markersymbol refseq_dna
> 1      1442021_at      14680         Gnal  NM_177137
>
>
> So it looks like Ensembl supports the Affy annotation. 
> Possibly the difference with the BioC annotation packages is due to 
> older annotation versions in BioC.  On October 5th, there is a new 
> release of Bioconductor and I assume the annotation packages will be 
> updated at that time as well.
>
> Cheers,
> Steffen
>
> Mayte Suarez-Farinas wrote:
>   
>> Hi
>>
>> I am using mouse4302 chips, and my biology counterpart found some odd
>> annotations. For a non-negligible number of genes, the symbols produced by
>> Bioconductor and the Affymetrix web site are very different (they are not
>> even synonyms). Usually the two genes are next to each other in the
>> genome (if you look at them in the UCSC genome browser).
>> An example
>> probeset: 1442021_at:
>>
>> Bioconductor annotation
>> entrez id: 225651        symbol:Mppe1
>> refseq: NM_172630    NP_766218    XM_001001173 XM_001001180  
>> XM_001001193 XM_001001206 XM_001001219 XP_001001173
>>   XP_001001180 XP_001001193 XP_001001206 XP_001001219
>>
>> Affy annotation
>> entrez id:  14680          symbol: Gnal
>> refseq: NM_010307 NM_177137
>>
>> I guess my question is how the mapping is done, so that it results in
>> such a difference, and which one is better. Why are these annotations
>> at odds with Affymetrix? Which one should I believe?
>>
>>
>> Thanks in advance
>>
>> Mayte
>>
>> _______________________________________________
>> Bioconductor mailing list
>> Bioconductor at stat.math.ethz.ch
>> https://stat.ethz.ch/mailman/listinfo/bioconductor
>> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
>>
>>
>>     
Hi guys.

It looks like the information at Affy has been updated since March.
Affy is the source that we use to build the annotation packages for
each release. Using the very latest mouse4302.db package from our devel
branch, I get the very same answer as the Ensembl folks above, whereas
with the older annotations from last March I get "Mppe1" instead. So
you might want to update your annotation packages soon. The new
annotations will be in the release on the 5th, but you can get them
from devel right now if you feel adventurous.


Here is the code I ran:

library("mouse4302.db")
ls(2)  # list the objects at search-path position 2 (the package just attached)

get("1442021_at", envir=mouse4302SYMBOL)

# this gets me:  [1] "Gnal"


get("1442021_at", envir=mouse4302GENENAME)

# and this gets me:  [1] "guanine nucleotide binding protein, alpha stimulating, olfactory type"
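
If you want to see how many probesets disagree between two annotation
sources, something like the following might be a starting point. It is
only a sketch in base R: the data frames `bioc` and `affy` and their
column names are my own assumptions, and the second row is a made-up
placeholder. Only the 1442021_at values come from this thread.

```r
# Two annotation tables keyed by probeset ID (hypothetical layout).
bioc <- data.frame(
  probe  = c("1442021_at", "hypothetical_at"),  # second row is a placeholder
  symbol = c("Mppe1", "GeneX"),
  entrez = c("225651", "00000"),
  stringsAsFactors = FALSE
)
affy <- data.frame(
  probe  = c("1442021_at", "hypothetical_at"),
  symbol = c("Gnal", "GeneX"),
  entrez = c("14680", "00000"),
  stringsAsFactors = FALSE
)

# Join on probeset ID; the suffixes record which source each column came from.
both <- merge(bioc, affy, by = "probe", suffixes = c(".bioc", ".affy"))

# Keep only the probesets whose Entrez IDs differ between the two sources.
discordant <- both[both$entrez.bioc != both$entrez.affy, ]
print(discordant)
```

With real data you would fill `bioc` from the annotation package and
`affy` from the Affymetrix annotation file. Comparing Entrez IDs rather
than symbols avoids flagging probesets that differ only by a synonym.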



#And my sessionInfo() (in case you want to know):
R version 2.6.0 alpha (2007-09-13 r42838)
x86_64-unknown-linux-gnu

locale:
LC_CTYPE=en_US;LC_NUMERIC=C;LC_TIME=en_US;LC_COLLATE=en_US;LC_MONETARY=en_US;LC_MESSAGES=en_US;LC_PAPER=en_US;LC_NAME=C;LC_ADDRESS=C;LC_TELEPHONE=C;LC_MEASUREMENT=en_US;LC_IDENTIFICATION=C

attached base packages:
[1] tools     stats     graphics  grDevices utils     datasets  methods
[8] base

other attached packages:
[1] mouse4302.db_1.99.3  AnnotationDbi_0.99.8 RSQLite_0.6-0
[4] DBI_0.2-3            Biobase_1.15.36

loaded via a namespace (and not attached):
[1] rcompgen_0.1-15


    Hope this helps,

        Marc


