[BioC] How to deal with Affymetrix probe that map to multiple genes
Gundala Viswanath
gundalav at gmail.com
Tue Sep 10 07:46:08 CEST 2013
Given a (e.g.431765_a_at), I'm trying to find the gene symbol.
But some how it gives multiple genes. How can I interpret that properly?
This is the example code and output:
library(biomaRt)
mouse = useMart("ensembl", dataset = "mmusculus_gene_ensembl")
getBM( c("affy_mouse430_2", "mgi_symbol"),mart=mouse,
filters="affy_mouse430_2", "1431765_a_at")
Which yields:
affy_mouse430_2 mgi_symbol
1 1431765_a_at Rps2-ps13
2 1431765_a_at Gm12176
3 1431765_a_at Gm7860
4 1431765_a_at Gm12366
5 1431765_a_at Gm10653
6 1431765_a_at Rps2-ps4
7 1431765_a_at Gm10420
8 1431765_a_at Rps2
9 1431765_a_at Gm5921
10 1431765_a_at Gm15846
11 1431765_a_at Gm16061
12 1431765_a_at Gm6433
13 1431765_a_at Gm4968
14 1431765_a_at Gm9013
15 1431765_a_at Gm17150
16 1431765_a_at Gm11687
17 1431765_a_at Gm18025
18 1431765_a_at Gm8225
19 1431765_a_at Gm11643
20 1431765_a_at Gm11249
21 1431765_a_at Gm12922
22 1431765_a_at Gm12933
23 1431765_a_at Gm16148
24 1431765_a_at Gm6139
25 1431765_a_at Gm5786
26 1431765_a_at Rps2-ps9
27 1431765_a_at Gm11599
28 1431765_a_at Gm16305
29 1431765_a_at 4931440P22Rik
30 1431765_a_at Gm12091
31 1431765_a_at Gm6311
32 1431765_a_at Rps2-ps6
33 1431765_a_at Rps2-ps10
34 1431765_a_at Gm5070
35 1431765_a_at Snora64
In reality, having multiple probe set, I'd convert all the affymetrix ID
into gene symbol. And later perform clustering, GO analysis, etc based
on these genes.
Your expert advice will be much appreciated.
- G.V.
More information about the Bioconductor
mailing list