[BioC] understanding multiples matches between probesets and entrezgene (biomart)
Juliet Hannah
juliet.hannah at gmail.com
Wed Jun 13 17:01:32 CEST 2012
All,
I understand the concept of multiple probesets corresponding to one
identifier. But what is the meaning of
a probeset corresponding to multiple identifiers? And below, given
that 220547_s_at has a match,
why should another row be returned with NA.
Did I happen to choose a few probesets where the gene definition is
changing, or am I misunderstanding
something else, such as the biomart syntax.
Thanks,
Juliet
library("biomaRt")
probeSets <- c("219666_at", "220547_s_at", "218034_at")
ensembl = useMart("ensembl")
ensembl = useDataset("hsapiens_gene_ensembl",mart=ensembl)
getBM(attributes = c("affy_hg_u133a", "entrezgene"), filters =
"affy_hg_u133a",values = probeSets, mart = ensembl)
affy_hg_u133a entrezgene
1 220547_s_at 54537
2 218034_at 51024
3 220547_s_at NA
4 219666_at 64231
5 220547_s_at 414241
6 220547_s_at 439965
> sessionInfo()
R version 2.15.0 (2012-03-30)
Platform: x86_64-unknown-linux-gnu (64-bit)
locale:
[1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
[3] LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8
[5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
[7] LC_PAPER=C LC_NAME=C
[9] LC_ADDRESS=C LC_TELEPHONE=C
[11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] biomaRt_2.12.0 BiocInstaller_1.4.6
loaded via a namespace (and not attached):
[1] RCurl_1.91-1 tools_2.15.0 XML_3.9-4
More information about the Bioconductor
mailing list