[BioC] Help with getBM using EST ids from zebrafish
Ochsner, Scott A
sochsner at bcm.tmc.edu
Thu Dec 4 21:14:13 CET 2008
Dear list,
I am trying to retrieve zfin annotation for each of my 6000+ zebrafish EST ids using biomaRt. As an example of the EST ids I've included the output from NCBI's EST search for CO349769.
1. DR_AOV_FL01_G08 adult ovary full-length (TLL) Danio rerio cDNA, mRNA sequence
gi|49431086|gb|CO349769.1|[49431086]
As a further note, searching UniGene is successful with "CO349769", but not with "49431086". I've set up the following:
>library(biomaRt)
>ensembl=useMart("ensembl",dataset="drerio_gene_ensembl"))
Question1: If ESTs are mapped (based on previous posts they don't look like they are), what is the appropriate filter variable in the line of code below?
>map<-getBM(attributes=c("zfin_id","zfin_symbol"),filters="?",values="CO349769",mart=ensembl)
Question2: If it turns out ESTs are not mapped, I'll probably have to go through UniGene to obtain an ID I can use with biomaRt. Does anyone know of a way to batch search UniGene? 6000+ ids is a lot to search one by one.
Thanks for any help,
> sessionInfo()
R version 2.8.0 (2008-10-20)
i386-pc-mingw32
locale:
LC_COLLATE=English_United States.1252;LC_CTYPE=English_United States.1252;LC_MONETARY=English_United States.1252;LC_NUMERIC=C;LC_TIME=English_United States.1252
attached base packages:
[1] splines tools stats graphics grDevices utils datasets methods base
other attached packages:
[1] biomaRt_1.16.0 genefilter_1.22.0 survival_2.34-1 geneplotter_1.20.0 annotate_1.20.0 xtable_1.5-4 AnnotationDbi_1.4.0 lattice_0.17-15
[9] limma_2.16.2 affy_1.20.0 Biobase_2.2.0
loaded via a namespace (and not attached):
[1] affyio_1.10.0 DBI_0.2-4 grid_2.8.0 KernSmooth_2.22-22 preprocessCore_1.4.0 RColorBrewer_1.0-2 RCurl_0.91-0
[8] RSQLite_0.7-1 XML_1.94-0.1
Scott A. Ochsner, Ph.D.
NURSA Bioinformatics
Molecular and Cellular Biology
Baylor College of Medicine
Houston, TX. 77030
phone: 713-798-6227
More information about the Bioconductor
mailing list