[BioC] Howto annotate blast subject.id with AnnotationDbi
Arnaud Mounier
arnaud.mounier at dijon.inra.fr
Mon May 27 10:12:22 CEST 2013
Hi,
I try to annotate a blastp result (not so much, just 270 query) with the
org.At.tair.db from AnnotationDBi package throw bioconductor.
First I read blast output with RFLPtools :
> df.blast.report <- read.blast(file = f.blast.report)
> head(df.blast.report)
query.id subject.id identity alignment.length mismatches
gap.opens q.start q.end s.start s.end evalue bit.score
1 medtr8g018420.1 AT1G55020.1 59.77 860 314
9 9 856 20 859 0 1058
2 medtr8g018420.1 AT3G22400.1 56.16 869 344
10 9 856 34 886 0 1004
3 medtr8g018420.1 AT1G72520.1 45.19 821 433
10 45 856 114 926 0 729
The subject ID have version number (.1 or .2) and the original
ATH_GO_GOSLIM.txt from tair site two. But this version number is not
present in the org.At.tait.dbTAIR :
> head(keys(org.At.tair.db, keytype="TAIR"))
[1] "AT1G01010" "AT1G01020" "AT1G01030" "AT1G01040" "AT1G01050" "AT1G01060"
* Is this relevant or can I annotate without taking care of the version
number ?
Does Org.At.tair.db keep the version number elsewhere ?
Because the source file for this package
(ftp://ftp.arabidopsis.org/Ontologies/Gene_Ontology/ATH_GO_GOSLIM.txt)
store it initialy.
* As the query must be selected in function of her subjects annotations
and GO.db, I want to merge all info (blast report, org.At.tair.db +
GO.db) in one db with a bioconductor package (annotationForge perhaps).
So, is there a package or a GNU script to manage this association easily
or do i wrote my own R scripts ?
Any links are welcomes !
Thank's in advance,
Ar.
--
« Le soleil filtre à travers les branches des arbres par éclairs, comme
le sens à travers la langue. »
Nancy Huston
Arnaud Mounier
INRA - UMR Agroécologie 1347
CNRS - ERL IPM 6300 (Plant-Microorganism Interaction)
17, rue Sully - BP 86510 - F-21065 Dijon Cedex - France
Work phone : +33 380 693 167 - Fax : +33 380 693 753
https://www6.dijon.inra.fr/umragroecologie/Personnel/IPM/ITA/MOUNIER-Arnaud
More information about the Bioconductor
mailing list