[BioC] problem with rat database
Alberto Goldoni
alberto.goldoni1975 at gmail.com
Tue May 10 11:30:53 CEST 2011
Dear All,
i'm analyzing agilent microarrays with the "rgug4130a.db" database and
using the function:"topTable(fit2,number=500,adjust="BH")" i have
obtained 500 genes like these:
Row Col ProbeUID ControlType ProbeName GeneName SystematicName Description X.hda.str...ref. X.ref.str...ref. X.hda.str...ref.str. AveExpr F P.Value adj.P.Val
16096 79 38 15309 0 A_43_P10328 CB606456 CB606456 unknown
function 3.988290607 -0.951656306 4.939946913 10.29735936 36.77263264 0.000212298 0.641094595
8109 40 109 7609 0 A_42_P552092 203358_Rn 203358_Rn Rat c-fos
mRNA. 5.670956889 4.413365374 1.257591514 13.47699544 33.20342601 0.000292278 0.641094595
but as you can see most genes like the first one - CB606456 - in the
DESCRPTION there is written "unknown function".
So i have performed a very simply search.
1) First in ENSAMBLE using the GeneName "CB606456" with the "Locations
of DnaAlignFeature" it gives to me the Genomic location(strand): chr
7:16261621-16262210
2) Then in the Rat Genome Database
(http://rgd.mcw.edu/tools/genes/genes_view.cgi?id=735058) i have found
that in this position there is one gene:
735058 GENE Angptl4 angiopoietin-like 4 7 16261623 16267852
so the question is why in the "rgug4130a.db" database the R system
gives to me "unknown function" when using the genomic location in
ensamble and then in rgd it gives to me the Angptl4 gene!
and there is a function in order to do to R to perform this kind of
search automatically? (this why in my 500 genes there are 100 "unknow
function" genes and it will be interesting to have a function that
perform this kind of search automatically).
Best regards to all and to whom answer to me.
--
-----------------------------------------------------
Dr. Alberto Goldoni
Parma, Italy
More information about the Bioconductor
mailing list