[BioC] A question about org.Hs.egMAPCOUNTS
mcarlson at fhcrc.org
Wed Mar 3 01:18:05 CET 2010
To answer questions like this you might want to look at the manual page
for count.mappedRkeys() and count.mappedLkeys(). These can be used to
count the number of mapped keys in a mapping.
# An example to give you an idea:
# This gives us:
# That is the same number as you saw in the MAPCOUNTS, But what exactly
did we just count?
# Lets look at one key just to see:
# So you can see that we were just found that 8245 GO terms are matched
onto some gene!
# So how can we see how many genes are matched onto GO terms??
# which comed out to be 17673
# And just to verify that the Lkeys are entrez gene IDs:
You may have noticed that the Lkeys are still the entrez gene IDs in
both the "reversed" maps as well as the "forward" maps. This is because
we are using "undirected" methods as described in the manual page. So
when using such methods you should pay attention to what you are
counting as this can seem confusing when used with a "reversed" such as
a "GO2EG" mapping. You might expect that the Lkeys and Rkeys would
switch places, but for these methods they have not.
Hope this clarifies things a bit,
On 03/02/2010 01:19 PM, Gilbert Feng wrote:
> Hello, BioC forks,
> I notice that org.Hs.egMAPCOUNTS reports that org.Hs.egGO2EG is 8245. Are
> these 8245 genes are unique or, do all of GO terms contain 8245 human genes
> (could be counted many times)? Actually, I wonder how many unique human
> genes in GO and its subdirectories, BP, MF and CC. Is there any function to
> retrieve such information easily or I have to write several lines to do
> Thanks a lot!
> Bioconductor mailing list
> Bioconductor at stat.math.ethz.ch
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
More information about the Bioconductor