[BioC] How to plot gene on their chromosome?
Simon Noël
simon.noel.2 at ulaval.ca
Fri May 22 05:18:29 CEST 2009
I am sorry. Has I say, I am new to R and bioconductor. I am still learning.
I will try this tomorow and I will write you back to tell you the result.
Thank's you for your help. Il it's working, I will only have to wory about my
problem with qc()... Or at least for now. The summer is really young;)
Selon Hervé Pagès <hpages at fhcrc.org>, 21.05.2009:
> Hi Simon,
>
> Not a good idea to start a new thread by replying to a different thread
> you started previously. Then it shows up under the previous thread even
> if you changed the subject.
>
> more below...
>
> Simon Noël wrote:
> > Hello every one. I have a question. I have a gene list in a .xls like
> >
> > probeID Symbol
> > 1030431 ACSL1
> > 4610431 ACTG2
> > 4810575 ADAMTSL2
> > 1510750 ADH1C
> > 4060519 ADORA1
> > 5720523 ADRA2A
> > 2810482 AHNAK
> > 1260270 AIM2
> > 4180768 ALAS2
> > ... ...
> >
> > I want to plote all of those genes on their chromosome. How can I do this?
>
> So first you need to map each gene to its chromosome location.
>
> You can use one of the org.*.eg.db annotation packages for
> this (pick up the one for your organism):
>
> http://bioconductor.org/packages/release/data/annotation/
>
> and use the SYMBOL2EG map to map your gene symbols to their corresponding
> Entrez IDs and then the CHRLOC map to map your Entrez IDs to their chromosome
> locations.
>
> Example:
>
> library(org.Hs.eg.db)
> mysymbols <- c("ACSL1", "ACTG2", "ADAMTSL2", "ADH1C",
> "ADORA1", "ADRA2A", "AHNAK", "AIM2", "ALAS2")
> myEgIDs <- unlist(mget(mysymbols, org.Hs.egSYMBOL2EG))
> mylocs <- unname(unlist(mget(myEgIDs, org.Hs.egCHRLOC)))
>
> One thing to be aware of is that those mappings are not necessarily
> one-to-one e.g. the same symbol can be associated with different genes:
>
> > flat <- toTable(org.Hs.egSYMBOL2EG)
> > names(flat)
> [1] "gene_id" "symbol"
> > any(duplicated(flat$gene_id))
> [1] FALSE
> > any(duplicated(flat$symbol))
> [1] TRUE
>
> The same thing happens with the org.Hs.egCHRLOC map (I'm not sure
> why we have this though, may be others on the list can explain).
>
> Anyway this explains why 'mylocs' can have more elements than 'mysymbols'.
>
> Cheers,
> H.
>
> >
> > Simon Noël
> > VP Externe CADEUL
> > Association des étudiants et étudiantes en Biochimie, Bio-
> > informatique et Microbiologie de l'Université Laval
> > CdeC
> >
> > _______________________________________________
> > Bioconductor mailing list
> > Bioconductor at stat.math.ethz.ch
> > https://stat.ethz.ch/mailman/listinfo/bioconductor
> > Search the archives:
> http://news.gmane.org/gmane.science.biology.informatics.conductor
>
> --
> Hervé Pagès
>
> Program in Computational Biology
> Division of Public Health Sciences
> Fred Hutchinson Cancer Research Center
> 1100 Fairview Ave. N, M2-B876
> P.O. Box 19024
> Seattle, WA 98109-1024
>
> E-mail: hpages at fhcrc.org
> Phone: (206) 667-5791
> Fax: (206) 667-1319
>
>
Simon Noël
VP Externe CADEUL
Association des étudiants et étudiantes en Biochimie, Bio-
informatique et Microbiologie de l'Université Laval
CdeC
More information about the Bioconductor
mailing list