[Bioc-devel] Cytogenetic bands

Valerie Obenchain vobencha at fhcrc.org
Tue Mar 25 20:49:11 CET 2014


Hi Ilari,

org.Hs.eg.db is one of the packages included in Homo.sapiens and it's 
the origin of 'MAP'. This variable maps between entrez gene ids and 
cytoband names, not genomic coordinates (as you've discovered). It 
includes bands and sub-bands provided by Entrez Gene downloaded from here:

   ftp://ftp.ncbi.nlm.nih.gov/gene/DATA

To see a full description of 'MAP':

   library(org.Hs.eg.db)
   ?org.Hs.egMAP

We don't have an annotation package with cytoband coordinates but you 
can download them using rtracklayer:

library(rtracklayer)
session <- browserSession()
genome(session) <- "hg19"
query <- ucscTableQuery(session, "cytoBandIdeo")
tbl <- getTable(query)

>> dim(tbl)
> [1] 931   5

>> head(tbl)
>   chrom chromStart chromEnd   name gieStain
> 1  chr1          0  2300000 p36.33     gneg
> 2  chr1    2300000  5400000 p36.32   gpos25
> 3  chr1    5400000  7200000 p36.31     gneg
> 4  chr1    7200000  9200000 p36.23   gpos25
> 5  chr1    9200000 12700000 p36.22     gneg
> 6  chr1   12700000 16200000 p36.21   gpos50

Valerie


On 03/22/2014 10:12 AM, Ilari Scheinin wrote:
> Hi,
>
> I would like to obtain the boundaries of cytogenetic bands for human (hg19) as I need to map arbitrary genomic positions to the band containing them. I figured these would be available via the Homo.sapiens annotation package, so I took a look at the available keytypes. MAP looked promising:
>
>> library(Homo.sapiens)
>> head(keys(Homo.sapiens, keytype="MAP"))
> [1] "19q13.4"  "12p13.31" "8p22"     "14q32.1"  "3q25.1"   “2q35”
>
> However, upon a closer look, these don’t appear to be the actual bands themselves, but are instead the matching bands for some other level of data, as it contains entries such as "19q13-qter”. (And there are 2,446 of these entries whereas there are 862 bands.)
>
> A bit of searching returned two software packages that do contain this information: idiogram (data(Hs.cytoband)) and OmicCircos (data(UCSC.hg19.chr)). The first one seems to be from genome build hg17, but the second one has hg18 and hg19. However, using a software package instead of an annotation one to obtain this information seems wrong, and makes me worry if it will be kept up-to-date in the future (c.f. idiogram).
>
> So, are the coordinates of the cytogenetic bands contained somewhere in the annotation packages?
>
> Thanks,
> Ilari
>
> _______________________________________________
> Bioc-devel at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/bioc-devel
>


-- 
Valerie Obenchain
Program in Computational Biology
Fred Hutchinson Cancer Research Center
1100 Fairview Ave. N, M1-B155
P.O. Box 19024
Seattle, WA 98109-1024

E-mail: vobencha at fhcrc.org
Phone:  (206) 667-3158
Fax:    (206) 667-1319



More information about the Bioc-devel mailing list