[Bioc-devel] request: high-level seqlevel utilities
    Julian Gehring 
    julian.gehring at embl.de
       
    Mon Dec 30 14:18:18 CET 2013
    
    
  
Hi,
With the convenience that seqnamesStyles offers now, having to specify 
the chromosome name notation manually would feel like a step back.  In 
terms of subsetting genomic ranges, I normally think of four major 
groups of interest:
- Toplevel/standard: 1,..22,X,Y,MT
- Autosomes: 1,..,22
- Allosomes: X,Y
- "Linear": 1,..,22,X,Y
If you are concerned about confusing the user with many specialized 
functions, how about extending 'keepSeqlevels' by adding a e.g. 'group' 
argument that allows you to select a group of chromosomes as above.  As 
an example, think of:
## subset as before by seqname
keepSeqlevels(gr, "1")
keepSeqlevels(gr, value = "1")
## the new feature
keepSeqlevels(gr, group = "autosomes")
which would dispatch to specialized methods like '.keepAutosomes' in 
combination with 'seqnames.db'.  This way, one could also create a 
setting in which the groups can be easily extended by the user or other 
packages, by simply defining more of the specialized functions.
Best wishes
Julian
    
    
More information about the Bioc-devel
mailing list