[Bioc-sig-seq] AlignedRead and complex subsetting
Steve Lianoglou
mailinglist.honeypot at gmail.com
Tue Sep 1 00:53:58 CEST 2009
Hi Ivan,
On Aug 31, 2009, at 4:54 PM, Ivan Gregoretti wrote:
> Hello Everybody,
>
> How do you subset an AlignedRead instance to keep (or reject) tags
> that lay within a set of genomic regions?
>
>
> Example
>
> Lets say that I have an AlignedRead instance called aln.
>
> Now let's say that I have a set of positions in BED style:
>
> (chromosome, start end)
> ch1 1000000 1000050
> chrX 20000000 20100000
> ...(many more)...
>
> We can imagine that I have the BED set loaded as a data frame.
>
> Is it possible to pick from aln only the tags within (or outside) the
> features defined in the table described above?
I think that you should convert your BED file to an IRanges object,
and use overlap with your ranges + your readAligned object to get what
your after. See Martin's post about something like this in this thread:
https://stat.ethz.ch/pipermail/bioc-sig-sequencing/2009-August/000509.html
To get the reads *outside* of your ranges, maybe you can call the
``gaps`` on your bed/ranges and then do the same thing ... or
perhaps ``setdiff(ranges, aln)`` might work, too? (where aln is your
IRanges converted alignedRead object (if necessary)).
-steve
--
Steve Lianoglou
Graduate Student: Computational Systems Biology
| Memorial Sloan-Kettering Cancer Center
| Weill Medical College of Cornell University
Contact Info: http://cbio.mskcc.org/~lianos/contact
More information about the Bioc-sig-sequencing
mailing list