[BioC] faster way to get differential calls from pileup?
Martin Morgan
mtmorgan at fhcrc.org
Sat Oct 16 22:33:11 CEST 2010
On 10/16/2010 11:54 AM, Hollis Wright wrote:
> Hi, all; I've got a pair of lanes of exome sequencing data; we've
> generated pileup files from samtools and we're interested in looking
> for discordant calls for quality control or snp discovery. As best I
> can figure out the way to do this involves doing a findOverlaps and
> the programatically iterating through the match matrix to get the
> matching positions and check for differences. However, the overlap
> finding takes several hours, and since we anticipate there being many
This sounds like it's taking longer than findOverlaps should be taking;
perhaps you are running out of memory (so process in batches, e.g., by
chromosome) or doing something inefficiently. What does your code look
like (simplified, if possible...)
Martin
> lanes in the future I'm curious if there's a faster or better way to
> go about this sort of process. Thanks...
>
> Hollis Wright
>
> Sent from my iPhone
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at stat.math.ethz.ch
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
--
Computational Biology
Fred Hutchinson Cancer Research Center
1100 Fairview Ave. N. PO Box 19024 Seattle, WA 98109
Location: M1-B861
Telephone: 206 667-2793
More information about the Bioconductor
mailing list