[BioC] GenomicAlignments and QNAME collision
Stefano Calza
stecalza at gmail.com
Thu May 8 17:14:23 CEST 2014
Hi everybody
I am using GenomicAlignments package to read RNAseq pair-end data. The
problem is that readGAlignmentPairsFromBam, after setting asMates=TRUE
in BamFile, returns 0 mates.
The reason is that mates have different QNAMEs. Eg:
UNC15-SN850:240:D148CACXX:3:1308:19719:99367/1
UNC15-SN850:240:D148CACXX:3:1308:19719:99367/2
that is the two mates have /1 or /2 at the end.
I wrote a Python (and a cpp) program to fix it...but this takes still
quite a substantial amount of time on big files.
Does the mating algorithm allow for this? If so how?
Regards
Stefano
More information about the Bioconductor
mailing list