[BioC] VariantAnnotation: Performance and memory issues in readVcf

Valerie Obenchain vobencha at fhcrc.org
Wed May 15 18:26:44 CEST 2013


> INFO variables with multiple values are parsed into CompressedLists.
> FORMAT variables with multiple values are parsed into arrays. We thought
> these parsed classes were more beneficial to the user for further
> exploration/analysis. If you only want the list form you can use
> scanBam() instead of readVcf(). scanBam() also takes a 'param' argument
> so you can subset on position, variables etc.

Sorry, not scanBam() but scanTabix(), as you've already discovered.
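
For reference, a minimal sketch of what a scanTabix() call restricted by
position might look like. It assumes Rsamtools, GenomicRanges and
VariantAnnotation are installed and uses the chr22.vcf.gz example file that
ships with VariantAnnotation; the region is purely illustrative.

```r
library(Rsamtools)       # scanTabix(), TabixFile()
library(GenomicRanges)   # GRanges(); also attaches IRanges

# Assumption: the bgzipped, tabix-indexed example VCF from VariantAnnotation.
fl <- system.file("extdata", "chr22.vcf.gz", package = "VariantAnnotation")

# Illustrative region only; substitute the positions you care about.
rng <- GRanges("22", IRanges(start = 50301422, end = 50312106))

# The result is a list with one element per requested range; each element
# is a character vector of raw, unparsed VCF lines.
res <- scanTabix(TabixFile(fl), param = rng)
length(res)
head(res[[1]], 2)
```

The records come back as plain text, so the INFO and FORMAT fields are not
parsed into CompressedLists or arrays; any splitting is left to the caller.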

Valerie


