[R-sig-genetics] Reading large VCF into genind

Sven E. Templer sven.templer at gmail.com
Fri May 29 16:31:52 CEST 2015


Dear Stefano,

could you describe which package:::function(s) you used (providing code
helps to help!)?

You could split the vcf file before reading it into the R workspace by
chromosomes (use GNU grep, for example), then extract data for single files
(e.g. frequencies, subset of SNPs, etc.). Export this data, and start a new
session which loads only the processed information.

Best,
Sven



On 29 May 2015 at 16:07, Stefano Iantorno <si3 at sanger.ac.uk> wrote:

> Hello
>
> I have a VCF file containing 306596 biallelic SNP calls from 18
> individuals. I was able to read the entire file as a “loci” object but when
> I try to convert it into a genind object I run into memory issues.
>
> How can I circumvent this problem? Should I break the single “loci” object
> into smaller subsets with only 50 000 SNPs or so, and convert each one into
> a genind object separately with loci2genind? How do I combine the resulting
> genind objects?
>
> - Stefano
>
>
>
> --
>  The Wellcome Trust Sanger Institute is operated by Genome Research
>  Limited, a charity registered in England with number 1021457 and a
>  company registered in England with number 2742969, whose registered
>  office is 215 Euston Road, London, NW1 2BE.
>
>
>
>         [[alternative HTML version deleted]]
>
>
> _______________________________________________
> R-sig-genetics mailing list
> R-sig-genetics at r-project.org
> https://stat.ethz.ch/mailman/listinfo/r-sig-genetics
>
>

	[[alternative HTML version deleted]]



More information about the R-sig-genetics mailing list