[R-sig-genetics] ape, read.dna to phase fasta file not working properly, tajima
Ella Bowles
bowlese at gmail.com
Wed Aug 30 22:49:50 CEST 2017
fasta8c18.fa
<https://drive.google.com/file/d/0B6qb8IlaQGFZX0Q0YzNUVGNHV1E/view?usp=drive_web>
Hello,
I'm trying to complete the very simple task of reading in an unphased fasta
file and phasing it using ape, and then calculating Tajima's D using pegas,
but my data doesn't seem to be reading in correctly. Input and output is as
follows:
library("ape")
library("adegenet")
library("ade4")
library("pegas")
> DNAbin8c18 <- read.dna(file="fasta8c18.fa", format="f")
> data(DNAbin8c18)
Warning message:
In data(DNAbin8c18) : data set ‘DNAbin8c18’ not found
##clearly the data is not read in properly, so looked at what had been
loaded
> DNAbin8c18
817452 DNA sequences in binary format stored in a matrix.
All sequences of same length: 96
Labels:
CLocus_12706_Sample_1_Locus_34105_Allele_0 [BayOfIslands_s08...
CLocus_12706_Sample_2_Locus_31118_Allele_0 [BayOfIslands_s08...
CLocus_12706_Sample_3_Locus_30313_Allele_0 [BayOfIslands_s09...
CLocus_12706_Sample_5_Locus_33345_Allele_0 [BayOfIslands_s09...
CLocus_12706_Sample_7_Locus_37388_Allele_0 [BayOfIslands_s09...
CLocus_12706_Sample_8_Locus_29451_Allele_0 [BayOfIslands_s09...
...
More than 10 million nucleotides: not printing base composition
##although likely won't work, trying taj d test to see what happens
> tajima.test(DNAbin8c18)
Error: cannot allocate vector of size 2489.3 Gb
I'm sending the datafile along as a link as well.
Any thoughts would be much appreciated.
Ella
--
Ella Bowles, PhD
Postdoctoral Researcher
Department of Biology
Concordia University
Website: https://ellabowlesphd.wordpress.com/
Email: bowlese at gmail.com
[[alternative HTML version deleted]]
More information about the R-sig-genetics
mailing list