[R] reordering huge data file
Boks, M.P.M.
M.P.M.Boks at umcutrecht.nl
Mon Jan 21 22:45:41 CET 2008
Dear R-experts,
My problem is how to handle a 10GB data file containing genotype data. The file is in a particular format (Illumina final report) and needs to be altered and merged with phenotype data for further analysis.
PERL seems to be an frequently used solution for this type of work, however I am inclined to think it should be doable with R.
How do I open a text-file, line by line, evaluate it and write it back into a textfile in a different position;
Phenotypeinfo.txt (contains phenotype information)
Before.txt (contains genotypeinformation -see below-)
SNP;1-305,000 ID:1-900 allele.A alleleB
After.txt (the required format)
ID:1-250 phenotype SNP1.allelA SNP1.alleleB SNP2.Allele.A SNP2.allele.B etc
I have been looking at ?read.table/scan/readline/SQL-light but have not resolved it. Should I refer to PERL or can this be tackled?
I am using a windows machine with R 2.6.0
Any help would be highly appreciated,
Many Thanks,
Marco
More information about the R-help
mailing list