[R] Split data frame into 250-row chunks
Liz Hare
doggene at earthlink.net
Wed Jun 10 14:39:01 CEST 2015
Hi R-Experts,
I have a data.frame like this:
> head(map)
chr snp poscm posbp dist
1 1 M1 2.99043 3249189 NA
2 1 M2 3.06457 3273096 0.07414
3 1 M3 3.17018 3307151 0.10561
4 1 M4 3.20892 3319643 0.03874
5 1 M5 3.28120 3342947 0.07228
6 1 M6 3.29624 3347798 0.01504
I need to split this into chunks of 250 rows (there will usually be a last chunk with < 250 rows).
If I only had to extract one 250-line chunk, it would be easy:
map1 <- map[1:250, ]
or using subset().
I tried to make it a loop iterating through num and using beg and nd for starting and ending indices, but I couldn’t figure out how to reference all the variables I needed in this:
> chunks
beg nd let num
1 1 250 a 1
2 251 500 b 2
3 501 750 c 3
4 751 1000 d 4
5 1001 1250 e 5
6 1251 1500 f 6
7 1501 1750 g 7
8 1751 2000 h 8
9 2001 2250 i 9
10 2251 2500 j 10
…
Remembering that loops are not always the best answer in R, I looked at other options like split, following this example but not being able to adapt it from a vector to a data.frame version
http://stackoverflow.com/questions/3318333/split-a-vector-into-chunks-in-r <http://stackoverflow.com/questions/3318333/split-a-vector-into-chunks-in-r> (Yes, I’ve reviewed the language documentation). I checked out ddply and data.table, but couldn’t find a way to use them with index positions instead of column values.
Thanks,
Liz
[[alternative HTML version deleted]]
More information about the R-help
mailing list