[R] create matrix

jim holtman jholtman at gmail.com
Thu Mar 20 10:55:31 CET 2008


Since you did not provide a sample of your data, here is an example of
how to take a vector and create a matrix with 5 entries for each
value, with the extra ones having a zero in the second column:

> x <- sample(1:7, 20, T)
> table(x)
x
1 2 3 4 5 6 7
2 4 3 2 4 4 1
> # create a matrix with 5 rows of each value in the vector 'x' with the extra rows
> # having 0 in the second column
> x.l <- lapply(split(x, x), function(.val){
+     # pad with at least 5 extra rows to make sure matrix is filled out
+     z <- cbind(c(.val, rep(.val[1],5)), c(rep(1, length(.val)), rep(0,5)))
+     z[1:5,]   # only return the first 5
+ })
> # output the new matrix
> do.call(rbind, x.l)
      [,1] [,2]
 [1,]    1    1
 [2,]    1    1
 [3,]    1    0
 [4,]    1    0
 [5,]    1    0
 [6,]    2    1
 [7,]    2    1
 [8,]    2    1
 [9,]    2    1
[10,]    2    0
[11,]    3    1
[12,]    3    1
[13,]    3    1
[14,]    3    0
[15,]    3    0
[16,]    4    1
[17,]    4    1
[18,]    4    0
[19,]    4    0
[20,]    4    0
[21,]    5    1
[22,]    5    1
[23,]    5    1
[24,]    5    1
[25,]    5    0
[26,]    6    1
[27,]    6    1
[28,]    6    1
[29,]    6    1
[30,]    6    0
[31,]    7    1
[32,]    7    0
[33,]    7    0
[34,]    7    0
[35,]    7    0
>


On Thu, Mar 20, 2008 at 1:51 AM, Felix Zajitschek - UNSW
<felix.zajitschek at unsw.edu.au> wrote:
> Hi all,
>
> I have a dataset consisting of 5 columns and over 5000 rows. Each row
> gives information about an individual animal, including longevity, i.e.
> at what age an animal died.
> For the model I use I need to create n rows for each animal, n being its
> longevity, and a new column 'survival' with a binary 0/1 outcome. When
> an animal died e.g. at age 5, there have to be 5 rows of identical data,
> except 4 with 0 (=alive) for 'survival', and 1 row with '1' for
> 'survival'.
>
> I thought of creating matrices for each individual, adding first one
> column 'survival' containing zeros to the original dataset, then
> creating matrices with data = 'the vector containing all elements of an
> individual/row' ([1,], nrow = [a,b], exctracting the element for
> longevity, and then with byrow = TRUE letting the data be filled in by
> row. At the end I would have to set the last element in 'survival' to
> '1', and then combine all matrices into one single one.
>
> So far I've used Excel to create these datesets manually, but with more
> than 1000 individuals this gets really tedious. I haven't used R before
> for this sort of a bit more advanced data manipulation, and I would
> really appreciate any input/primer about how people would go about doing
> this.
>
> Thanks,
> Felix
>
>
> ______________________________________________________________
> ::Felix Zajitschek
> Evolution & Ecology Research Centre
> School of Biological, Earth and Environmental Sciences
> University of New South Wales - Sydney NSW 2052 - Australia
> Tel           +61 (0)2 9385 8068
> Fax          +61 (0)2 9385 1558
> eMail       <mailto:felix.zajitschek at unsw.edu.au>
> felix.zajitschek at unsw.edu.au
>
> <http://www.bees.unsw.edu.au/school/researchstudents/zajitschekfelix.htm
> l> www.bees.unsw.edu.au/school/researchstudents/zajitschekfelix.html
>
>
>        [[alternative HTML version deleted]]
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>



-- 
Jim Holtman
Cincinnati, OH
+1 513 646 9390

What is the problem you are trying to solve?



More information about the R-help mailing list