[R] random sample from arrays

Joris Meys jorismeys at gmail.com
Thu Jul 8 15:20:08 CEST 2010


Don't know what exactly you're trying to do, but you make a matrix
with 11 columns and 50 rows, then treat it as a vector. On top of
that, you try to fill 50 rows/columns with 50 values. Off course that
doesn't work. Did you check the warning messages when running the
code?

Either do :

 for(i in c(1:11)){
    set[,i] <-sample(x,50)
    print(c(i,"->", set), quote = FALSE)
   }

or

 for(i in c(1:50)){
    set[i,] <-sample(x,11)
    print(c(i,"->", set), quote = FALSE)
   }

Or just forget about the loop altogether and do :

set <- replicate(11,sample(x,50))
or
set <- t(replicate(50,sample(x,11)))

cheers

On Thu, Jul 8, 2010 at 8:04 AM, Assa Yeroslaviz <frymor at gmail.com> wrote:
> Hello R users,
>
> I'm trying to extract random samples from a big array I have.
>
> I have a data frame of over 40k lines and would like to produce around 50
> random sample of around 200 lines each from this array.
>
> this is the matrix
>          ID xxx_1c xxx__2c xxx__3c xxx__4c xxx__5T xxx__6T xxx__7T xxx__8T
> yyy_1c yyy_1c _2c
> 1 A_512  2.150295  2.681759  2.177138  2.142790  2.115344  2.013047
> 2.115634  2.189372  1.643328  1.563523
> 2 A_134 12.832488 12.596373 12.882581 12.987091 11.956149 11.994779
> 11.650336 11.995504 13.024494 12.776322
> 3 A_152  2.063276  2.160961  2.067549  2.059732  2.656416  2.075775
> 2.033982  2.111937  1.606340  1.548940
> 4 A_163  9.570761 10.448615  9.432859  9.732615 10.354234 10.993279
> 9.160038  9.104121 10.079177  9.828757
> 5 A_184  3.574271  4.680859  4.517047  4.047096  3.623668  3.021356
> 3.559434  3.156093  4.308437  4.045098
> 6 A_199  7.593952  7.454087  7.513013  7.449552  7.345718  7.367068
> 7.410085  7.022582  7.668616  7.953706
> ...
>
> I tried to do it with a for loop:
>
> genelist <- read.delim("/user/R/raw_data.txt")
> rownames(genelist) <- genelist[,1]
> genes <- rownames(genelist)
>
> x <- 1:40000
> set <- matrix(nrow = 50, ncol = 11)
>
> for(i in c(1:50)){
>    set[i] <-sample(x,50)
>    print(c(i,"->", set), quote = FALSE)
>    }
>
> which basically do the trick, but I just can't save the results outside the
> loop.
> After having the random sets of lines it wasn't a problem to extract the
> line from the arrays using subset.
>
> genSet1 <-sample(x,50)
> random1 <- genes %in% genSet1
> subsetGenelist <- subset(genelist, random1)
>
>
> is there a different way of creating these random vectors or saving the loop
> results outside tjhe loop so I cn work with them?
>
> Thanks a lot
>
> Assa
>
>        [[alternative HTML version deleted]]
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>



-- 
Joris Meys
Statistical consultant

Ghent University
Faculty of Bioscience Engineering
Department of Applied mathematics, biometrics and process control

tel : +32 9 264 59 87
Joris.Meys at Ugent.be
-------------------------------
Disclaimer : http://helpdesk.ugent.be/e-maildisclaimer.php



More information about the R-help mailing list