[R] A category reduction problem
Kathy Gerber
kmg5b at virginia.edu
Sat Mar 21 03:00:58 CET 2009
Many thanks to Bill and Thomas for both insight and this nice solution.
Kathy Gerber
William Dunlap wrote:
> If you diff the desired series you get all 2^n
> possible n-long sequences of 0's and 1's. Hence
> another solution is to convert the numbers in
> 0:(2^n-1) to their binary representations, putting
> the j'th bit into the j'column and run
> cumsum over the resulting rows.
>
> For efficiency in R do this a column at a time
> (n iterations), not a row at a time (2^n iterations).
> E.g.,
>
> f <- function (n=10)
> {
> ret <- matrix(0, nrow=2^n, ncol=1+n)
> x <- 0:(2^n-1)
> for(j in (n+1):2) {
> ret[,j] <- x%%2
> x <- x %/% 2
> }
> if (n>=2) for(j in 2:(n+1)) {
> ret[,j] <- ret[,j-1] + ret[,j]
> }
> ret
> }
>
> Bill Dunlap
> TIBCO Software Inc - Spotfire Division
> wdunlap tibco.com
>
> ------------------------------------------------------------------------
> --------
>
> [R] A category reduction problem
>
> Thomas Lumley tlumley at u.washington.edu
> Fri Mar 20 18:12:26 CET 2009
> Previous message: [R] A category reduction problem
> Next message: [R] struggling with pairlists
> Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
> I'd do this with recursion, I think
>
> * A valid (n+1) number sequence is a valid n-number sequence followed
> either by the last number of the sequence or the last number plus one.
>
> * A valid 1-number sequence is 0
>
>
> -thomas
>
>
> On Fri, 20 Mar 2009, Kathy Gerber wrote:
>
>
>> I am trying to print out a list of strings of length 11 based on
>>
> integers 0
>
>> through 10. The rules as given to me for the ordering are:
>> The first digit must be 0.
>> The 2nd digit must be 0 or 1.
>> The 3rd digit must equal the 2nd digit or the 2nd digit +1.
>> ...
>> Given the final digit, n, all digits 0 through n must appear in a
>>
> given
>
>> sequence.
>>
>> So the final 1024 item list should look like 0 1 2 3 4 5 6 7 8 9 10
>> 0 1 2 3 4 5 6 7 8 9 9
>> 0 1 2 3 4 5 6 7 8 8 9
>> 0 1 2 3 4 5 6 7 7 8 9
>> ...
>> 0 1 2 3 4 5 6 7 8 8 8
>> 0 1 2 3 4 5 6 7 7 8 8
>> ...
>> 0 1 2 3 3 3 3 3 3 3 3
>> 0 1 2 2 3 3 3 3 3 3 3
>> ...
>> 0 0 0 0 0 0 0 0 0 0 0
>>
>> This is easy to do for a small integer, e.g., to see what's going on I
>>
> drew the
>
>> tree for 5, getting 16 rows including the two trivial rows, 0 0 0 0 0
>>
> and 0 1 2
>
>> 3 4. Offsetting from 0-10 to 1-11, I have tried two basic approaches
>>
> with only
>
>> partial success: using nested loops (gets messy quickly), and using
>>
> the
>
>> partitions package to construct rows by generating combinations based
>>
> on the
>
>> partitions.
>>
>> Here's my very flawed code for completeness, but I'm guessing there is
>>
> a better
>
>> approach entirely. There are actually 56 partitions (here hardcoded
>>
> 2:19).
>
>> require(partitions)
>> b <- as.matrix(parts(11))
>> u <- b[-1,]
>> for (ptn in 2:19) {
>> s <- as.matrix(u[, ptn])
>> dss <- as.vector(s[which(s>0)])
>> if (length(dss) <= b[1, ptn]) {
>> comb <- combn(b[1, ptn], length(dss))
>> ccnt <- dim(comb)[2]
>> for (i in ccnt:1) {
>> a <- c(1:b[1,ptn])
>> for (k in 1:length(dss)) {
>> a <- c(a,rep(comb[k, i], u[k, ptn]))
>> }
>> cat(sort(a-1,"\n")
>> }
>> }
>> }
>>
>> I appreciate any insight or direction.
>>
>> "Counting is hard." -- Alan Lee Schwartz
>> -----------------------------------------------------
>> Kathy Gerber
>> University of Virginia
>> Research Computing Lab
>>
>> ______________________________________________
>> R-help at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide
>>
> http://www.R-project.org/posting-guide.html
>
>> and provide commented, minimal, self-contained, reproducible code.
>>
>>
>
> Thomas Lumley Assoc. Professor, Biostatistics
> tlumley at u.washington.edu University of Washington, Seattle
>
>
More information about the R-help
mailing list