[R] disaggregate frequency table into flat file

Marc Schwartz marc_schwartz at comcast.net
Thu May 22 16:04:43 CEST 2008


Is this what you want?

 > xtabs(Freq ~ Var1 + Var2, data = orig)
     Var2
Var1  A  B
    A 40 30
    B  5 25

See ?xtabs


Or is this what you want?

expand.dft <- function(x, na.strings = "NA", as.is = FALSE, dec = ".")
{
   DF <- sapply(1:nrow(x), function(i) x[rep(i, each = x$Freq[i]), ],
                simplify = FALSE)

   DF <- subset(do.call("rbind", DF), select = -Freq)

   for (i in 1:ncol(DF))
   {
     DF[[i]] <- type.convert(as.character(DF[[i]]),
                             na.strings = na.strings,
                             as.is = as.is, dec = dec)

   }

   DF
}



DF <- expand.dft(orig)

 > str(DF)
'data.frame':	100 obs. of  2 variables:
  $ Var1: Factor w/ 2 levels "A","B": 1 1 1 1 1 1 1 1 1 1 ...
  $ Var2: Factor w/ 2 levels "A","B": 1 1 1 1 1 1 1 1 1 1 ...


HTH,

Marc Schwartz


on 05/22/2008 07:56 AM maiya wrote:
> sorry, my mistake!
> the data frame should read:
> orig<-as.data.frame.table(orig)
> orig
>  Var1 Var2 Freq
> 1    A    A   40
> 2    B    A    5
> 3    A    B   30
> 4    B    B   25
> 
> but basicaly i would simply like a sample of the original matrix ( which is
> a frequency table/contingency table/crosstabulation)
> 
> hope this is clearer now!
> 
> maja
> 
> 
> 
> 
> 
> 
> jholtman wrote:
>> Not exactly clear what you are asking for.  Your data.frame.table does not
>> seem related to the original 'orig'.  What exactly are you expecting as
>> output?
>>
>> On Wed, May 21, 2008 at 10:16 PM, maiya <maja.zaloznik at gmail.com> wrote:
>>
>>> i appologise for the trivialness of this post - but i've been searching
>>> the
>>> forum wothout luck - probably simply because it's late and my brain is
>>> starting to go..
>>>
>>> i have a frequency table as a matrix:
>>>
>>> orig<-matrix(c(40,5,30,25), c(2,2))
>>> orig
>>>     [,1] [,2]
>>> [1,]   40   30
>>> [2,]    5   25
>>>
>>> i basically need a random sample say 10 from 100:
>>>
>>>     [,1] [,2]
>>> [1,]   5   2
>>> [2,]    0   3
>>>
>>> i got as far as
>>>
>>> orig<-as.data.frame.table(orig)
>>> orig
>>>  Var1 Var2 Freq
>>> 1    A    A   10
>>> 2    B    A    5
>>> 3    A    B   30
>>> 4    B    B   25
>>>
>>> and then perhaps
>>>
>>> individ<-rep(1:4, times=orig$Freq)
>>>
>>> which gives a vector of the 100 individuals in each of the 4 groups -
>>> cells,
>>> but I'm
>>> (a) stuck here and
>>> (b) afraid this is a very round-about way at getting to what I want i.e.
>>> I
>>> can now sample(individ, 10), but then I'll have a heck of a time getting
>>> the
>>> result back into the original matrix form....
>>>
>>> sorry again, just please tell me the simple solution that I've missed?
>>>
>>> thanks!
>>>
>>> maja
>>>
>



More information about the R-help mailing list