[R] summarized data set - how to use an "occurs" field

Dennis Murphy djmuser at gmail.com
Sat Jul 16 03:24:13 CEST 2011


Your count variable is a frequency weight: it records how many times
each row occurred in the original data, so it can be used directly in
summaries without re-expanding the data set. If you can be more specific
about what you want and post a representative sample (or some facsimile)
of your data using dput(), the list is likely to be more helpful. See
the posting guide linked at the bottom of this message for guidelines.
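
In the meantime, here is a rough sketch of treating the count as a
frequency weight. The data frame `dat` and its columns `group`, `score`
and `count` are made up for illustration (your 22 fields will differ),
so take this as one possible approach under those assumptions rather
than a finished answer.

```r
# Hypothetical summarized data: one row per unique combination of fields,
# plus a count of how often that combination occurred.
dat <- data.frame(
  group = c("a", "b", "a", "c"),
  score = c(10, 20, 30, 40),
  count = c(3, 1, 2, 4)
)

# Frequency distribution of a categorical field: sum the counts per level.
freq <- xtabs(count ~ group, data = dat)

# Bar graph of that distribution.
barplot(freq, ylab = "Frequency")

# Mean of a numeric field, weighting each row by its count.
wmean <- weighted.mean(dat$score, w = dat$count)

# Sample variance of the same field, as if the rows had been expanded.
wvar <- sum(dat$count * (dat$score - wmean)^2) / (sum(dat$count) - 1)
```

Nothing here expands the data; each statistic is computed from the
unique rows and their counts alone.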


On Fri, Jul 15, 2011 at 3:10 PM, mloxton <mhloxton at gmail.com> wrote:
> I have a data set with 22 fields and several thousand records in which
> one field (count) indicates the number of times that each specific
> combination of the other 21 fields occurred in a bigger and largely
> unavailable data set.
> So each record is unique in its combination of field values and has a
> field that identifies how many multiples of this record actually
> occurred.
> Without resorting to writing a program that re-expands the data set to
> several million rows by cloning each row by the number of times the
> "count" field indicated, is there a way in R to use that field to come
> up with summary stats and bargraphs of the distribution of any one of
> the other fields?
> best
> Matthew
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
