[R] Pattern Analysis Libraries
Bert Gunter
bgunter@4567 @end|ng |rom gm@||@com
Tue Dec 17 02:17:53 CET 2019
Your specification seems too vague to me. What sort of "patterns" are of
interest?
See also ?table on your "concatenated" columns, e.g. something like:
table(do.call(paste0, yourdata.frame))
or even
do.call(table,yourdata.frame)
for a contingency table.
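A minimal sketch of both suggestions, using made-up data that mirrors the example in the question (the names df, Col1, Col2, Col3, key, pattern_counts, ct and observed are hypothetical). It swaps paste0 for paste with an explicit separator, since without one distinct rows such as ("A1", "1") and ("A", "11") would collapse into the same key:

```r
# Hypothetical data mirroring the three rows in the original question
df <- data.frame(Col1 = c("A", "A", "A"),
                 Col2 = c(1, 2, 1),
                 Col3 = c("aa", "bb", "aa"),
                 stringsAsFactors = FALSE)

# Approach 1: paste the columns into one key per row, then tabulate.
# The separator keeps keys from different rows from colliding.
key <- do.call(paste, c(df, sep = "-"))
pattern_counts <- table(key)

# Approach 2: cross-tabulate all columns, then keep only the
# combinations that actually occur in the data.
ct <- as.data.frame(do.call(table, df), responseName = "n")
observed <- ct[ct$n > 0, ]

print(pattern_counts)
print(observed)
```

Note that the full cross-tabulation allocates a cell for every combination of levels, so with many distinct values per column the pasted-key approach may be the more economical of the two.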
There are books written on the "analytics" (both statistical and graphical)
of multidimensional contingency tables and categorical data that you may
wish to consult for more specific ideas.
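As one possible starting point in base R, a multidimensional table can be flattened for display with ftable() and collapsed over selected dimensions with margin.table(). The sketch below again uses made-up data (df, tab, flat and col1_by_col3 are hypothetical names) and is only an illustration, not a recommendation of a particular analysis:

```r
# Hypothetical data mirroring the three rows in the original question
df <- data.frame(Col1 = c("A", "A", "A"),
                 Col2 = c(1, 2, 1),
                 Col3 = c("aa", "bb", "aa"),
                 stringsAsFactors = FALSE)

# table() on a data.frame cross-classifies its columns
tab <- table(df)

# Flatten the three-way table into a two-dimensional display
flat <- ftable(tab, row.vars = c("Col1", "Col2"))

# Collapse over Col2 to get the Col1 by Col3 marginal counts
col1_by_col3 <- margin.table(tab, margin = c(1, 3))

print(flat)
print(col1_by_col3)
```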
Bert Gunter
"The trouble with having an open mind is that people keep coming along and
sticking things into it."
-- Opus (aka Berkeley Breathed in his "Bloom County" comic strip )
On Mon, Dec 16, 2019 at 11:13 AM Jeff Reichman <reichmanj using sbcglobal.net>
wrote:
> R-Help
>
> I have a need to find aggregated patterns within a data.frame of some 80
> million records and wanted to know if there are any packages which could be
> used to find patterns by row. For example
>
> Col1  Col2  Col3
> A     1     aa
> A     2     bb
> A     1     aa
>
> In this example pattern A - 1 - aa occurs twice, and A - 2 - bb occurs
> once.
> Presently I'm simply concatenating the columns and performing a group by
> and count, which works, but I wonder whether there are any packages that
> would perform such (and maybe other) analytics.
>
> Sincerely
>
> Jeff Reichman
> (314) 457-1966