[R] text mining analysis and word visualization of pdfs

Karl Ove Hufthammer karl at huftis.org
Wed May 18 10:14:42 CEST 2011


Ajay Ohri wrote:

> What is the appropriate software package for dumping say 20 PDFS in a
> folder, then creating data visualization with frequency counts of
> certain words as well as measure correlation within each file for
> certain key relationships or key words.

pdftotext + Unix™ for Poets + R (ggplot2)

HTH.

-- 
Karl Ove Hufthammer



More information about the R-help mailing list