[R] readPDF() -- unsure how to install xpdf to make this work?
tony.breyal at googlemail.com
Thu Nov 13 16:10:14 CET 2008
I need to convert a set of '.pdf' files into an equivalent set of
'.txt' files. This is so that I can do some text mining on the
resulting text.
In the latest issue of R News
(http://cran.r-project.org/doc/Rnews/Rnews_2008-2.pdf), the package
'tm' for text mining is mentioned. In that lovely package, there is a
function called 'readPDF()'. Regarding its use, ?readPDF says:
"Note that this PDF reader needs both the tools pdftotext and
pdfinfo installed and accessable on your system."
These tools are available from http://www.foolabs.com/xpdf/download.html
I was able to download these tools, and I can easily use them from a
DOS window to convert a PDF file into a text file.
Question: how do I make these tools available to R, so that I can use
the readPDF() function?
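To make the question concrete, here is a minimal sketch of one thing that might be tried, assuming readPDF() finds pdftotext and pdfinfo through the PATH environment variable (the help page does not say so explicitly). The folder "C:/xpdf" is a hypothetical location standing in for wherever the xpdf executables were unpacked.

```r
# Hypothetical folder; replace with the directory that actually
# contains pdftotext.exe and pdfinfo.exe.
xpdf_dir <- "C:/xpdf"

# Prepend that folder to PATH for the current R session only.
old_path <- Sys.getenv("PATH")
Sys.setenv(PATH = paste(xpdf_dir, old_path, sep = .Platform$path.sep))

# Sys.which() returns the full path of each tool as R sees it,
# or an empty string when the tool cannot be found.
tools <- Sys.which(c("pdftotext", "pdfinfo"))
print(tools)
```

If both entries come back non-empty, R can at least see the executables. Whether that is enough for readPDF() is exactly what I am unsure about.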
Thank you in advance for any help, and I hope the above made sense.
### OS = Windows Vista Ultimate
R version 2.8.0 (2008-10-20)
LC_COLLATE=English_United Kingdom.1252;LC_CTYPE=English_United Kingdom.
attached base packages:
 grid stats graphics grDevices utils datasets
other attached packages:
 tm_0.3-1 XML_1.98-1 Snowball_0.0-3
RWeka_0.3-14 rJava_0.6-0 Matrix_0.999375-16
loaded via a namespace (and not attached):