[R] read data from pdf file

paul sorenson (sosman) sourceforge at metrak.com
Sun Oct 23 01:12:00 CEST 2005


Marco Venanzi wrote:
> Hi, I'm trying to read data from a PDF file.Is it possible to do it
> with R? Thanks,  Marco [[alternative HTML version deleted]]

Ghostview has at least one method for extracting the text from a PDF 
document.  In particular Text|Extract allows you to select pages for 
extraction.  This may or may not give the same result as pdftotext 
because I think that is ghostscript based.

Your mileage may vary when extracting tables from a PDF.

cheers




More information about the R-help mailing list