[R] Help for pdf conversion
Thomas Subia
t@ub|@ @end|ng |rom |mgprec|@|on@com
Sat Nov 2 00:14:14 CET 2019
Jim,
That works well!
Thanks again for your help!
Thomas Subia
-----Original Message-----
From: Jim Lemon <drjimlemon using gmail.com>
Sent: Wednesday, October 30, 2019 11:14 PM
To: Thomas Subia <tsubia using imgprecision.com>
Cc: r-help using r-project.org
Subject: Re: [R] Help for pdf conversion
Hi Thomas,
Perhaps you should be doing something like writeLines(txt[1],...) or just:
sink("10619.txt")
cat(txt[1])
sink()
Jim
On Thu, Oct 31, 2019 at 4:48 PM Thomas Subia <tsubia using imgprecision.com> wrote:
>
> Colleagues,
>
> I'm trying to convert a pdf to a text file with the following code.
>
> # pdf to excel
> library(pdftools) # pdf to excel library # set working directory
> setwd("C:/Users")
> # input pdf
> txt <- pdf_text("C:/Users/10619.pdf")
> cat(txt[1])
> write.table(cat(txt[1]),file="10619.txt",sep= "\t",row.names
> =TRUE,col.names =FALSE)
>
> When I examine the contents of cat(txt[1]) on the console, everything I need is displayed in the format I need.
>
> However when I execute write.table(cat(txt[1]),file="10619.txt",sep= "\t",row.names =TRUE,col.names =FALSE) and examine the output, my output does not match cat(txt[1]).
> I suspect that sep= "\t",row.names =TRUE,col.names =FALSE) might be the error.
>
> How can one output the contents of cat(txt[1]) and retain its format?
>
> Thomas Subia
>
>
>
> [[alternative HTML version deleted]]
>
> ______________________________________________
> R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
More information about the R-help
mailing list