[R] Help for pdf conversion

Jim Lemon drj|m|emon @end|ng |rom gm@||@com
Thu Oct 31 07:13:58 CET 2019


Hi Thomas,
Perhaps you should be doing something like writeLines(txt[1],...) or just:

sink("10619.txt")
cat(txt[1])
sink()

Jim

On Thu, Oct 31, 2019 at 4:48 PM Thomas Subia <tsubia using imgprecision.com> wrote:
>
> Colleagues,
>
> I'm trying to convert a pdf to a text file with the following code.
>
> # pdf to excel
> library(pdftools) # pdf to excel library
> # set working directory
> setwd("C:/Users")
> # input pdf
> txt <- pdf_text("C:/Users/10619.pdf")
> cat(txt[1])
> write.table(cat(txt[1]),file="10619.txt",sep= "\t",row.names =TRUE,col.names =FALSE)
>
> When I examine the contents of cat(txt[1]) on the console, everything I need is displayed in the format I need.
>
> However when I execute write.table(cat(txt[1]),file="10619.txt",sep= "\t",row.names =TRUE,col.names =FALSE) and examine the output, my output does not match cat(txt[1]).
> I suspect that sep= "\t",row.names =TRUE,col.names =FALSE) might be the error.
>
> How can one output the contents of cat(txt[1]) and retain its format?
>
> Thomas Subia
>
>
>
>         [[alternative HTML version deleted]]
>
> ______________________________________________
> R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.



More information about the R-help mailing list