[R] Function gutenberg_download in the gutenbergr package

Patrick Connolly p_connolly at slingshot.co.nz
Wed Jan 24 08:23:06 CET 2018

I've been working through https://www.tidytextmining.com/tidytext.html
wherein everything worked until I got to this part in section 1.5

> hgwells <- gutenberg_download(c(35, 36, 5230, 159))
Determining mirror for Project Gutenberg from http://www.gutenberg.org/robot/harvest
Error in open.connection(con, "rb") : 
  Failed to connect to www.gutenberg.org port 80: Connection timed out

Which indicates the problem is at the very start:

  if (is.null(mirror)) {
    mirror <- gutenberg_get_mirror(verbose = verbose)

The documentation for gutenberg_get_mirror indicates there's nothing
different I could set.

So I tried specifying my usual mirror:

> hgwells <- gutenberg_download(c(1260, 768, 969, 9182, 767), mirror = "http://cran.stat.auckland.ac.nz")
Error in read_zip_url(full_url) : could not find function "read_zip_url"

Which is, indeed, strange since according to 

> help.search("read_zip_url")
Help files with alias or concept or title matching ‘read_zip_url’ using
regular expression matching:

                        Read a file from a .zip URL
  Aliases: read_zip_url


And according to 
library(help = "gutenbergr")


gutenberg_authors       Metadata about Project Gutenberg authors
gutenberg_download      Download one or more works using a Project
                        Gutenberg ID
gutenberg_get_mirror    Get the recommended mirror for Gutenberg files
gutenberg_metadata      Gutenberg metadata about each work
gutenberg_strip         Strip header and footer content from a Project
                        Gutenberg book
gutenberg_subjects      Gutenberg metadata about the subject of each
gutenberg_works         Get a filtered table of Gutenberg work metadata
read_zip_url            Read a file from a .zip URL


However, when I look at the list for that part of the search(), there
is no read_zip_url but all the rest of that list are present.  So it's
not surprising that it isn't found.  But it puzzles me that it is not

Ideas as to where I should proceed gratefully appreciated.

> sessionInfo()
R version 3.4.2 (2017-09-28)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Ubuntu 14.04.5 LTS

Matrix products: default
BLAS: /home/hrapgc/local/R-3.4.2/lib/libRblas.so
LAPACK: /home/hrapgc/local/R-3.4.2/lib/libRlapack.so

 [1] LC_CTYPE=en_NZ.UTF-8       LC_NUMERIC=C              
 [3] LC_TIME=en_NZ.UTF-8        LC_COLLATE=en_NZ.UTF-8    
 [7] LC_PAPER=en_NZ.UTF-8       LC_NAME=C                 
 [9] LC_ADDRESS=C               LC_TELEPHONE=C            

attached base packages:
[1] grDevices utils     stats     graphics  methods   base     

other attached packages:
 [1] sos_2.0-0          brew_1.0-6         gutenbergr_0.1.3   ggplot2_2.2.1     
 [5] stringr_1.2.0      bindrcpp_0.2       dplyr_0.7.4        janeaustenr_0.1.5 
 [9] tidytext_0.1.6     FactoMineR_1.38    readxl_1.0.0       tm_0.7-3          
[13] NLP_0.1-11         wordcloud_2.5      RColorBrewer_1.1-2 lattice_0.20-35   

loaded via a namespace (and not attached):
 [1] Rcpp_0.12.13         cellranger_1.1.0     compiler_3.4.2      
 [4] plyr_1.8.4           bindr_0.1            tokenizers_0.1.4    
 [7] tools_3.4.2          gtable_0.2.0         tibble_1.3.4        
[10] nlme_3.1-131         pkgconfig_2.0.1      rlang_0.1.2         
[13] Matrix_1.2-11        psych_1.7.8          curl_3.0            
[16] parallel_3.4.2       xml2_1.1.1           cluster_2.0.6       
[19] hms_0.3              flashClust_1.01-2    grid_3.4.2          
[22] scatterplot3d_0.3-40 glue_1.1.1           ellipse_0.3-8       
[25] R6_2.2.2             foreign_0.8-69       readr_1.1.1         
[28] purrr_0.2.4          tidyr_0.7.2          reshape2_1.4.2      
[31] magrittr_1.5         scales_0.5.0         SnowballC_0.5.1     
[34] MASS_7.3-47          leaps_3.0            assertthat_0.2.0    
[37] mnormt_1.5-5         colorspace_1.3-2     labeling_0.3        
[40] stringi_1.1.5        lazyeval_0.2.1       munsell_0.4.3       
[43] slam_0.1-42          broom_0.4.2         

   ___    Patrick Connolly   
 {~._.~}                   Great minds discuss ideas    
 _( Y )_  	         Average minds discuss events 
(:_~*~_:)                  Small minds discuss people  
 (_)-(_)  	                      ..... Eleanor Roosevelt

More information about the R-help mailing list