[R] How to access https page
Prof Brian Ripley
ripley at stats.ox.ac.uk
Tue Mar 10 07:28:38 CET 2015
On 09/03/2015 22:39, Hui Du wrote:
> Hi All,
>
> I am trying to parse some information from website, say, a linkedin page.
> The linkedin url was
>
> url = "http://www.linkedin.com/in/huidu"
>
> I had no problem to use readLines and XML package to collect the
> information I need. However, that url became "
> https://www.linkedin.com/in/huidu" now.
>
> url = "https://www.linkedin.com/in/huidu"
>
> It failed readLines function.
>
>> readLines(url)
> Error in file(con, "r") : cannot open the connection
> In addition: Warning message:
> In file(con, "r") : unsupported URL scheme
>
>
> Do you know any way to read-in web information if the url is https? Thanks
> a lot.
Try R-devel, soon to become R 3.2.0. That has support for this on
platforms where libcurl is installed (which should be possible almost
everywhere).
You did not give the 'at a minimum' information required by the posting
guide. This has long been possible on Windows with --internet2.
>
> Hui
>
> [[alternative HTML version deleted]]
>
> ______________________________________________
> R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>
--
Brian D. Ripley, ripley at stats.ox.ac.uk
Emeritus Professor of Applied Statistics, University of Oxford
1 South Parks Road, Oxford OX1 3TG, UK
More information about the R-help
mailing list