[R] Problem with scan() from UTF-8 encoded URL
john seers (IFR)
john.seers at bbsrc.ac.uk
Mon Dec 3 18:00:26 CET 2007
Hello
Works fine for me:
> data <- scan(file='http://en.wikipedia.org/wiki/Special:Recentchanges', what='character')
Read 3581 items
>
So I don't think the problem is at the Wikipedia end.
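If it helps to narrow things down, here is a minimal sketch of how the
warning text (which, in the failing case quoted below, carries the HTTP
status) could be captured instead of stopping the script. The helper name
try_scan is hypothetical, not part of R, and the example uses local
temporary files in place of a URL so that it runs without network access.

```r
# Hypothetical helper (not part of base R): run scan() on a file or URL
# and return either the tokens read or the message of the first
# warning/error that was signalled.
try_scan <- function(src) {
  tryCatch(
    list(ok = TRUE,
         items = scan(file = src, what = "character", quiet = TRUE)),
    # file() signals the warning before the error, so the warning text
    # (e.g. an HTTP status for URLs) is what gets reported here.
    warning = function(w) list(ok = FALSE, message = conditionMessage(w)),
    error   = function(e) list(ok = FALSE, message = conditionMessage(e))
  )
}

# A readable local file stands in for a URL that can be opened.
tmp <- tempfile(fileext = ".txt")
writeLines(c("alpha beta", "gamma"), tmp)
res_ok <- try_scan(tmp)

# A path that does not exist stands in for a URL that cannot be opened.
res_bad <- try_scan(file.path(tempdir(), "no-such-file.txt"))

if (res_ok$ok) cat("Read", length(res_ok$items), "items\n")
if (!res_bad$ok) cat("Failed:", res_bad$message, "\n")
```

Passing the Wikipedia URL to the same helper should return the tokens when
the connection opens and the warning text when it does not.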
Regards
John Seers
---
-----Original Message-----
From: r-help-bounces at r-project.org [mailto:r-help-bounces at r-project.org]
On Behalf Of EUROPOL
Sent: 03 December 2007 16:51
To: r-help at stat.math.ethz.ch
Subject: [R] Problem with scan() from UTF-8 encoded URL
Hello,
I am trying to import a website and structure it from within R:
The following code:
data <- scan(file='http://en.wikipedia.org/wiki/Special:Recentchanges', what='character')
results in the error:
Error in file(file, "r") : unable to open connection
In addition: Warning message:
cannot open: HTTP status was '403 Forbidden' in: file(file, "r")
The error seems to be connected to Wikipedia's UTF-8 encoding,
since the following line works:
data <- scan(file='http://www.google.de',what='character')
I am looking forward to your answers.
Greetings
Marc Schwenzer