[R] Problem with scan() from UTF-8 encoded URL

john seers (IFR) john.seers at bbsrc.ac.uk
Mon Dec 3 18:00:26 CET 2007


 

Hello

Works fine for me:

> data
<-scan(file='http://en.wikipedia.org/wiki/Special:Recentchanges',what='c
haracter')
Read 3581 items
> 

So I don't think it is the Wikipedia end.

Regards

John Seers


 
---

-----Original Message-----
From: r-help-bounces at r-project.org [mailto:r-help-bounces at r-project.org]
On Behalf Of EUROPOL
Sent: 03 December 2007 16:51
To: r-help at stat.math.ethz.ch
Subject: [R] Problem with scan() from UTF-8 encoded URL

Hallo,

I am trying to import a website and structure it from within R:

The following code:

data <-
scan(file='http://en.wikipedia.org/wiki/Special:Recentchanges',what='cha
racter')

results in the error:

Error in file(file, "r") : unable to open connection In addition:
Warning message:
cannot open: HTTP status was '403 Forbidden' in: file(file, "r")

It seems that the error is connected to the UTF-8-format of wikipedia,
since the following line works:

data <- scan(file='http://www.google.de',what='character')

I am looking forward to your answers.

Greetings

Marc Schwenzer

______________________________________________
R-help at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.



More information about the R-help mailing list