[R] The KJV
bolker at ufl.edu
Sun Feb 7 02:06:40 CET 2010
Jim Lemon <jim <at> bitwrit.com.au> writes:
> On 02/06/2010 06:57 PM, Charlotte Maia wrote:
> > Hey all,
> > Does anyone know if there are any R packages with a copy of the KJV?
> > I'm guessing the answer is no...
> > So the next question, and the more important one is:
> > Does anyone think it would be useful (e.g. for text-mining purposes)?
> > I know almost nothing about theology,
> > so I'm not sure what kind of questions theologists might have (that R
> > could answer).
> > An alternative, that would achieve a similar result (I think),
> > would be an R interface to another open source system, such as Sword.
> Hi Charlotte,
I couldn't help it:
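## open a read connection to the Project Gutenberg plain-text KJV
x <- url("http://www.gutenberg.org/dirs/etext90/kjv10.txt",open="r")
X <- readLines(x,n=20000)   ## first 20000 lines only
close(x)
## drop the header: everything up to and including the Genesis title line
z <- grep("First Book of Moses",X)[1]
X <- X[-(1:z)]
X <- X[nchar(X)>0]          ## drop blank lines
length(X) ## 15058
## split on spaces and basic punctuation, then lower-case
words <- tolower(unlist(strsplit(X,"[ .,:;()]")))
## keep tokens with at least one non-digit (drops "" and verse numbers)
words2 <- grep("[^0-9]",words,value=TRUE)
## word frequencies, most common first
tt <- sort(table(words2),decreasing=TRUE)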
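
For anyone who wants to try the counting step without downloading anything, here is a minimal sketch on a two-verse sample. The sample vector `verses` and the names `tok` and `freq` are made up for illustration; the split pattern and the digit filter are the ones used above.

```r
## hypothetical two-line sample standing in for the downloaded text
verses <- c("1:1 In the beginning God created the heaven and the earth.",
            "1:2 And the earth was without form, and void;")
## same split pattern as above: spaces and basic punctuation
tok <- tolower(unlist(strsplit(verses,"[ .,:;()]")))
## keep tokens with at least one non-digit character
tok <- grep("[^0-9]",tok,value=TRUE)
## frequency table, most common word first
freq <- sort(table(tok),decreasing=TRUE)
head(freq,3)
```

The same `head()` call on `tt` should list the most frequent words in the full text, which will presumably be dominated by stop words such as "the" and "and".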