[R] Troubles with stemming (tm + Snowball packages) under MacOS
Milan Bouchet-Valat
nalimilan at club.fr
Sun Jan 15 15:52:10 CET 2012
Le vendredi 13 janvier 2012 à 15:49 +0100, Julien Velcin a écrit :
> Dear all,
>
> I have some troubles using the stemming algorithm provided by the tm
> (text mining) + Snowball packages.
> Here is my config:
>
> MacOS 10.5
> R 2.12.0 / R 2.13.1 / R 2.14.1 (I have tried several versions)
>
> I have installed all the needed packages (tm, rJava, rWeka, Snowball)
> + dependencies. I have desactivated AWT (like written in http://r.789695.n4.nabble.com/Problem-with-Snowball-amp-RWeka-td3402126.html)
> with :
>
> Sys.setenv(NOAWT=TRUE)
>
> The command tm_map(reuters, stemDocument) gives the following errors :
>
> - First time:
> Error in .jnew(name) :
> java.lang.InternalError: Can't start the AWT because Java was
> started on the first thread. Make sure StartOnFirstThread is not
> specified in your application's Info.plist or on the command line
> Refreshing GOE props...
In my experience, there's no clean solution to this problem for now.
There's a good workaround, though: run your code from JGR, which is a
GUI written in Java. Snowball works well this way.
Cheers
More information about the R-help
mailing list