[R] POS counting number of verbs
Elahe chalabi
ch@|@b|@e|@he @end|ng |rom y@hoo@de
Mon Nov 5 13:38:17 CET 2018
Hi all,
I have 16630 Messages in my data frame and I would like to count number of verbs in each message, to do so I have the following code:
> str(tar)
'data.frame': 16630 obs. of 2 variables:
$ Message : Factor w/ 13412 levels "","'alter database datafile' needs to be executed",..: 11163 1 9715 10110 9683 11364 12952 2242 7153 6907 ...
$ group : Factor w/ 16630 levels "1","2","3","4",..: 1 2 3 4 5 6 7 8 9 10 ...
> tagPOS <- function(x, ...) {+ s <- as.String(x)+ word_token_annotator <- Maxent_Word_Token_Annotator()+ a2 <- Annotation(1L, "sentence", 1L, nchar(s))+ a2 <- annotate(s, word_token_annotator, a2)+ a3 <- annotate(s, Maxent_POS_Tag_Annotator(), a2)+ a3w <- a3[a3$type == "word"]+ POStags <- unlist(lapply(a3w$features, `[[`, "POS"))+ POStagged <- paste(sprintf("%s/%s", s[a3w], POStags), collapse = " ")+ list(POStagged = POStagged, POStags = POStags)+ }> count_verbs <-function(x) {+ pos_tags <- tagPOS(x)$POStags+ sum(grepl("VB", pos_tags))+ }> library(dplyr)> tar %>% + group_by(group) %>%+ summarise(num_verbs = count_verbs(Message))
And here is the error I get:Error in summarise_impl(.data, dots) : Evaluation error: no word token annotations found.
Does anyone know about this error? Thanks for any help.Elahe
[[alternative HTML version deleted]]
More information about the R-help
mailing list