[R-es] procesamiento de textos con R

Carlos Ortega cof en qualityexcellence.es
Mar Jul 7 13:33:52 CEST 2015


Hola,

También tienes el paquete "stringdist", que se aproxima bastante a los que
has preguntado.

http://cran.r-project.org/web/packages/stringdist/index.html

stringdist: Approximate String Matching and String Distance Functions

Implements an approximate string matching version of R's native 'match'
function. Can calculate various string distances based on edits
(damerau-levenshtein, hamming, levenshtein, optimal sting alignment),
qgrams (q-gram, cosine, jaccard distance) or heuristic metrics (jaro,
jaro-winkler). An implementation of soundex is provided as well.

Y también por la variedad de funciones y posibilidades de análisis que
ofrece puedes mirar "qDap":

http://cran.r-project.org/web/packages/qdap/index.html
qdap: Bridging the Gap Between Qualitative Data and Quantitative Analysis

Automates many of the tasks associated with quantitative discourse analysis
of transcripts containing discourse including frequency counts of sentence
types, words, sentences, turns of talk, syllables and other assorted
analysis tasks. The package provides parsing tools for preparing transcript
data. Many functions enable the user to aggregate data by any number of
grouping variables, providing analysis and seamless integration with other
R packages that undertake higher level analysis and visualization of text.
This affords the user a more efficient and targeted analysis. 'qdap' is
designed for transcript analysis, however, many functions are applicable to
other areas of Text Mining/Natural Language Processing.
Saludos,
Carlos Ortega
www.qualityexcellence.es


El 7 de julio de 2015, 11:14, MªLuz Morales <mlzmrls en gmail.com> escribió:

> Buenos días,
>
> quisiera saber si existe algún paquete en R para procesamiento de texto,
> búsqueda de similitudes y ese tipo de cosas. He estado buscando pero no he
> encontrado nada al respecto.
>
> Gracias
> Un saludo
>
>         [[alternative HTML version deleted]]
>
> _______________________________________________
> R-help-es mailing list
> R-help-es en r-project.org
> https://stat.ethz.ch/mailman/listinfo/r-help-es
>



-- 
Saludos,
Carlos Ortega
www.qualityexcellence.es

	[[alternative HTML version deleted]]



Más información sobre la lista de distribución R-help-es