quanteda: Quantitative Analysis of Textual Data

A fast, flexible, and comprehensive framework for quantitative text analysis in R. Provides functionality for corpus management, creating and manipulating tokens and n-grams, exploring keywords in context, forming and manipulating sparse matrices of documents by features and feature co-occurrences, analyzing keywords, computing feature similarities and distances, applying content dictionaries, applying supervised and unsupervised machine learning, visually representing text and text analyses, and more.

Version: 4.1.0
Depends: R (≥ 3.5.0), methods
Imports: fastmatch, jsonlite, lifecycle, magrittr, Matrix (≥ 1.5-0), Rcpp (≥ 0.12.12), SnowballC, stopwords, stringi, xml2, yaml
LinkingTo: Rcpp
Suggests: rmarkdown, spelling, testthat, formatR, tm (≥ 0.6), knitr, lsa, rlang, slam
Enhances: dplyr, lda, purrr, spacyr, stm, text2vec, tibble, tidytext, tokenizers, topicmodels
Published: 2024-09-04
DOI: 10.32614/CRAN.package.quanteda
Author: Kenneth Benoit ORCID iD [cre, aut, cph], Kohei Watanabe ORCID iD [aut], Haiyan Wang ORCID iD [aut], Paul Nulty ORCID iD [aut], Adam Obeng ORCID iD [aut], Stefan Müller ORCID iD [aut], Akitaka Matsuo ORCID iD [aut], William Lowe ORCID iD [aut], Christian Müller [ctb], Olivier Delmarcelle ORCID iD [ctb], European Research Council [fnd] (ERC-2011-StG 283794-QUANTESS)
Maintainer: Kenneth Benoit <kbenoit at lse.ac.uk>
BugReports: https://github.com/quanteda/quanteda/issues
License: GPL-3
URL: https://quanteda.io
NeedsCompilation: yes
Language: en-GB
Citation: quanteda citation info
Materials: README NEWS
In views: NaturalLanguageProcessing
CRAN checks: quanteda results

Documentation:

Reference manual: quanteda.pdf
Vignettes: Quick Start Guide (source, R code)

Downloads:

Package source: quanteda_4.1.0.tar.gz
Windows binaries: r-devel: quanteda_4.1.0.zip, r-release: quanteda_4.1.0.zip, r-oldrel: quanteda_4.1.0.zip
macOS binaries: r-release (arm64): quanteda_4.1.0.tgz, r-oldrel (arm64): quanteda_4.1.0.tgz, r-release (x86_64): quanteda_4.1.0.tgz, r-oldrel (x86_64): quanteda_4.1.0.tgz
Old sources: quanteda archive

Reverse dependencies:

Reverse depends: idiolect, seededlda
Reverse imports: AutoPlots, conText, corpustools, DICEM, doc2concrete, eHDPrep, grafzahl, highlightr, keyATM, LDABiplots, LDAShiny, LexisNexisTools, LSX, newsmap, oolong, orderanalyzer, poldis, politeness, pseudobibeR, quanteda.textmodels, quanteda.textplots, quanteda.textstats, rainette, RNewsflow, sentometrics, sentopics, stm, stylest2, sweater, textstem, tosca, Twitmo, ulex, wordmap, wordvector
Reverse linking to: quanteda.textmodels, quanteda.textstats, seededlda, wordvector
Reverse suggests: explor, mlr3pipelines, readtext, spacyr, stminsights, stopwords, text2map, tidylda, tidytext

Linking:

Please use the canonical form https://CRAN.R-project.org/package=quanteda to link to this page.