Package: text2map 0.3.0

text2map: R Tools for Text Matrices, Embeddings, and Networks

This is a collection of functions optimized for working with various kinds of text matrices. Focusing on the text matrix as the primary object - represented either as a base R dense matrix or a 'Matrix' package sparse matrix - allows for a consistent and intuitive interface that stays close to the underlying mathematical foundation of computational text analysis. In particular, the package includes functions for working with word embeddings, text networks, and document-term matrices. The package implements methods developed in Stoltz and Taylor (2019) <doi:10.1007/s42001-019-00048-6>, Taylor and Stoltz (2020) <doi:10.1007/s42001-020-00075-8>, Taylor and Stoltz (2020) <doi:10.15195/v7.a23>, and Stoltz and Taylor (2021) <doi:10.1016/j.poetic.2021.101567>.
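To make the "text matrix as the primary object" idea concrete, the sketch below builds a tiny document-term matrix by hand, first as a base R dense matrix and then as a 'Matrix' sparse matrix. It uses only base R and the 'Matrix' package, not text2map itself. The toy documents and all object names (docs, dense_dtm, sparse_dtm, and so on) are illustrative assumptions.

```r
library(Matrix)

# Three toy documents (illustrative text, not one of the package datasets)
docs <- c(
  d1 = "the moon is a goal",
  d2 = "we choose the moon",
  d3 = "we choose to go"
)

# Whitespace tokenization and a sorted vocabulary
tokens <- strsplit(docs, " ", fixed = TRUE)
vocab <- sort(unique(unlist(tokens)))

# Dense DTM: one row per document, one column per term
dense_dtm <- t(vapply(
  tokens,
  function(tok) as.numeric(table(factor(tok, levels = vocab))),
  numeric(length(vocab))
))
dimnames(dense_dtm) <- list(names(docs), vocab)

# The same counts stored as a sparse matrix
sparse_dtm <- Matrix(dense_dtm, sparse = TRUE)

# Matrix operations read the same on either representation
doc_lengths <- rowSums(sparse_dtm)    # tokens per document
term_cooc <- crossprod(sparse_dtm)    # term-by-term co-occurrence counts
```

Because both objects carry the same row and column names, code written against one representation should generally work against the other. The sparse form mainly saves memory when most cells are zero.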

Authors: Dustin Stoltz [aut, cre], Marshall Taylor [aut]

text2map_0.3.0.tar.gz
text2map_0.3.0.zip (r-4.7), text2map_0.3.0.zip (r-4.6), text2map_0.3.0.zip (r-4.5)
text2map_0.3.0.tgz (r-4.6-any), text2map_0.3.0.tgz (r-4.5-any)
text2map_0.3.0.tar.gz (r-4.7-any), text2map_0.3.0.tar.gz (r-4.6-any)
text2map_0.3.0.tgz (r-4.6-emscripten)
manual.pdf | manual.html
DESCRIPTION | NEWS
card.svg | card.png
text2map/json (API)

# Install 'text2map' in R:

install.packages('text2map', repos = c('https://dustinstoltz.r-universe.dev', 'https://cloud.r-project.org'))
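
Once the package is installed, a first call might look like the sketch below. It assumes that dtm_builder() takes a data frame followed by unquoted text and document-ID column names, and that it returns a sparse matrix with one row per document. Check ?dtm_builder for the authoritative signature. The corpus data frame and its column names are hypothetical.

```r
library(text2map)

# Hypothetical two-document corpus; column names are illustrative
corpus <- data.frame(
  doc_id = c("d1", "d2"),
  text = c("we choose the moon", "we choose to go"),
  stringsAsFactors = FALSE
)

# Build a unigram document-term matrix
# (assumed argument order: data, text column, document-ID column)
dtm <- dtm_builder(corpus, text = text, doc_id = doc_id)

# Rows are documents, columns are unique terms
dim(dtm)
```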

Bug tracker: https://gitlab.com/culturalcartography/text2map

Pkgdown/docs site: https://culturalcartography.gitlab.io

Datasets:

anchor_lists - A dataset of anchor lists
ft_wv_sample - Sample of fastText embeddings
jfk_speech - Full Text of JFK's Rice Speech
meta_shakespeare - Metadata for Shakespeare's First Folio
stoplists - A dataset of stoplists

On CRAN:

4.11 score, 32 scripts, 385 downloads, 26 exports, 26 dependencies

Last updated from: c62b1d1339. Checks: 9 OK. Indexed: yes.

Target	Result	Time
linux-devel-x86_64	OK	224
source / vignettes	OK	210
linux-release-x86_64	OK	246
macos-release-arm64	OK	185
macos-oldrel-arm64	OK	275
windows-devel	OK	145
windows-release	OK	172
windows-oldrel	OK	137
wasm-release	OK	162

Exports: cmdist CMDist coca CoCA doc_centrality doc_similarity dtm_builder dtm_melter dtm_resampler dtm_stats dtm_stopper find_projection find_rejection find_transformation get_anchors get_centroid get_direction get_regions get_stoplist perm_tester rancor_builder rancors_builder seq_builder test_anchors tiny_gender_tagger vocab_builder

Dependencies: cli codetools doParallel dplyr fastmatch foreach generics glue iterators kit lattice lifecycle magrittr Matrix permute pillar pkgconfig R6 rlang rsvd stringi tibble tidyselect utf8 vctrs withr

Help page	Topics
A dataset of anchor lists	anchor_lists
Calculate Concept Mover's Distance	CMDist cmdist
Performs Concept Class Analysis (CoCA)	CoCA coca
Find a specified document centrality metric	doc_centrality
Find similarities between documents	doc_similarity
A fast unigram DTM builder	dtm_builder
Melt a DTM into a triplet data frame	dtm_melter
Resamples an input DTM to generate new DTMs	dtm_resampler
Gets DTM summary statistics	dtm_stats
Removes terms from a DTM based on rules	dtm_stopper
Find the 'projection matrix' to a semantic vector	find_projection
Find the 'rejection matrix' from a semantic vector	find_rejection
Find a specified matrix transformation	find_transformation
Sample of fastText embeddings	ft_wv_sample
Gets anchor terms from precompiled anchor lists	get_anchors
Word embedding semantic centroid extractor	get_centroid
Word embedding semantic direction extractor	get_direction
Word embedding semantic region extractor	get_regions
Gets stoplist from precompiled lists	get_stoplist
Full Text of JFK's Rice Speech	jfk_speech
Metadata for Shakespeare's First Folio	meta_shakespeare
Monte Carlo Permutation Tests for Model P-Values	perm_tester
Plot CoCA	plot.CoCA
Prints CoCA class information	print.CoCA
Build a Random Corpus	rancor_builder
Build Multiple Random Corpora	rancors_builder
Represent Documents as Token-Integer Sequences	seq_builder
A dataset of stoplists	stoplists
Evaluate anchor sets in defining semantic relations	test_anchors
A very tiny "gender" tagger	tiny_gender_tagger
A fast unigram vocabulary builder	vocab_builder
