Package: text2map 0.2.0

text2map: R Tools for Text Matrices, Embeddings, and Networks

This is a collection of functions optimized for working with with various kinds of text matrices. Focusing on the text matrix as the primary object - represented either as a base R dense matrix or a 'Matrix' package sparse matrix - allows for a consistent and intuitive interface that stays close to the underlying mathematical foundation of computational text analysis. In particular, the package includes functions for working with word embeddings, text networks, and document-term matrices. Methods developed in Stoltz and Taylor (2019) <doi:10.1007/s42001-019-00048-6>, Taylor and Stoltz (2020) <doi:10.1007/s42001-020-00075-8>, Taylor and Stoltz (2020) <doi:10.15195/v7.a23>, and Stoltz and Taylor (2021) <doi:10.1016/j.poetic.2021.101567>.

Authors:Dustin Stoltz [aut, cre], Marshall Taylor [aut]

text2map_0.2.0.tar.gz
text2map_0.2.0.zip(r-4.5)text2map_0.2.0.zip(r-4.4)text2map_0.2.0.zip(r-4.3)
text2map_0.2.0.tgz(r-4.4-any)text2map_0.2.0.tgz(r-4.3-any)
text2map_0.2.0.tar.gz(r-4.5-noble)text2map_0.2.0.tar.gz(r-4.4-noble)
text2map_0.2.0.tgz(r-4.4-emscripten)text2map_0.2.0.tgz(r-4.3-emscripten)
text2map.pdf |text2map.html
text2map/json (API)
NEWS

# Install 'text2map' in R:
install.packages('text2map', repos = c('https://dustinstoltz.r-universe.dev', 'https://cloud.r-project.org'))

Peer review:

Bug tracker:https://gitlab.com/culturalcartography/text2map

Datasets:

On CRAN:

27 exports 1.41 score 107 dependencies 21 scripts 324 downloads

Last updated 3 months agofrom:1d7048f8ab. Checks:OK: 7. Indexed: yes.

TargetResultDate
Doc / VignettesOKSep 15 2024
R-4.5-winOKSep 15 2024
R-4.5-linuxOKSep 15 2024
R-4.4-winOKSep 15 2024
R-4.4-macOKSep 15 2024
R-4.3-winOKSep 15 2024
R-4.3-macOKSep 15 2024

Exports:cmdistCMDistcocaCoCAdoc_centralitydoc_similaritydtm_builderdtm_melterdtm_resamplerdtm_statsdtm_stopperfind_projectionfind_rejectionfind_transformationget_anchorsget_centroidget_directionget_regionsget_stoplistMatrixperm_testerrancor_builderrancors_builderseq_buildertest_anchorstiny_gender_taggervocab_builder

Dependencies:abindbackportsbase64encbslibcachemcheckmatecliclusterClusterRcodetoolscolorspacecorpcorcpp11data.tabledigestdoParalleldplyrevaluatefansifarverfastmapfastmatchfdrtoolfloatfontawesomeforeachforeignFormulafsgenericsggplot2glassogluegmpGPArotationgridExtragtablegtoolshighrHmischtmlTablehtmltoolshtmlwidgetsigraphisobanditeratorsjpegjquerylibjsonlitekitknitrlabelinglatticelavaanlgrlifecyclemagrittrMASSMatrixMatrixExtramemoisemgcvmimemlapimnormtmunsellnlmennetnumDerivpbapplypbivnormpermutepillarpkgconfigplyrpngpsychqgraphquadprogR6rappdirsRColorBrewerRcppRcppArmadilloreshape2RhpcBLASctlrlangrmarkdownrpartrsparserstudioapirsvdsassscalesstringistringrtext2vectibbletidyselecttinytexutf8vctrsviridisviridisLitewithrxfunyaml

Readme and manuals

Help Manual

Help pageTopics
A dataset of anchor listsanchor_lists
Calculate Concept Mover's DistanceCMDist cmdist
Performs Concept Class Analysis (CoCA)CoCA coca
Find a specified document centrality metricdoc_centrality
Find a similarities between documentsdoc_similarity
A fast unigram DTM builderdtm_builder
Melt a DTM into a triplet data framedtm_melter
Resamples an input DTM to generate new DTMsdtm_resampler
Gets DTM summary statisticsdtm_stats
Removes terms from a DTM based on rulesdtm_stopper
Find the 'projection matrix' to a semantic vectorfind_projection
Find the 'rejection matrix' from a semantic vectorfind_rejection
Find a specified matrix transformationfind_transformation
Sample of fastText embeddingsft_wv_sample
Gets anchor terms from precompiled anchor listsget_anchors
Word embedding semantic centroid extractorget_centroid
Word embedding semantic direction extractorget_direction
Word embedding semantic region extractorget_regions
Gets stoplist from precompiled listsget_stoplist
Full Text of JFK's Rice Speechjfk_speech
Metadata for Shakespeare's First Foliometa_shakespeare
Monte Carlo Permutation Tests for Model P-Valuesperm_tester
Plot CoCAplot.CoCA
Prints CoCA class informationprint.CoCA
Build a Random Corpusrancor_builder
Build Multiple Random Corporarancors_builder
Represent Documents as Token-Integer Sequencesseq_builder
A dataset of stoplistsstoplists
Evaluate anchor sets in defining semantic directionstest_anchors
A very tiny "gender" taggertiny_gender_tagger
A fast unigram vocabulary buildervocab_builder