NEWS

text2map 0.2.3 (2026-02-11)

Added test_anchor Added more unit tests Fixed a bug in doc_centrality using the centroid method

Fixes for changes to the Matrix package Updating documentation and added examples

Fix encoding issue for non-ASCII characters to work with fastmatch Add functionality

Include additional tests, updated documentation and vignettes

Working on an encoding error in fastmatch which shows inconsistent behavior with non-ASCII characters. This dev version provides a temporary fix.

Add functionality
- doc_centrality calculates four graph-based centrality metrics using DTMs
- doc_similarity calculates four document similarity measures using DTMs

Replaced dependency
- using ClusterR for get_regions, instead of mlpack
- Uses the Armadillo library k-means algorithm only (no longer provides an option)
Added functionality:
- seq_builder creates a token-integer sequence representation
Added Shakespeare metadata for examples
Import Matrix package methods

Added functionality
- dtm_builder includes an option to return a dense base R matrix
- dtm_stopper includes an option to remove based on a terms rank (e.g., top 10), stopping based on count and proportion are now two separate options

Add functions:
- find_transformation() to norm, center, and align matrices
- find_projection() finds the projection matrix onto a vector
- find_rejection() finds the rejection matrix away from a vector
- dtm_melter() quickly turns a DTM into a triplet dataframe (doc_id, term, count)
Fixed get_centroid() naming (limits to single word for names)

Added functionality to dtm_stopper() to stop words by document or term frequencies
- Nomenclature was changed, stop_freq was changed to stop_termfreq
Added functionality to dtm_resampler() to resample proportion and fixed N lengths
Added and clarified documentation
Added a NEWS.md file to track changes to the package.