Package: stylo 0.7.6

stylo: Stylometric Multivariate Analyses

Supervised and unsupervised multivariate methods, supplemented by a GUI and some visualizations, for analyses in computational stylistics, authorship attribution, and related fields. For further reference, see Eder et al. (2016), <https://journal.r-project.org/archive/2016/RJ-2016-007/index.html>. You are also encouraged to visit the Computational Stylistics Group's website <https://computationalstylistics.github.io/>, where further information about the package and related projects is provided.

Authors: Maciej Eder [aut, cre], Jan Rybicki [aut], Mike Kestemont [aut], Steffen Pielstroem [aut]

stylo_0.7.6.tar.gz
stylo_0.7.6.zip (r-4.5), stylo_0.7.6.zip (r-4.4), stylo_0.7.6.zip (r-4.3)
stylo_0.7.6.tgz (r-4.5-any), stylo_0.7.6.tgz (r-4.4-any), stylo_0.7.6.tgz (r-4.3-any)
stylo_0.7.6.tar.gz (r-4.5-noble), stylo_0.7.6.tar.gz (r-4.4-noble)
stylo_0.7.6.tgz (r-4.4-emscripten), stylo_0.7.6.tgz (r-4.3-emscripten)
stylo.pdf | stylo.html
stylo/json (API)
NEWS

# Install 'stylo' in R:

install.packages('stylo', repos = c('https://computationalstylistics.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker: https://github.com/computationalstylistics/stylo/issues

Datasets:

galbraith - Table of word frequencies
lee - Table of word frequencies
novels - A selection of 19th-century English novels
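
The bundled datasets can be loaded with base R's data(). The following is a minimal sketch, assuming the package is installed; the slice sizes (5 texts, 100 features) are arbitrary choices for illustration and not taken from the package documentation.

```r
library(stylo)

# Load the three bundled datasets into the workspace
data(galbraith)
data(lee)
data(novels)

# 'lee' and 'galbraith' are tables of word frequencies; inspect their shape
dim(lee)
dim(galbraith)

# Pairwise cosine distances on a small, arbitrary slice of 'lee'
# (assumption for illustration: first 5 rows, first 100 columns)
freq_slice <- as.matrix(lee)[1:5, 1:100]
d <- dist.cosine(freq_slice)
print(round(as.matrix(d), 3))
```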

On CRAN:

8.58 score, 187 stars, 462 scripts, 1.1k downloads, 2 mentions, 51 exports, 15 dependencies

Last updated 2 months ago from: 98998cd6b1. Checks: 9 OK. Indexed: yes.

Target	Result	Latest binary
Doc / Vignettes	OK	Mar 16 2025
R-4.5-win	OK	Mar 16 2025
R-4.5-mac	OK	Mar 16 2025
R-4.5-linux	OK	Mar 16 2025
R-4.4-win	OK	Mar 16 2025
R-4.4-mac	OK	Mar 16 2025
R-4.4-linux	OK	Mar 16 2025
R-4.3-win	OK	Mar 16 2025
R-4.3-mac	OK	Mar 16 2025

Exports: assign.plot.colors change.encoding check.encoding classify crossv define.plot.area delete.markup delete.stop.words dist.argamon dist.cosine dist.delta dist.eder dist.entropy dist.minmax dist.simple dist.wurzburg gui.classify gui.oppose gui.stylo imposters imposters.optimize load.corpus load.corpus.and.parse make.frequency.list make.ngrams make.samples make.table.of.frequencies oppose parse.corpus parse.pos.tags perform.culling perform.delta perform.impostors perform.knn perform.naivebayes perform.nsc perform.svm performance.measures rolling.classify rolling.delta samplesize.penalize stylo stylo.default.settings stylo.network stylo.pronouns txt.to.features txt.to.words txt.to.words.ext zeta.chisquare zeta.craig zeta.eder

Dependencies: ape class cluster digest e1071 lattice MASS Matrix nlme pamr proxy Rcpp survival tcltk2 tsne

Readme and manuals

Help Manual

Help page	Topics
Assign colors to samples	assign.plot.colors
Change character encoding	change.encoding
Check character encoding in corpus folder	check.encoding
Machine-learning supervised classification	classify
Function to Perform Cross-Validation	crossv
Define area for scatterplots	define.plot.area
Delete HTML or XML tags	delete.markup
Exclude stop words (e.g. pronouns, particles, etc.) from a dataset	delete.stop.words
Cosine Distance	dist.cosine
Delta Distance	dist.argamon dist.delta dist.eder
Entropy Distance	dist.entropy
Min-Max Distance (aka Ruzicka Distance)	dist.minmax
Eder's Simple Distance	dist.simple
Cosine Delta Distance (aka Wurzburg Distance)	dist.wurzburg
Table of word frequencies (Galbraith, Rowling, Coben, Tolkien, Lewis)	galbraith
GUI for the function classify	gui.classify
GUI for the function oppose	gui.oppose
GUI for stylo	gui.stylo
Authorship Verification Classifier Known as the Imposters Method	imposters
Tuning Parameters for the Imposters Method	imposters.optimize
Table of word frequencies (Lee, Capote, Faulkner, Styron, etc.)	lee
Load text files	load.corpus
Load text files and perform pre-processing	load.corpus.and.parse
Make List of the Most Frequent Elements (e.g. Words)	make.frequency.list
Make text n-grams	make.ngrams
Split text to samples	make.samples
Prepare a table of (relative) word frequencies	make.table.of.frequencies
A selection of 19th-century English novels	novels
Contrastive analysis of texts	oppose
Perform pre-processing (tokenization, n-gram extracting, etc.)	parse.corpus
Extract POS-tags or Words from Annotated Corpora	parse.pos.tags
Exclude variables (e.g. words, n-grams) from a frequency table that are too characteristic for some samples	perform.culling
Distance-based classifier	perform.delta
An Authorship Verification Classifier Known as the Impostors Method. ATTENTION: this function is obsolete; use the newer implementation, imposters(), instead.	perform.impostors
k-Nearest Neighbor classifier	perform.knn
Naive Bayes classifier	perform.naivebayes
Nearest Shrunken Centroids classifier	perform.nsc
Support Vector Machines classifier	perform.svm
Accuracy, Precision, Recall, and the F Measure	performance.measures
Plot Classification Accuracy for Short Text Samples	plot.sample.size
Sequential machine-learning classification	rolling.classify
Sequential stylometric analysis	rolling.delta
Determining Minimal Sample Size for Text Classification	samplesize.penalize
Stylometric multidimensional analyses	stylo stylo.package
Setting variables for the package stylo	stylo.default.settings
Bootstrap consensus networks, with D3 visualization	stylo.network
List of pronouns	stylo.pronouns
Split string of words or other countable features	txt.to.features
Split text into words	txt.to.words
Split text into words: extended version	txt.to.words.ext
Compare two subcorpora using a home-brew variant of Craig's Zeta	zeta.chisquare
Compare two subcorpora using Craig's Zeta	zeta.craig
Compare two subcorpora using Eder's Zeta	zeta.eder
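
Several of the pre-processing topics listed above can be chained by hand. The following is a minimal sketch, assuming the package is installed; the sample sentence and variable names are made up for illustration, and the call pattern follows the usage shown in the package's help pages rather than a prescribed workflow.

```r
library(stylo)

# A made-up sample sentence (illustrative only)
my.text <- "the quick brown fox jumps over the lazy dog"

# Split the text into (lowercased) words
words <- txt.to.words(my.text)

# Combine consecutive words into word 2-grams
bigrams <- make.ngrams(words, ngram.size = 2)

# List the words ordered by frequency, most frequent first
mfw <- make.frequency.list(words)

print(words)
print(bigrams)
print(mfw)
```

The same tokenized vector can be passed to make.frequency.list() with value = TRUE when the frequencies themselves are needed rather than only the ordered word list.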