Package: stylo 0.7.6

stylo: Stylometric Multivariate Analyses

Supervised and unsupervised multivariate methods, supplemented by GUI and some visualizations, to perform various analyses in the field of computational stylistics, authorship attribution, etc. For further reference, see Eder et al. (2016), <https://journal.r-project.org/archive/2016/RJ-2016-007/index.html>. You are also encouraged to visit the Computational Stylistics Group's website <https://computationalstylistics.github.io/>, where a reasonable amount of information about the package and related projects are provided.

Authors:Maciej Eder [aut, cre], Jan Rybicki [aut], Mike Kestemont [aut], Steffen Pielstroem [aut]

stylo_0.7.6.tar.gz
stylo_0.7.6.zip(r-4.5)stylo_0.7.6.zip(r-4.4)stylo_0.7.6.zip(r-4.3)
stylo_0.7.6.tgz(r-4.4-any)stylo_0.7.6.tgz(r-4.3-any)
stylo_0.7.6.tar.gz(r-4.5-noble)stylo_0.7.6.tar.gz(r-4.4-noble)
stylo_0.7.6.tgz(r-4.4-emscripten)stylo_0.7.6.tgz(r-4.3-emscripten)
stylo.pdf |stylo.html
stylo/json (API)
NEWS

# Install 'stylo' in R:
install.packages('stylo', repos = c('https://computationalstylistics.r-universe.dev', 'https://cloud.r-project.org'))

Peer review:

Bug tracker:https://github.com/computationalstylistics/stylo/issues

Datasets:
  • galbraith - Table of word frequencies
  • lee - Table of word frequencies
  • novels - A selection of 19th-century English novels

On CRAN:

51 exports 170 stars 5.25 score 15 dependencies 2 mentions 422 scripts 632 downloads

Last updated 23 days agofrom:0ed4dd282f. Checks:OK: 7. Indexed: yes.

TargetResultDate
Doc / VignettesOKAug 26 2024
R-4.5-winOKAug 26 2024
R-4.5-linuxOKAug 26 2024
R-4.4-winOKAug 26 2024
R-4.4-macOKAug 26 2024
R-4.3-winOKAug 26 2024
R-4.3-macOKAug 26 2024

Exports:assign.plot.colorschange.encodingcheck.encodingclassifycrossvdefine.plot.areadelete.markupdelete.stop.wordsdist.argamondist.cosinedist.deltadist.ederdist.entropydist.minmaxdist.simpledist.wurzburggui.classifygui.opposegui.styloimpostersimposters.optimizeload.corpusload.corpus.and.parsemake.frequency.listmake.ngramsmake.samplesmake.table.of.frequenciesopposeparse.corpusparse.pos.tagsperform.cullingperform.deltaperform.impostorsperform.knnperform.naivebayesperform.nscperform.svmperformance.measuresrolling.classifyrolling.deltasamplesize.penalizestylostylo.default.settingsstylo.networkstylo.pronounstxt.to.featurestxt.to.wordstxt.to.words.extzeta.chisquarezeta.craigzeta.eder

Dependencies:apeclassclusterdigeste1071latticeMASSMatrixnlmepamrproxyRcppsurvivaltcltk2tsne

Readme and manuals

Help Manual

Help pageTopics
Assign colors to samplesassign.plot.colors
Change character encodingchange.encoding
Check character encoding in corpus foldercheck.encoding
Machine-learning supervised classificationclassify
Function to Perform Cross-Validationcrossv
Define area for scatterplotsdefine.plot.area
Delete HTML or XML tagsdelete.markup
Exclude stop words (e.g. pronouns, particles, etc.) from a datasetdelete.stop.words
Cosine Distancedist.cosine
Delta Distancedist.argamon dist.delta dist.eder
Entropy Distancedist.entropy
Min-Max Distance (aka Ruzicka Distance)dist.minmax
Cosine Distancedist.simple
Cosine Delta Distance (aka Wurzburg Distance)dist.wurzburg
Table of word frequencies (Galbraith, Rowling, Coben, Tolkien, Lewis)galbraith
GUI for the function classifygui.classify
GUI for the function opposegui.oppose
GUI for stylogui.stylo
Authorship Verification Classifier Known as the Imposters Methodimposters
Tuning Parameters for the Imposters Methodimposters.optimize
Table of word frequencies (Lee, Capote, Faulkner, Styron, etc.)lee
Load text filesload.corpus
Load text files and perform pre-processingload.corpus.and.parse
Make List of the Most Frequent Elements (e.g. Words)make.frequency.list
Make text n-gramsmake.ngrams
Split text to samplesmake.samples
Prepare a table of (relative) word frequenciesmake.table.of.frequencies
A selection of 19th-century English novelsnovels
Contrastive analysis of textsoppose
Perform pre-processing (tokenization, n-gram extracting, etc.)parse.corpus
Extract POS-tags or Words from Annotated Corporaparse.pos.tags
Exclude variables (e.g. words, n-grams) from a frequency table that are too characteristic for some samplesperform.culling
Distance-based classifierperform.delta
An Authorship Verification Classifier Known as the Impostors Method. ATTENTION: this function is obsolete; refer to a new implementation, aka the imposters() function!perform.impostors
k-Nearest Neighbor classifierperform.knn
Naive Bayes classifierperform.naivebayes
Nearest Shrunken Centroids classifierperform.nsc
Support Vector Machines classifierperform.svm
Accuracy, Precision, Recall, and the F Measureperformance.measures
Plot Classification Accuracy for Short Text Samplesplot.sample.size
Sequential machine-learning classificationrolling.classify
Sequential stylometric analysisrolling.delta
Determining Minimal Sample Size for Text Classificationsamplesize.penalize
Stylometric multidimensional analysesstylo stylo.package
Setting variables for the package stylostylo.default.settings
Bootstrap consensus networks, with D3 visualizationstylo.network
List of pronounsstylo.pronouns
Split string of words or other countable featurestxt.to.features
Split text into wordstxt.to.words
Split text into words: extended versiontxt.to.words.ext
Compare two subcorpora using a home-brew variant of Craig's Zetazeta.chisquare
Compare two subcorpora using Craig's Zetazeta.craig
Compare two subcorpora using Eder's Zetazeta.eder