Comprehensive Python package for stylometric analysis
Project description
pystylometry
Stylometric analysis and authorship attribution for Python. 50+ metrics across 11 modules, from vocabulary diversity to AI-generation detection.
Install
pip install pystylometry # Core (lexical metrics)
pip install pystylometry[all] # Everything
Modules
| Module | Metrics | Description |
|---|---|---|
| lexical | TTR, MTLD, Yule's K/I, Hapax, MATTR, VocD-D, HD-D, MSTTR, function words, word frequency | Vocabulary diversity and richness |
| readability | Flesch, Flesch-Kincaid, SMOG, Gunning Fog, Coleman-Liau, ARI, Dale-Chall, Fry, FORCAST, Linsear Write, Powers-Sumner-Kearl | Grade-level and difficulty scoring |
| syntactic | POS ratios, sentence types, parse tree depth, clausal density, passive voice, T-units, dependency distance | Sentence and parse structure (requires spaCy) |
| authorship | Burrows' Delta, Cosine Delta, Zeta, Kilgarriff chi-squared, MinMax, John's Delta, NCD | Author attribution and text comparison |
| stylistic | Contractions, hedges, intensifiers, modals, punctuation, vocabulary overlap (Jaccard/Dice/Cosine/KL), cohesion, genre/register | Style markers and text similarity |
| character | Letter frequencies, digit/uppercase ratios, special characters, whitespace | Character-level fingerprinting |
| ngrams | Word/character/POS n-grams, Shannon entropy, skipgrams | N-gram profiles and entropy |
| dialect | British/American classification, spelling/grammar/vocabulary markers, markedness | Regional dialect detection |
| consistency | Sliding-window chi-squared drift, pattern classification | Intra-document style analysis |
| prosody | Syllable stress, rhythm regularity | Prose rhythm (requires spaCy) |
| viz | Timeline, scatter, report (PNG + interactive HTML) | Drift detection visualization |
Development
git clone https://github.com/craigtrim/pystylometry && cd pystylometry
pip install -e ".[dev,all]"
make test # 1022 tests
make lint # ruff + mypy
make all # lint + test + build
License
MIT
Author
Craig Trim -- craigtrim@gmail.com
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
pystylometry-1.4.3.tar.gz
(243.9 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
pystylometry-1.4.3-py3-none-any.whl
(285.7 kB
view details)
File details
Details for the file pystylometry-1.4.3.tar.gz.
File metadata
- Download URL: pystylometry-1.4.3.tar.gz
- Upload date:
- Size: 243.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.2 CPython/3.11.9 Darwin/24.6.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e8d4694a2f549e0996b0476a0a6525b6c44ebd249d9cba760aad1aad6fd73641
|
|
| MD5 |
c7c1bd996d44b7c8fc27665342ddc072
|
|
| BLAKE2b-256 |
fe51b30b0217284491261e3309eb01c102ac066911c709f6579f9a0d761ff42b
|
File details
Details for the file pystylometry-1.4.3-py3-none-any.whl.
File metadata
- Download URL: pystylometry-1.4.3-py3-none-any.whl
- Upload date:
- Size: 285.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.2 CPython/3.11.9 Darwin/24.6.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
600ce1ced5edad206ee1b48e7111034b2a83982f54544136fc81d1b936e12991
|
|
| MD5 |
64a386a3f7ded4d71340a09495b27a7e
|
|
| BLAKE2b-256 |
490fc98be8a269f1accd375abc2ef58c1d8e5216d7110d3a4b28658634bb0abf
|