Skip to main content

Authorship attribution toolkit — stylometric analysis with a configurable pipeline

Project description

mowen

Core Python library for authorship attribution.

This package provides the pipeline engine, all built-in components (canonicizers, event drivers, event cullers, distance functions, analysis methods), evaluation utilities (cross-validation, metrics), and the tokenizer framework.

Install

pip install -e .            # core only (no optional deps)
pip install -e '.[nlp]'     # + spaCy POS/NER
pip install -e '.[all]'     # everything

Usage

from mowen import Pipeline, PipelineConfig, Document, leave_one_out

# Attribution
results = Pipeline(config).execute(known_docs, unknown_docs)

# Cross-validation
eval_result = leave_one_out(docs, config)
print(eval_result.accuracy, eval_result.macro_f1)

See the root README for full documentation.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mowen-0.1.0.tar.gz (13.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mowen-0.1.0-py3-none-any.whl (13.7 MB view details)

Uploaded Python 3

File details

Details for the file mowen-0.1.0.tar.gz.

File metadata

  • Download URL: mowen-0.1.0.tar.gz
  • Upload date:
  • Size: 13.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for mowen-0.1.0.tar.gz
Algorithm Hash digest
SHA256 bd3d4bf2795c04d22b2ee62987b5eb492034cd759e79a46fa2dd8a636948318f
MD5 b261ffa40db76a3d61a8f528a23c816c
BLAKE2b-256 2685640e1a9acfcb6355613b1321904fce9b7ba40fe058e3be0c93adf858c600

See more details on using hashes here.

File details

Details for the file mowen-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: mowen-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 13.7 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for mowen-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 6122a4311df2a7b92e8914de9f780ed6bd62cc18409efb13b7a4f331893dcbb7
MD5 0cddc57a1cf3bda1d45f5ff0a265f7cf
BLAKE2b-256 9ff39d4370b4edb978f4b696e9ef3d1229c4e6a2e98aadbb489e739c170c5cbd

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page