Skip to main content

Authorship attribution toolkit — stylometric analysis with a configurable pipeline

Project description

mowen

Core Python library for authorship attribution.

This package provides the pipeline engine, all built-in components (canonicizers, event drivers, event cullers, distance functions, analysis methods), evaluation utilities (cross-validation, metrics), and the tokenizer framework.

Install

pip install -e .            # core only (no optional deps)
pip install -e '.[nlp]'     # + spaCy POS/NER
pip install -e '.[all]'     # everything

Usage

from mowen import Pipeline, PipelineConfig, Document, leave_one_out

# Attribution
results = Pipeline(config).execute(known_docs, unknown_docs)

# Cross-validation
eval_result = leave_one_out(docs, config)
print(eval_result.accuracy, eval_result.macro_f1)

See the root README for full documentation.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mowen-1.0.0.tar.gz (12.9 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mowen-1.0.0-py3-none-any.whl (13.5 MB view details)

Uploaded Python 3

File details

Details for the file mowen-1.0.0.tar.gz.

File metadata

  • Download URL: mowen-1.0.0.tar.gz
  • Upload date:
  • Size: 12.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for mowen-1.0.0.tar.gz
Algorithm Hash digest
SHA256 7a2fbd56f44aba1ddaba2e878d47e909ef2118ce39e3b1903bfe9d5f2ba58136
MD5 1fab727f2bfba3ae527380248a5661a4
BLAKE2b-256 685b70c7e9fe44e27cd7e22dc358abe6edf92df7fb9f1855469f31a615021a6d

See more details on using hashes here.

Provenance

The following attestation bundles were made for mowen-1.0.0.tar.gz:

Publisher: publish.yml on jnoecker/mowen

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file mowen-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: mowen-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 13.5 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for mowen-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c6b7b900836d31118b27d50a4ea4d1af19e1213b40248ad7414fe90ff7db61dc
MD5 c078c78df19b0a87b0f23bc291c54cbf
BLAKE2b-256 2a334ad160008a73bfca4715060e46e8b1926e409e016dcf64bab5b586236388

See more details on using hashes here.

Provenance

The following attestation bundles were made for mowen-1.0.0-py3-none-any.whl:

Publisher: publish.yml on jnoecker/mowen

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page