Authorship attribution toolkit — stylometric analysis with a configurable pipeline

Project description

mowen

Core Python library for authorship attribution.

This package provides the pipeline engine, all built-in components (canonicizers, event drivers, event cullers, distance functions, analysis methods), evaluation utilities (cross-validation, metrics), and the tokenizer framework.
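To make the component roles concrete, here is a minimal standard-library sketch of how the five stages could fit together. It is a conceptual illustration only: none of these function names come from mowen, and the specific choices (character trigrams, most-common culling, cosine distance, nearest neighbour) are assumptions picked for brevity, not the library's defaults.

```python
import math
import re
from collections import Counter


def canonicize(text):
    """Canonicizer: lowercase and collapse runs of whitespace."""
    return re.sub(r"\s+", " ", text.lower()).strip()


def char_ngrams(text, n=3):
    """Event driver: turn text into overlapping character n-grams."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def cull_most_common(histograms, keep=50):
    """Event culler: return the set of the `keep` most frequent events corpus-wide."""
    totals = Counter()
    for hist in histograms:
        totals.update(hist)
    return {event for event, _ in totals.most_common(keep)}


def cosine_distance(a, b):
    """Distance function: 1 - cosine similarity of two event histograms."""
    dot = sum(a[e] * b[e] for e in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return 1.0 - dot / norm if norm else 1.0


def attribute(known, unknown_text, n=3, keep=50):
    """Analysis method: nearest neighbour over known (author, text) pairs."""
    known_hists = [(author, Counter(char_ngrams(canonicize(text), n)))
                   for author, text in known]
    kept = cull_most_common([hist for _, hist in known_hists], keep)

    def restrict(hist):
        # Drop every event the culler did not keep.
        return Counter({e: c for e, c in hist.items() if e in kept})

    query = restrict(Counter(char_ngrams(canonicize(unknown_text), n)))
    best = min(known_hists,
               key=lambda pair: cosine_distance(restrict(pair[1]), query))
    return best[0]
```

Each stage is a plain function, so any one of them can be swapped without touching the others, which is the property a configurable pipeline relies on.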

Install

# Run from this package's directory in a source checkout (editable install)
pip install -e .            # core only (no optional deps)
pip install -e '.[nlp]'     # + spaCy POS/NER
pip install -e '.[all]'     # everything

Usage

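from mowen import Pipeline, PipelineConfig, Document, leave_one_out

# `config` is assumed to be a PipelineConfig, and `known_docs`, `unknown_docs`,
# and `docs` collections of Document objects. Building them is not shown here.

# Attribution: attribute the unknown documents using the known ones
results = Pipeline(config).execute(known_docs, unknown_docs)

# Leave-one-out cross-validation over a labeled corpus
eval_result = leave_one_out(docs, config)
print(eval_result.accuracy, eval_result.macro_f1)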
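For readers unfamiliar with the two reported metrics, the sketch below shows what leave-one-out evaluation, accuracy, and macro F1 mean in plain Python. It does not use mowen: every function name here is hypothetical, `classify` is a placeholder for any attribution method, and mowen's own `leave_one_out` may differ in details such as how it handles classes that are never predicted.

```python
from collections import Counter


def leave_one_out_predictions(samples, classify):
    """Hold out each (label, item) pair in turn and classify it from the rest.

    `samples` is a list of (label, item) pairs. `classify(training, item)` is
    any callable returning a predicted label (a placeholder, not a mowen API).
    """
    pairs = []
    for i, (label, item) in enumerate(samples):
        training = samples[:i] + samples[i + 1:]
        pairs.append((label, classify(training, item)))
    return pairs


def accuracy(pairs):
    """Fraction of held-out items whose predicted label matches the true one."""
    return sum(true == pred for true, pred in pairs) / len(pairs)


def macro_f1(pairs):
    """Unweighted mean of per-class F1 over the classes in the true labels."""
    labels = sorted({true for true, _ in pairs})
    scores = []
    for label in labels:
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


def majority_baseline(training, item):
    """Toy classifier: always predict the most frequent training label."""
    return Counter(label for label, _ in training).most_common(1)[0][0]
```

Because macro F1 averages per-class scores without weighting, an author with few documents counts as much as a prolific one, so it can sit well below accuracy on an imbalanced corpus.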

See the root README for full documentation.


Download files

Source Distribution

mowen-2.0.0.tar.gz (13.4 MB)

Uploaded Source

Built Distribution

mowen-2.0.0-py3-none-any.whl (14.0 MB)

Uploaded Python 3

File details

Details for the file mowen-2.0.0.tar.gz.

File metadata

  • Download URL: mowen-2.0.0.tar.gz
  • Upload date:
  • Size: 13.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for mowen-2.0.0.tar.gz
Algorithm     Hash digest
SHA256        586ecd5bdc8e82b39b2fabb582abe8fa448f853b6a4ad01bc6a05dea2b7857b0
MD5           5452477ae4e3b3d5c9b983f9e1ccba1d
BLAKE2b-256   63ab28c882bda5b2a204f34fd51d36da34b05f4c96123c38bd4bd1aae9c7fd35
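
A downloaded archive can be checked against the SHA256 digest listed for it using only the standard library. The sketch below is one way to do that; the helper names `sha256_of` and `matches` and the local file path are illustrative assumptions, not part of mowen or pip.

```python
import hashlib

# SHA256 digest listed above for mowen-2.0.0.tar.gz
EXPECTED_SHA256 = "586ecd5bdc8e82b39b2fabb582abe8fa448f853b6a4ad01bc6a05dea2b7857b0"


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected=EXPECTED_SHA256):
    """True if the file's SHA256 digest equals `expected` (case-insensitive)."""
    return sha256_of(path) == expected.lower()
```

Calling `matches("mowen-2.0.0.tar.gz")` on a local copy of the archive (the path is an assumption about where the file was saved) returns `True` only if the file is byte-for-byte identical to the one that was hashed.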

Provenance

The following attestation bundles were made for mowen-2.0.0.tar.gz:

Publisher: publish.yml on jnoecker/mowen

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file mowen-2.0.0-py3-none-any.whl.

File metadata

  • Download URL: mowen-2.0.0-py3-none-any.whl
  • Upload date:
  • Size: 14.0 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for mowen-2.0.0-py3-none-any.whl
Algorithm     Hash digest
SHA256        ec059dcac471e3112b68c6f663620528e855a1079712dc8ed4f4658bfa0feb62
MD5           25372f111b5455d9398436cbe2d9e2d4
BLAKE2b-256   0e4381328bc82403746a2ce6b1ca0705827a103a06c6646be74cf7190c9512de

Provenance

The following attestation bundles were made for mowen-2.0.0-py3-none-any.whl:

Publisher: publish.yml on jnoecker/mowen

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
