Skip to main content

Authorship attribution toolkit — stylometric analysis with a configurable pipeline

Project description

mowen

Core Python library for authorship attribution.

This package provides the pipeline engine, all built-in components (canonicizers, event drivers, event cullers, distance functions, analysis methods), evaluation utilities (cross-validation, metrics), and the tokenizer framework.

Install

pip install -e .            # core only (no optional deps)
pip install -e '.[nlp]'     # + spaCy POS/NER
pip install -e '.[all]'     # everything

Usage

from mowen import Pipeline, PipelineConfig, Document, leave_one_out

# Attribution
results = Pipeline(config).execute(known_docs, unknown_docs)

# Cross-validation
eval_result = leave_one_out(docs, config)
print(eval_result.accuracy, eval_result.macro_f1)

See the root README for full documentation.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mowen-2.1.0.tar.gz (13.4 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mowen-2.1.0-py3-none-any.whl (14.0 MB view details)

Uploaded Python 3

File details

Details for the file mowen-2.1.0.tar.gz.

File metadata

  • Download URL: mowen-2.1.0.tar.gz
  • Upload date:
  • Size: 13.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for mowen-2.1.0.tar.gz
Algorithm Hash digest
SHA256 00433c6583c0743a83cd5e98195f63fbe2a4ec1aeed7f720dd03d34a816adb19
MD5 1f924fd14ca159c80244365db737c683
BLAKE2b-256 fc2f875934748e5aa2ef180c39ede3d8bd2317cdc0fe4601b3a3d9c2d2a93d51

See more details on using hashes here.

Provenance

The following attestation bundles were made for mowen-2.1.0.tar.gz:

Publisher: publish.yml on jnoecker/mowen

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file mowen-2.1.0-py3-none-any.whl.

File metadata

  • Download URL: mowen-2.1.0-py3-none-any.whl
  • Upload date:
  • Size: 14.0 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for mowen-2.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 7bca9e79e911d924b92ad041d78f77e24eae129c1a8182271a58bee971008de9
MD5 66c3b006d597123eb09ce3f01f2e1aff
BLAKE2b-256 840fd581dba9d382f4f7825f6c5ed61c8413496c252e7f44961ce4c12cc2f360

See more details on using hashes here.

Provenance

The following attestation bundles were made for mowen-2.1.0-py3-none-any.whl:

Publisher: publish.yml on jnoecker/mowen

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page