Validation, sanitization and metrics for Markdown manuscripts.

manuscript-tools

A QA toolkit for German Markdown manuscripts. Validates style, sanitizes encoding, converts quotation marks, and measures readability.

German version | Wiki

Installation

pip install manuscript-tools

Or as a project dependency:

poetry add manuscript-tools

Commands

| Command | Description |
|---|---|
| ms-check | Style checks (5 core rules, --strict adds 3 prose rules) |
| ms-sanitize | Fix encoding, strip invisible chars, normalize Unicode |
| ms-quotes | Convert quotation marks to German typographic style „ “ ‚ ‘ |
| ms-metrics | Word counts, sentence analysis, Flesch-DE readability score |
| ms-validate | Full QA pipeline (sanitize + quotes + check + readability) |
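
To give a feel for what "strip invisible chars, normalize Unicode" can mean in practice, here is a minimal sketch using only the Python standard library. It is not the code behind ms-sanitize: the function name `sanitize_text`, the set of code points in `INVISIBLE`, and the choice of NFC as the normalization form are all assumptions made for illustration.

```python
import unicodedata

# Assumption: a small illustrative set of invisible code points.
# The list that ms-sanitize actually uses may differ.
INVISIBLE = {
    "\u200b",  # zero width space
    "\u200c",  # zero width non-joiner
    "\u200d",  # zero width joiner
    "\u2060",  # word joiner
    "\ufeff",  # byte order mark / zero width no-break space
    "\u00ad",  # soft hyphen
}


def sanitize_text(text: str) -> str:
    """Drop invisible characters, then compose the rest to NFC."""
    cleaned = "".join(ch for ch in text if ch not in INVISIBLE)
    # NFC merges a base letter plus combining mark (u + U+0308) into one code point (ü).
    return unicodedata.normalize("NFC", cleaned)


if __name__ == "__main__":
    print(sanitize_text("Ma\u200bnu\u00adskript u\u0308ber"))
```

Composed forms matter for German text because decomposed umlauts look identical on screen but break string comparison and word counts.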

Quick start

# Full QA pipeline
ms-validate manuscript/

# Style check only (core rules)
ms-check manuscript/

# Style check with prose analysis (filler words, passive voice, sentence length)
ms-check manuscript/ --strict

# Readability report
ms-metrics manuscript/

# Fix quotation marks (dry-run)
ms-quotes manuscript/ --dry-run
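
The idea behind a dry run is to report what would change without writing anything. The sketch below illustrates that for the simplest case only: paired straight double quotes on a single line. It is an assumption-laden illustration, not the ms-quotes implementation; the name `convert_quotes` is hypothetical, and single quotes are left alone because they cannot be told apart from apostrophes without more context.

```python
import re

# Matches a pair of straight double quotes on one line with no quote in between.
_STRAIGHT_PAIR = re.compile(r'"([^"\n]*)"')


def convert_quotes(text: str) -> tuple[str, int]:
    """Return the converted text and the number of quote pairs replaced.

    Opening mark: U+201E (low double quote), closing mark: U+201C.
    """
    return _STRAIGHT_PAIR.subn("\u201e\\1\u201c", text)


if __name__ == "__main__":
    original = 'Er sagte: "Hallo".'
    converted, count = convert_quotes(original)
    # Dry-run style report: show the result, write nothing to disk.
    print(f"{count} replacement(s): {converted}")
```

A caller that wants dry-run behavior prints the count and the diff; a caller that wants to apply the change writes `converted` back to the file.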

Rules

Core (always active):

no-dashes, no-invisible-chars, no-repeated-words, no-double-spaces, non-german-quotes

Prose (with --strict or ms-validate):

max-sentence-length, filler-words-de, passive-voice-de

Custom rules are simple callables with the signature (text: str, path: Path) -> list[StyleViolation]. See the Wiki for a step-by-step tutorial.
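
A custom rule matching that signature might look like the sketch below. The `StyleViolation` class here is a stand-in defined only so the example runs on its own; its fields (`path`, `line`, `rule`, `message`) are assumptions, and the real class should be imported from the package as described in the Wiki. The rule name `no-todo-markers` is likewise hypothetical.

```python
import re
from dataclasses import dataclass
from pathlib import Path


@dataclass
class StyleViolation:
    """Stand-in for the package's class; these fields are assumed, not documented."""
    path: Path
    line: int
    rule: str
    message: str


_MARKER = re.compile(r"\b(TODO|FIXME)\b")


def no_todo_markers(text: str, path: Path) -> list[StyleViolation]:
    """Flag every line that still contains a TODO or FIXME marker."""
    violations = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        if _MARKER.search(line):
            violations.append(
                StyleViolation(path, lineno, "no-todo-markers",
                               f"leftover marker: {line.strip()}")
            )
    return violations


if __name__ == "__main__":
    sample = "Kapitel 1\nTODO: Szene ausbauen\n"
    for v in no_todo_markers(sample, Path("kapitel-01.md")):
        print(f"{v.path}:{v.line} [{v.rule}] {v.message}")
```

Because a rule is a plain function of the text and its path, it can be unit-tested without touching the file system.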

Readability

ms-metrics computes the Flesch-DE reading ease score (Amstad, 1978) with German-optimized syllable counting. Score interpretation:

| Score | Level | Typical use |
|---|---|---|
| 80-100 | Very easy | Children's books |
| 60-80 | Easy to medium | Fiction, non-fiction |
| 30-60 | Difficult | Journalism, academic |
| 0-30 | Very difficult | Legal, scientific |
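
For reference, Amstad's adaptation of the Flesch formula is 180 − ASL − 58.5 × ASW, where ASL is the average sentence length in words and ASW the average number of syllables per word. The sketch below applies that formula with a deliberately naive syllable counter (one syllable per group of consecutive vowels). That heuristic, the sentence splitting on `.`, `!` and `?`, and the function names are assumptions for illustration; the syllable counting in ms-metrics is described as German-optimized and will give different numbers.

```python
import re

_VOWEL_GROUP = re.compile(r"[aeiouyäöü]+", re.IGNORECASE)
_WORD = re.compile(r"[A-Za-zÄÖÜäöüß]+")
_SENTENCE_END = re.compile(r"[.!?]+")


def count_syllables(word: str) -> int:
    """Naive heuristic: each run of vowels counts as one syllable (minimum 1)."""
    return max(1, len(_VOWEL_GROUP.findall(word)))


def flesch_de(text: str) -> float:
    """Flesch reading ease for German after Amstad: 180 - ASL - 58.5 * ASW."""
    words = _WORD.findall(text)
    sentences = [s for s in _SENTENCE_END.split(text) if s.strip()]
    if not words or not sentences:
        raise ValueError("text must contain at least one word and one sentence")
    asl = len(words) / len(sentences)                            # words per sentence
    asw = sum(count_syllables(w) for w in words) / len(words)    # syllables per word
    return 180.0 - asl - 58.5 * asw


if __name__ == "__main__":
    print(round(flesch_de("Das ist gut. Wir sind da."), 1))
```

Note that the formula is not clamped: very short sentences made of one-syllable words score above 100, and dense text can drop below 0.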

Development

git clone https://github.com/astrapi69/manuscript-tools.git
cd manuscript-tools
make install-dev
make ci          # lint + format check + 89 tests

Documentation

Full documentation is available in the Wiki.

License

BSD 3-Clause. See LICENSE.
