Skip to main content

No project description provided

Project description

twat-text

twat-text is the user-facing text utility package in the twat ecosystem.

It keeps deterministic text algorithms local and delegates LLM-powered operations to twat_llm through a narrow adapter. Generated images, video, audio, and speech belong in twat_genai and media domain packages, not here.

Install

pip install twat-text

Deterministic API

from twat_text import clean_text, chunk_text, extract_emails, convert_text, estimate_tokens

text = clean_text(" “Hello”\tworld ")
chunks = chunk_text(long_document, max_chars=2000, overlap=100)
emails = extract_emails("Ada <ada@example.com>")
plain = convert_text("<p>Hello &amp; welcome</p>", source="html", target="plain")
tokens = estimate_tokens(plain)

Available deterministic helpers include:

  • normalize_text and clean_text
  • chunk_text and context_window_chunks
  • extract_urls, extract_emails, and extract_numbers
  • markdown_to_plain, html_to_plain, plain_to_html, and convert_text
  • estimate_tokens

LLM-backed API

from twat_text import summarize, rewrite, extract_structured, classify

summary = summarize(long_text)
rewrite = rewrite(draft, instruction="Make this friendlier")
data = extract_structured(note, schema_hint="Return JSON with date and total")
label = classify(message, labels=["support", "sales", "spam"])

These functions import twat_llm only when called, so deterministic utilities stay dependency-light and testable.

CLI

twat-text clean "  Hello   world  "
twat-text chunk "long text..." --max-chars 500 --overlap 50
twat-text convert "<p>Hello</p>" --source html --target plain
twat-text summarize "long text..."
twat-text rewrite "draft text" --instruction "Make it concise"

Through the host plugin dispatcher, the same package is available as twat text ... once installed.

Development

hatch run test
hatch run lint
hatch run type-check

The compatibility Config and process_data API remains available, but process_data now returns cleaned text, chunks, simple extracted values, and an estimated token count instead of an empty placeholder.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

twat_text-2.7.10.tar.gz (18.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

twat_text-2.7.10-py3-none-any.whl (9.4 kB view details)

Uploaded Python 3

File details

Details for the file twat_text-2.7.10.tar.gz.

File metadata

  • Download URL: twat_text-2.7.10.tar.gz
  • Upload date:
  • Size: 18.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.2 {"installer":{"name":"uv","version":"0.11.2","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for twat_text-2.7.10.tar.gz
Algorithm Hash digest
SHA256 0d4a5bd7545a76c76827a12b3c0abc9536b11204a0d7d0ed1d2fbd9bf571a2c8
MD5 9209c82bc6e03a868bb4d1d0fbeba092
BLAKE2b-256 1c4b6252240e54ba4aeecaa0baf6a5fff6fb062c86e53fcf44566602cbbeb9ef

See more details on using hashes here.

File details

Details for the file twat_text-2.7.10-py3-none-any.whl.

File metadata

  • Download URL: twat_text-2.7.10-py3-none-any.whl
  • Upload date:
  • Size: 9.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.2 {"installer":{"name":"uv","version":"0.11.2","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for twat_text-2.7.10-py3-none-any.whl
Algorithm Hash digest
SHA256 75ca06c2057149be9a73b193a20c429711612fe6e755816fd37a54fad8e914f2
MD5 f8f4abbb22fa83f874fc1ef8ef40f98e
BLAKE2b-256 4b53a87d6bc0f5a980a30dd1dc20a232f85c04a8f1dd3ceebdc6502a1414d2f2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page