
Codex/Claude-backed text transformation and Kokoro TTS command-line tools.


agent-tools

Python CLI tools for:

  • transforming raw text into TTS-ready narration and synthesizing it in one command
  • transforming piped text through either the private Codex backend used by local Codex or an experimental Claude Code CLI wrapper
  • synthesizing the result to WAV with Kokoro-82M
  • auto-TTS of completed Codex and Claude Code replies through installed desktop hooks

This repo is intentionally wired to local Codex and Claude Code installs.

Release policy:

  • semantic-release owns version bumps, changelog updates, and py-v* tags
  • do not manually edit project.version in pyproject.toml during normal work
  • do not create release tags by hand unless the release workflow explicitly calls for it

Status

This is an experimental public package with a private Codex dependency and an experimental Claude Code CLI transform path.

The transform command mirrors the current request shape used by the local Codex source tree and depends on ChatGPT-backed auth in ~/.codex/auth.json.

The Claude Code path uses the official claude CLI in headless -p mode with a constrained single-turn wrapper. It does not rely on any private Claude Code API.

It does not use:

  • codex exec
  • codex app-server
  • the public OpenAI API key flow

That means:

  • you must already be logged into local Codex
  • backend compatibility can break if Codex internals or backend contracts change
  • this package is best suited for users who already use local Codex

Requirements

  • Python 3.11+
  • local Codex already logged in via ChatGPT
  • local Claude Code CLI installed if you want Claude-backed transforms or Claude auto-TTS integration
  • espeak-ng installed for best Kokoro English fallback behavior

Install

cd repos/agent-tools
uv venv
uv pip install -e ".[dev]"

Public package install:

pip install ai-nd-co-agent-tools

UI-enabled install:

pip install "ai-nd-co-agent-tools[ui]"

Install a CUDA-enabled PyTorch stack for this CLI environment:

agent-tools install-cuda

Pass an explicit track if you do not want auto-detection:

agent-tools install-cuda --cuda-track cu130

Usage

Single-command path: ttsify

echo "Turn this note into natural spoken narration." | agent-tools ttsify --output-file out.wav

ttsify uses a built-in rewrite prompt stored in the package and then pipes the transformed text into Kokoro TTS.

Default ttsify settings:

  • model: gpt-5.4-mini
  • reasoning effort: medium
  • voice: af_heart

Configurable via env vars:

AGENT_TOOLS_CODEX_MODEL=gpt-5.4-mini
AGENT_TOOLS_CODEX_REASONING_EFFORT=medium
AGENT_TOOLS_KOKORO_VOICE=af_heart
AGENT_TOOLS_KOKORO_LANGUAGE=a
AGENT_TOOLS_KOKORO_SPEED=1.0
AGENT_TOOLS_KOKORO_DEVICE=auto
AGENT_TOOLS_TRANSFORM_PROVIDER=codex
AGENT_TOOLS_CLAUDE_CODE_MODEL=haiku
AGENT_TOOLS_CLAUDE_CODE_EFFORT=low
AGENT_TOOLS_CLAUDE_CODE_BARE=false

Claude Code transform models are intentionally limited to:

  • haiku
  • sonnet

opus is rejected by the CLI and runtime wrapper.

CLI flags override env vars.
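The precedence described above (built-in default, overridden by env var, overridden by CLI flag) can be sketched in a few lines. `resolve_setting` is a hypothetical helper for illustration, not the package's actual internals, and the voice names other than af_heart are made up:

```python
import os

# Hypothetical precedence helper: an explicit CLI value wins over the
# environment variable, which wins over the built-in default.
def resolve_setting(cli_value, env_var, default):
    if cli_value is not None:
        return cli_value
    return os.environ.get(env_var, default)

os.environ["AGENT_TOOLS_KOKORO_VOICE"] = "af_heart"

# No CLI flag given: the env var wins over the default.
print(resolve_setting(None, "AGENT_TOOLS_KOKORO_VOICE", "af_bella"))    # af_heart
# Explicit CLI flag given: it wins over the env var.
print(resolve_setting("af_sky", "AGENT_TOOLS_KOKORO_VOICE", "af_bella"))  # af_sky
```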

Queue for playback on Windows:

echo "Turn this note into natural spoken narration." | agent-tools ttsify --output-mode play --source agent-a

Desktop integrations

Install both supported desktop integrations:

agent-tools install-integrations

Install only one provider if needed:

agent-tools install-codex-integration
agent-tools install-claude-integration

  • On native Windows Codex, this installs a notify command in ~/.codex/config.toml.
  • On non-Windows, this keeps the Stop-hook integration path.
  • Claude Code integration installs an AgentTools Stop hook into ~/.claude/settings.json and writes the hook script to ~/.claude/agent-tools/stop_tts.sh.
  • The compatibility alias agent-tools install-codex-stop-hook remains available.

Windows debug logs:

  • ~/.codex/notify_tts.log
  • ~/.codex/notify_tts_agent_tools.log

On Windows, Codex passes the notify payload as the final JSON argv argument to the installed Python command. No PowerShell or bash wrapper is used.
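A notify handler that consumes that final argv argument can be sketched as follows. The payload key shown is illustrative only; the real payload shape is whatever Codex emits:

```python
import json

# Sketch: Codex appends the notify payload as the last argv element,
# so the handler just parses that string as JSON. The "type" key here
# is an illustrative assumption about the payload, not a documented schema.
def read_notify_payload(argv):
    return json.loads(argv[-1])

payload = read_notify_payload(
    ["agent-tools-notify", '{"type": "agent-turn-complete"}']
)
print(payload["type"])  # agent-turn-complete
```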

The notify handler enqueues the generated audio, starts the background controller if needed, and returns immediately.

Claude Code hook logs:

  • ~/.claude/agent-tools/stop_tts.log
  • ~/.claude/agent-tools/stop_tts_agent_tools.log

Shared performance log:

  • Windows: %LOCALAPPDATA%/AgentTools/state/performance.jsonl
  • Linux/macOS: ~/.local/share/agent-tools/state/performance.jsonl

Each JSONL row records one completed command or dispatch with stage timings such as transform_ms, tts_generation_ms, tts_postprocess_ms, enqueue_ms, and total_ms.
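Because the log is plain JSONL, summarizing it takes only a few lines. The stage field names come from the description above; the sample rows and the `average_stage` helper are illustrative assumptions:

```python
import json

# Hypothetical helper: average one stage timing across performance.jsonl rows,
# skipping rows that do not record that stage.
def average_stage(rows, stage):
    values = [row[stage] for row in rows if stage in row]
    return sum(values) / len(values) if values else None

# Sample rows with the documented field names; real rows may carry more keys.
rows = [json.loads(line) for line in [
    '{"transform_ms": 820, "tts_generation_ms": 310, "total_ms": 1400}',
    '{"transform_ms": 760, "tts_generation_ms": 290, "total_ms": 1250}',
]]
print(average_stage(rows, "transform_ms"))  # 790.0
```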

You can also manage AgentTools auto-TTS integration from the desktop controller UI and tray menu. AgentTools only needs one backend to be available:

  • Codex, with local ChatGPT login working
  • or Claude Code installed on PATH

If neither backend is available, the UI shows an info-only message telling you to install or sign in to any one of them first.

Transform text

echo "Rewrite this into short spoken narration." | agent-tools transform \
  --system-prompt-file prompt_examples/rewrite_for_tts.md

Optional controls:

echo "Input text" | agent-tools transform \
  --system-prompt-file prompt_examples/rewrite_for_tts.md \
  --provider codex \
  --model gpt-5 \
  --reasoning-effort medium \
  --fast

Experimental Claude Code-backed transform:

echo "Input text" | agent-tools transform \
  --system-prompt-file prompt_examples/rewrite_for_tts.md \
  --provider claude-code \
  --claude-model haiku \
  --claude-effort low

What the Claude wrapper does:

  • runs claude -p in a temporary minimal working directory
  • forces --max-turns 1, --tools "", --no-session-persistence, and --output-format json
  • uses --system-prompt with the same rewrite prompt file you pass to transform
  • does not require or use any unofficial Claude Code API
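The flag list above corresponds to an invocation along these lines. `build_claude_cmd` is a hypothetical sketch, not the package's actual wrapper code, and the exact argument assembly is an assumption:

```python
# Sketch of the constrained single-turn claude invocation described above.
# The model allowlist mirrors the documented haiku/sonnet restriction.
def build_claude_cmd(system_prompt, model="haiku"):
    if model not in ("haiku", "sonnet"):
        raise ValueError("only haiku and sonnet are allowed")
    return [
        "claude", "-p",
        "--model", model,
        "--max-turns", "1",
        "--tools", "",
        "--no-session-persistence",
        "--output-format", "json",
        "--system-prompt", system_prompt,
    ]

cmd = build_claude_cmd("Rewrite the input as short spoken narration.")
# The wrapper would run this in a temporary minimal working directory,
# e.g. subprocess.run(cmd, input=text, capture_output=True, text=True, cwd=tmpdir)
```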

Important limitation:

  • --claude-bare is supported, but it is off by default because local Claude help shows bare mode only reads ANTHROPIC_API_KEY or apiKeyHelper auth. If you rely on normal Claude login state, keep --claude-bare off.

Text to speech

echo "Hello world." | agent-tools tts --output-file hello.wav

Queue already-prepared speech on Windows:

echo "Hello world." | agent-tools tts --output-mode play --source agent-a

Desktop controller UI

agent-tools ui

If the controller is already running, this focuses the existing window instead of starting a second process.

The controller behavior is:

  • when neither Codex nor Claude Code is available, the normal playback UI is hidden and the window shows an info-only message telling you to install or sign in to any one backend first
  • when either backend is available, the normal playback UI stays usable
  • a single switch soft-disables or re-enables AgentTools auto-TTS processing when the relevant AgentTools hook/integration is installed
  • a dropdown chooses the default transform engine used for ttsify and desktop auto-TTS: Codex or Claude Code
  • if the saved/default provider is unavailable but another backend is available, AgentTools falls back automatically unless you explicitly force a provider on the CLI

End-to-end pipeline

cat input.txt | agent-tools transform \
  --system-prompt-file prompt_examples/rewrite_for_tts.md \
  | agent-tools tts --voice af_heart --output-file out.wav

Notes

  • ttsify is the recommended end-user path; transform and tts remain available as building blocks.
  • transform reads stdin by default and writes plain text to stdout.
  • tts reads stdin by default and writes WAV bytes to stdout unless --output-file is set.
  • tts and ttsify support --output-mode play on Windows.
  • in play mode, audio is queued into a single background controller process.
  • agent-tools ui launches or focuses the popup/tray controller window.
  • controller shortcuts: Space pause/resume, Esc stop, Ctrl+R replay, Ctrl+N next.
  • tts and ttsify default to --device auto.
  • auto device selection uses a real CUDA probe, not just torch.cuda.is_available().
  • agent-tools install-cuda reinstalls the full PyTorch stack (torch, torchvision, torchaudio) into the current Python environment and validates the full Kokoro import chain in a fresh subprocess by default.
  • transform refreshes ChatGPT tokens when the Codex backend returns 401.
  • the experimental Claude transform path uses the official claude CLI only; it is intentionally limited to one turn with tools disabled.
  • Native Windows Codex uses notify; hooks.json lifecycle hooks are not used there.
  • semantic-release now owns future Python package version bumps and py-v* tags.
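The "real CUDA probe" idea from the notes above can be sketched as follows: instead of trusting torch.cuda.is_available(), attempt an actual allocation on the device. The function name is illustrative, not the package's actual API:

```python
# Hypothetical probe: torch.cuda.is_available() can report True even when
# the installed wheel cannot actually use the GPU, so try a real allocation.
def cuda_really_works():
    try:
        import torch
        torch.zeros(1, device="cuda")  # fails if CUDA is broken or absent
        return True
    except Exception:
        return False

device = "cuda" if cuda_really_works() else "cpu"
```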

CPU performance

Measured on this machine on April 15, 2026 with forced CPU:

Scenario                                            Wall time   Audio time   Real-time factor
first-ever cold init after dependency/model setup   ~43.1 s     n/a          n/a
cached init                                         ~2.9 s      n/a          n/a
warm short                                          0.309 s     4.80 s       0.064
warm medium                                         1.199 s     15.53 s      0.077
warm long                                           2.514 s     26.70 s      0.094

Interpretation:

  • warm CPU generation on this machine is about 10x-15x faster than realtime
  • the main cost is cold startup/model load, not steady-state synthesis
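The real-time factor is simply wall-clock synthesis time divided by generated audio duration, so the speedup over real time is its reciprocal. Using the warm-short row above:

```python
# Real-time factor = wall time / audio duration; lower is faster.
rtf = 0.309 / 4.80
speedup = 1 / rtf  # how many seconds of audio per second of compute

print(round(rtf, 3))      # 0.064
print(round(speedup, 1))  # 15.5
```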

To reproduce locally:

python scripts/benchmark_tts_cpu.py

Troubleshooting

  • Missing ~/.codex/auth.json: run codex login
  • Expired auth: rerun codex login if refresh fails permanently
  • Missing espeak-ng: install it for better English fallback behavior
  • Slow first run: expected; Kokoro downloads voices/models and initializes the pipeline
  • After changing Python versions for the interpreter that runs agent-tools, rerun agent-tools install-cuda in that same interpreter to repair the PyTorch stack for Kokoro
