Turn a team meeting into shipped code. Recording in, working tested PR out, humans approving at every gate.

These details have not been verified by PyPI

Project links

Project description

Cascade

An open-source AI agent that takes a meeting recording, a tracker ticket, or a one-line prompt, and ships a tested pull request. Self-hosted. Uses your LLM key. Your code never leaves your org.

Status: pre-alpha, building in public. Star and watch to follow along, or join the early beta list.

Why Cascade exists

Most AI dev tools assume your team works exactly the way they expect. You write code in their IDE. You pay for their LLM. You host on the VCS they support. You start work by typing into a chat window.

Real engineering teams don't look like that. Some discuss requirements in standups, others work from Jira tickets, others just have a senior dev send a Slack message. Some teams have a corporate Copilot subscription and no Anthropic budget. Some teams can't legally send code to a SaaS at all. Some teams use GitLab. Some use Azure DevOps. Some still use Bitbucket.

Cascade was built to meet teams where they already work. Bring your own LLM. Use your existing tracker. Keep your code on your own infrastructure. Pick whichever input mode fits how you actually capture requirements.

How it works

There are three on-ramps and one pipeline:

                       INPUT                                    OUTPUT
                       =====                                    ======
                       meeting recording (.mp3/.mp4)            ┐
                       tracker ticket                  ┐        │   plan, code, test
                         (Jira / Linear /              │═══════>│════════════════════>  PR on
                          GitHub Issues /              │        │                          GitHub /
                          Azure Boards /               │        │   LLM:                  GitLab /
                          GitLab Issues)               │        │   Anthropic /            Bitbucket /
                       ad-hoc prompt                   │        │   OpenAI /               Azure DevOps
                                                       ┘        │   Google Gemini /
                                                                │   Claude Code (no API key) /
                                                                │   Ollama / vLLM (self-hosted)
                                                                ┘

A meeting recording, a ticket, or a typed prompt all turn into the same thing: a Story. Each story moves through the same pipeline. A human reviews the extracted stories before any code gets generated. A human reviews the PR before any code gets merged. Cascade does the work in between.

What makes it different

A handful of choices set Cascade apart from the rest of the agent landscape:

Everything runs on your machine or your CI. Cascade doesn't ship your code anywhere except to the LLM provider you configured.
You can use it with no API key if you already have a Claude Code subscription. The Claude Code SDK becomes the LLM transport, so the marginal cost is zero.
It's polyglot from day one. Python, TypeScript, JavaScript, Go, Rust, Java, Ruby, and C# are supported. Adding a ninth language is one entry in a registry.
Humans approve every gate. Cascade is built for trust, not for autonomy theatre. Engineers stay in the loop.
Team memory is a first-class concept. A team-memory/ directory captures your conventions, decisions, glossary, and prior work. Every AI stage reads from it.

Quick start

# Alpha release. Use --pre to opt in to pre-release versions:
pip install --pre cascade-agent                 # base install (anthropic + github + cli)
pip install --pre "cascade-agent[all]"          # adds non-default providers + Studio web dashboard
pip install --pre "cascade-agent[all,ingest]"   # also adds meeting transcription (whisper)

cascade init                           # scaffold cascade.yaml + smart-seeded team-memory/

cascade configure llm anthropic --key sk-ant-xxx --set-default
# Or skip the key entirely if you have Claude Code installed:
cascade configure llm claude_code --set-default

cascade configure vcs github --token ghp-xxx

cascade doctor                         # verify everything is wired correctly
cascade try                            # risk-free end-to-end pipeline test

# Then pick whichever entry point matches how the work showed up:
cascade prompt "Add cursor pagination to /api/users with ?limit and ?after"
cascade ticket jira:PROJ-123
cascade ticket github:myorg/myrepo#42
cascade ingest recordings/standup.mp3       # writes transcripts/*.yaml
cascade extract transcripts/standup.yaml    # writes stories/*.yaml
cascade review stories/standup.yaml         # interactive accept / edit / reject
cascade build stories/standup.yaml          # plan, code, test, PR

# Prefer a web dashboard?
cascade ui                                  # opens http://localhost:8000

First-run experience

Cascade includes two commands designed to remove the "is this thing working?" anxiety:

cascade doctor runs ~11 health checks (Python version, optional extras, git, cascade.yaml, team memory, language detection, LLM credentials, VCS credentials, test runner) and reports each one as OK / warning / failed with actionable hints for fixing anything that's wrong. Inspired by gh auth status and homebrew doctor.
cascade try runs a built-in toy story end-to-end in a disposable temp directory. It asks Cascade to add a tiny hello() function and a test, generates the code, runs the test, and reports whether it all worked. Touches nothing in your real repo. Five minutes of confidence before you bet on a real build.

Run cascade init to scaffold a project; the team-memory files come pre-seeded with sensible defaults based on the language detected in your repo (Python projects get pytest conventions, Go projects get gofmt conventions, etc.). Edit to personalize.

Cost visibility

Every LLM call surfaces its cost in the CLI output. After cascade extract, cascade build, or cascade try:

  cost: $0.12 (8,234 in / 2,156 out tokens, anthropic/claude-opus-4-7)

For multi-story builds, a session total prints at the end:

  session: 4 stories built, 8 LLM calls, $0.84

And cascade build --max-cost 5.00 aborts between stories if cumulative cost would exceed your budget. Self-hosted providers (Claude Code, Ollama) report free because the cost is covered by your subscription or local resources.

Streaming progress

LLM calls can take 20-60 seconds. Instead of silent terminals, Cascade prints animated per-stage spinners as the pipeline moves through plan -> code -> apply -> install -> test -> commit -> push -> PR, with a checkmark and short summary at the end of each stage. Works for cascade build, prompt, ticket, try, and extract.

For CI runs, scripts, or anything non-interactive, pass -q / --quiet to suppress the animation:

cascade --quiet build stories/standup.yaml

Error messages that actually help

When something goes wrong, Cascade tells you what happened AND what to try next. No stack traces for known failure modes.

error: No API key configured for LLM provider 'anthropic'

  How to fix:
    * Set it now: cascade configure llm anthropic --key <YOUR_KEY>
    * Or export the env var: export ANTHROPIC_API_KEY=<YOUR_KEY>
    * Or use Claude Code instead (no API key needed):
      cascade configure llm claude_code --set-default
    * Or use a local model with Ollama (no API key needed):
      cascade configure llm ollama --model llama3.1 --set-default

  Learn more: cascade doctor

Multiple fixes ranked by likelihood, concrete commands ready to paste, and a pointer to a deeper diagnostic when none of them fit. Inspired by Rust's compiler diagnostics.

Cascade Studio (the web dashboard)

Cascade ships with a web UI that surfaces the same operations as the CLI in a friendlier interface: visual story review, build history, provider config forms, and a team-memory editor with markdown preview.

pip install --pre "cascade-agent[studio]"   # adds FastAPI + uvicorn
cascade ui                                  # starts at http://localhost:8000

The dashboard runs locally. No remote service, no auth required for single-user mode, your code never leaves your machine. The frontend ships pre-built inside the pip package; no Node.js install needed at runtime.

Studio is in early development; the frontend source lives at Thinknext-Software-Solutions/Cascade-Studio and the bundled UI updates with every cascade-agent release.

Credentials live at ~/.config/cascade/config.yaml (mode 0600). Run cascade configure show to see what's set, with secrets masked.

The pipeline in detail

+-------------------------------------------------------------------+
|                       TEAM MEMORY LAYER                            |
| (conventions, decisions, glossary, prior work, constraints)        |
+-------------------------------------------------------------------+
            ^    ^    ^    ^    ^    ^    ^    ^
            |    |    |    |    |    |    |    |  every stage
            |    |    |    |    |    |    |    |  reads memory
            |    |    |    |    |    |    |    |
+---------+ ++  ++  +-+----+ ++ ++ ++ +-+-----+
|  input  |->|IN|->|TX|->|stories|->|RV|->|PL|->|CD|->|PR open|
+---------+ +--+ +--+ +-------+ +--+ +--+ +--+ +-------+
            ingest transcribe extract review plan code   PR
                                          |
                            HUMAN APPROVES EACH GATE
                            (story review + final PR review)

Stage	Module	What it does
Ingest	`transcribe.py`	Audio or video to text via Whisper. Three backends: faster-whisper, openai-whisper, OpenAI API.
Transcribe	(same)	Optional speaker diarization via pyannote, so each turn knows who said it.
Extract	`extractor.py`	Transcript into structured user stories with Given/When/Then acceptance criteria.
Review	`review.py`	Interactive accept / edit / reject / skip per story. Edit opens the story YAML in your `$EDITOR`.
Plan	`planner.py`	Approved story into a file-level implementation plan, with risks and explicit out-of-scope notes.
Code	`coder.py`	Plan into full file contents (modify, create, or delete). Language-aware, conventions-aware.
Test	`tester.py`	Run the language's test command. Capture pass/fail, output, duration.
Repo	`repo.py` + `vcs*.py`	Create branch, apply changes, commit, push, open PR.

Providers

LLM providers

Provider	API key needed?	Notes
Anthropic Claude	Yes	Default. Uses tool-use for structured output.
OpenAI	Yes	Structured Outputs via `response_format json_schema`. Works with Azure OpenAI, OpenRouter, or vLLM via `--base-url`.
Google Gemini	Yes	`response_schema` with Pydantic models.
Claude Code SDK	No	Uses your local Claude Code subscription. Zero-setup.
Ollama / vLLM	No	Local self-hosted models via OpenAI-compatible API.

VCS providers

Provider	Self-hosted supported?
GitHub	Yes (GitHub Enterprise via `--base-url`)
GitLab	Yes (cloud and self-hosted)
Bitbucket Cloud	Cloud only in v0.1
Azure DevOps Repos	Yes

Issue trackers (for `cascade ticket`)

GitHub Issues, Jira (Cloud and Server), Linear, Azure DevOps Boards, GitLab Issues.

Languages

Python, TypeScript, JavaScript, Go, Rust, Java, Ruby, C#.

Adding a new language is a single entry in languages.py. Each entry captures the file extensions, default source and test directories, the test command, the install command, and any language-specific guidance to pass to the LLM.

Security model

Cascade preserves a small set of invariants. Each one is a non-goal of the design. If any of them is violated, that's a bug, and we treat it as a security issue.

Cascade never merges PRs. Humans always approve before merge.
Cascade only writes to paths matching paths.allowed in cascade.yaml, minus anything in paths.disallowed. Deny wins over allow.
Cascade never modifies .github/, cascade.yaml, or team-memory/ by default.
Cascade only executes the configured test_command and git shell commands. No arbitrary shell access.
Source code, transcripts, and meeting recordings stay on the local machine and the configured LLM provider. Nothing else.
User credentials at ~/.config/cascade/config.yaml are stored with mode 0600.

See SECURITY.md for the full threat model and how to report a vulnerability.

Configuration

Four layers, highest wins:

CLI flags for per-call overrides like --language go or --model claude-opus-4-7.
Project config at ./cascade.yaml for per-repo settings, language overrides, and path allowlists. See cascade.yaml.example.
User config at ~/.config/cascade/config.yaml for credentials and personal defaults. Managed via cascade configure.
Environment variables as a fallback. ANTHROPIC_API_KEY, OPENAI_API_KEY, GITHUB_TOKEN, JIRA_API_TOKEN, and so on.

Roadmap

Version	Target	Highlights
v0.1 (tech preview)	2026-09-15	Foundation, all 5 LLM providers, all 4 VCS providers, all 5 issue trackers, ingest / review / build pipeline
v0.2	2026-11-15	Quality bar via real-world dogfooding, vector-store team memory (RAG over embeddings), Copilot CLI provider, multi-story batch build
v0.3	2027-01-15	Real-time meeting capture, Slack and Teams as sources, multi-repo coordination
v1.0	2027-04-15	Web UI for review, fine-tuned routing, GA

Contributing

Contributions of all sizes welcome. See CONTRIBUTING.md.

Maintainer response targets: issues within 5 business days, PRs within 3.

License

MIT. Use freely, commercially, anywhere.

Built by ThinkNext Software Solutions. Questions? hello@thinknextsoftware.com.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.1.0a1 pre-release

May 26, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cascade_agent-0.1.0a1.tar.gz (357.2 kB view details)

Uploaded May 26, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

cascade_agent-0.1.0a1-py3-none-any.whl (404.1 kB view details)

Uploaded May 26, 2026 Python 3

File details

Details for the file cascade_agent-0.1.0a1.tar.gz.

File metadata

Download URL: cascade_agent-0.1.0a1.tar.gz
Upload date: May 26, 2026
Size: 357.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for cascade_agent-0.1.0a1.tar.gz
Algorithm	Hash digest
SHA256	`5c98b806300b87c585d8a4b31ab267a156769e1225016b721217d4d522072733`
MD5	`22f3c10b42872711ed19b29f766d5605`
BLAKE2b-256	`4d9aedab35dd0b3b7d4c35d6448d892268fee78eba72075b0e2a8aaf4a0766a2`

See more details on using hashes here.

File details

Details for the file cascade_agent-0.1.0a1-py3-none-any.whl.

File metadata

Download URL: cascade_agent-0.1.0a1-py3-none-any.whl
Upload date: May 26, 2026
Size: 404.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for cascade_agent-0.1.0a1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`17d034c3f995d13e82fa136ba02775f6dc2a5377cf9ac40ff4c18331d9b0f732`
MD5	`750da4bb45a4a40a527396f0d0c34d2f`
BLAKE2b-256	`441cf5b6873d981f65c2ee0029e7557e5670c3853d7f6843be2759cb4a8e8450`

See more details on using hashes here.

cascade-agent 0.1.0a1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Cascade

Why Cascade exists

How it works

What makes it different

Quick start

First-run experience

Cost visibility

Streaming progress

Error messages that actually help

Cascade Studio (the web dashboard)

The pipeline in detail

Providers

LLM providers

VCS providers

Issue trackers (for cascade ticket)

Languages

Security model

Configuration

Roadmap

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Issue trackers (for `cascade ticket`)