Skip to main content

AI-powered multi-agent CLI coding assistant for local LLMs

Project description

Deus

Claude Code-style AI coding assistant that runs on your own GPU.

The problem it solves

Proprietary coding assistants charge per token — and token anxiety kills the "just let it figure it out" workflow. Every file read, every search, every iteration costs money and sends your code to someone else's server. Deus runs on a GPU you rent by the hour: analyze your entire codebase, run the agent in loops, explore freely. Flat rate. Your data stays yours.

How it works

Your prompt
    ↓
Complexity detection (simple? skip planning)
    ↓
Planner → action plan (files to read, searches to run, files to create)
    ↓
Parallel pre-loading (reads files + runs searches before agent starts)
    ↓
Agent loop (reads files, writes code, runs commands, searches web)
    ↓
Result

The planner analyzes your task and pre-loads all relevant context before the agent starts. Instead of the agent discovering it needs a file 3 turns in, it begins with everything already loaded.

Quick start

1. Install

pip install deuscode

2. Launch a GPU on RunPod

deus setup --runpod

Interactive wizard: picks model → GPU size → launches vLLM → writes your config automatically.

💡 Support Deus development by using our RunPod affiliate link: runpod.io/ref/ww1q3uhd

  • ~40x cheaper than API tokens for intensive use
  • Flat hourly rate — no token counting, no throttling
  • Code never leaves your pod — not sent to any third-party API
  • Auto-stop built in — Deus stops your pod when done, no idle charges
  • Any open-source model — Qwen2.5-Coder, DeepSeek, Llama 3, and more

3. Start coding

cd your-project
deus "add Stripe payment integration"

Or launch the interactive REPL:

deus

Features

Intelligent planning

Every prompt goes through complexity detection first. Simple questions go straight to the agent. Complex tasks get a full action plan: which files to read, which docs to search, what to create, and how to validate the result. The agent starts with context already loaded.

Web search built in

Before the agent writes a line of code, the planner can queue web searches. Latest API docs, Stack Overflow answers, package changelogs — fetched and injected into context automatically. Uses DuckDuckGo by default; swap to Brave Search with an API key.

Full tool suite

  • Read/write files — diff preview and confirmation before any write
  • Bash commands — explicit confirmation before execution
  • Web search — DuckDuckGo (default) or Brave Search
  • Repo-map — scans your codebase structure, passed to every prompt

Model management

deus model list                    # see downloaded models on your pod
deus model download --size small   # pick and download a model

Coding models:

Model VRAM Notes
Qwen2.5-Coder-1.5B 4 GB Tiny, any GPU
Qwen2.5-Coder-3B 8 GB Small but capable
Qwen2.5-Coder-7B 16 GB Fast, cheap, good
Qwen2.5-Coder-14B 28 GB Best mid-size
DeepSeek-Coder-V2-Lite 32 GB MoE, strong for size
Qwen2.5-Coder-32B 64 GB Top quality, needs A100

General models: Llama 3.1/3.2 (1B–70B), Mistral 7B/Nemo 12B, Gemma 2 (9B/27B).

Pod management

deus setup --runpod    # launch a new pod
deus connect --runpod  # connect to an existing pod
deus setup --stop      # stop pod (stop paying)

Enable auto_stop_runpod: true in ~/.deus/config.yaml to stop the pod automatically after each prompt.

CLI reference

deus "your prompt"              ask anything
deus "prompt" --path ./src      specify working directory
deus "prompt" --no-map          skip repo-map (faster for large repos)
deus "prompt" --model MODEL_ID  override active model

deus setup --runpod             launch a new RunPod pod
deus setup --stop               stop current pod
deus connect --runpod           connect to existing pod

deus model list                 list downloaded + available models
deus model download             download a model to your pod
deus model download --size small  filter by size (small/medium/big/all)

In-chat commands (REPL only):

--model                show active model and downloaded list
--model MODEL_ID       switch active model mid-session

Comparison

Deus Claude Code Cursor
Runs on your GPU
No per-token cost
Code stays private
Any open-source model
Task planning
Web search
Terminal-native
Free to use

Configuration

Config lives at ~/.deus/config.yaml. Created automatically on first run.

base_url: https://your-runpod-endpoint/v1
api_key: your-key
model: Qwen/Qwen2.5-Coder-7B-Instruct
max_tokens: 8192
auto_stop_runpod: false
search_backend: duckduckgo   # or: brave
brave_api_key: ""            # required if search_backend: brave

Requirements

  • Python 3.12+
  • A RunPod account — or any OpenAI-compatible vLLM endpoint
  • GPU with 4 GB+ VRAM (16 GB+ recommended for coding tasks)

License

AGPL-3.0 — free to use, modify, and distribute. Commercial use requires a separate license. See LICENSE for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deuscode-0.7.4.tar.gz (48.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

deuscode-0.7.4-py3-none-any.whl (44.5 kB view details)

Uploaded Python 3

File details

Details for the file deuscode-0.7.4.tar.gz.

File metadata

  • Download URL: deuscode-0.7.4.tar.gz
  • Upload date:
  • Size: 48.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.12

File hashes

Hashes for deuscode-0.7.4.tar.gz
Algorithm Hash digest
SHA256 cb4440dcdb3181a67862e2d3c79bd24b0b5ae46b310304e89117cc755679d55e
MD5 50fa6a50a1abc48907ed12f2fc70c601
BLAKE2b-256 860bcb50427ff2774dea05819d2ec8ad13fc96c56481bc0873eca94258b3485c

See more details on using hashes here.

File details

Details for the file deuscode-0.7.4-py3-none-any.whl.

File metadata

  • Download URL: deuscode-0.7.4-py3-none-any.whl
  • Upload date:
  • Size: 44.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.12

File hashes

Hashes for deuscode-0.7.4-py3-none-any.whl
Algorithm Hash digest
SHA256 957b7099031e316cf7a75a2cd4044b7103f4a9a6d33056e2f4e302d4d9656554
MD5 67e7179e217b7ab98db17ed4706adaf7
BLAKE2b-256 754d4219a879cbc0f1f8b1c80dcf856d82b7c01e4eef0763aee9df8d790441e0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page