
RawLLM – minimal orchestrator that exposes the raw power of LLMs

Project description

RawLLM

A minimal proof of concept of the "dumb orchestrator, smart model" idea. The LLM evolves itself by writing plugins (add_plugin / run_plugin). Inspired by MemPalace (96.6% on LongMemEval) and Claude Code's TAOR loop. The core is immutable and under 150 lines of code; even HTTP transport is a plugin.

Idea

Instead of hard-coding complexity into the orchestrator, the model is given two tools:

  • add_plugin(name, code) — write and save a new plugin (or overwrite an existing one)
  • run_plugin(name, input_data) — execute an already-loaded plugin by name

The core (orchestrator) is immutable and deliberately "dumb" (~150 lines). All intelligence — creating new capabilities, coordinating agents, managing memory, parsing data — lives in the LLM’s reasoning and in the plugins it generates. The model decides when to write a plugin and can hot-reload them at any time.
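For illustration, a plugin the model might register via add_plugin("current_time", code) could look like the sketch below. The run(input_data) entry point is an assumption made for this example; see plugins/http.py for the actual contract.

# Hypothetical plugin body passed as the `code` argument of add_plugin.
"""Return the current UTC time as an ISO-8601 string."""
from datetime import datetime, timezone

def run(input_data):
    # input_data is ignored; the plugin only reports the current time.
    return {"now": datetime.now(timezone.utc).isoformat()}

The model could then execute it at any time with run_plugin("current_time", {}).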

Inspiration

  • MemPalace — the approach that dominated LongMemEval (96.6%) without complex RAG, simply by giving the model raw data and freedom to decide.
  • Claude Code — the TAOR (Think‑Act‑Observe‑Repeat) architecture and the "dumb orchestrator, smart model" principle.
  • Critique of over-engineered RAG pipelines — give the model a clean context and let it decide.

Status

Implemented — core, HTTP plugin, tests and CI.

Quick start

# 1. Install dependencies
pip install -r requirements.txt

# 2. Create .env with your Anthropic API key
echo "ANTHROPIC_API_KEY=sk-ant-..." > .env

# 3. Start the orchestrator (HTTP server on port 8080)
python run.py

# 4. Send a request
curl -X POST http://localhost:8080/ \
     -H "Content-Type: application/json" \
     -d '{"prompt": "Write a plugin that returns the current time", "context": {}}'

The server port can be overridden via the HTTP_PORT environment variable.
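The same request can be issued from Python; this sketch assumes the default port and the request body shown above.

# Minimal Python equivalent of the curl call above (assumes the server listens on 8080).
import requests

resp = requests.post(
    "http://localhost:8080/",
    json={"prompt": "Write a plugin that returns the current time", "context": {}},
    timeout=60,
)
print(resp.json())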

Docker sandbox (WSL)

For more isolated plugin execution in WSL, use the Docker backend. In this mode, untrusted plugins run as a separate user behind a container filesystem boundary instead of sharing the orchestrator process's privileges. Two users are involved:

  • rawllm-core - the user the orchestrator process runs as on the host
  • rawllm-plugin - the user plugin subprocesses run as inside the sandbox container

1) Build sandbox image

docker build -t rawllm/plugin-sandbox:latest -f docker/sandbox/Dockerfile .

2) Enable docker backend

echo "SANDBOX_BACKEND=docker" >> .env
echo "SANDBOX_DOCKER_IMAGE=rawllm/plugin-sandbox:latest" >> .env

3) Run tests in WSL (Docker required)

pytest -q

When the Docker backend is enabled, plugin execution uses isolated volumes (a sketch of the resulting invocation follows the list):

  • rawllm_workspace (rw)
  • rawllm_core_repo (ro snapshot)
  • rawllm_plugin_store (ro snapshot)
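Conceptually, each sandboxed execution boils down to a docker run with those three mounts. The sketch below only illustrates that shape; the mount targets, container user, and wrapper path are assumptions, not the exact command the backend issues.

# Illustrative only: roughly the kind of invocation the docker backend performs.
# Mount targets, the container user, and the wrapper path are assumptions.
import subprocess

subprocess.run(
    [
        "docker", "run", "--rm",
        "--user", "rawllm-plugin",
        "-v", "rawllm_workspace:/workspace",        # rw
        "-v", "rawllm_core_repo:/core:ro",          # ro snapshot
        "-v", "rawllm_plugin_store:/plugins:ro",    # ro snapshot
        "rawllm/plugin-sandbox:latest",
        "python", "/core/sandbox_wrapper.py", "my_plugin",
    ],
    check=True,
)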

Running with Free / Lightweight LLMs

run.py selects the LLM backend via the LLM_PROVIDER environment variable (default: anthropic) and supports any OpenAI-compatible provider. All security and versioning features remain active regardless of provider.

Supported providers

LLM_PROVIDER           API key env var          Default model
anthropic (default)    ANTHROPIC_API_KEY        claude-3-5-sonnet-20241022
groq                   GROQ_API_KEY             llama3-70b-8192
gemini                 GEMINI_API_KEY           gemini-2.0-flash
openrouter             OPEN_ROUTER_API_KEY      qwen/qwen3-coder:free
deepseek               DEEPSEEK_API_KEY         deepseek-chat
ollama                 (none required)          llama3.2:3b
ollama-qwen-coder      (none required)          qwen2.5-coder:7b

Override any default with LLM_MODEL and LLM_BASE_URL.
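All non-Anthropic providers are addressed through the same OpenAI-compatible client, so overriding LLM_MODEL and LLM_BASE_URL amounts to pointing a standard client elsewhere. The sketch below is illustrative only (it requires the openai package); RawLLM's own client lives in core/llm/clients/openai_compat.py and may differ in detail.

# Rough sketch of what an OpenAI-compatible override boils down to; not RawLLM's client.
import os
from openai import OpenAI

client = OpenAI(
    base_url=os.environ.get("LLM_BASE_URL", "http://localhost:11434/v1"),  # Ollama default here
    api_key=os.environ.get("GROQ_API_KEY", "ollama"),  # local Ollama ignores the key
)
reply = client.chat.completions.create(
    model=os.environ.get("LLM_MODEL", "llama3.2:3b"),
    messages=[{"role": "user", "content": "ping"}],
)
print(reply.choices[0].message.content)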

Groq (free tier)

echo "GROQ_API_KEY=gsk_..." >> .env
LLM_PROVIDER=groq python run.py

Google Gemini

echo "GEMINI_API_KEY=AIza..." >> .env
LLM_PROVIDER=gemini python run.py

OpenRouter (free models)

echo "OPEN_ROUTER_API_KEY=sk-or-..." >> .env
LLM_PROVIDER=openrouter python run.py
# Use a specific free model:
LLM_PROVIDER=openrouter LLM_MODEL=google/gemma-3-27b-it:free python run.py

DeepSeek

echo "DEEPSEEK_API_KEY=sk-..." >> .env
LLM_PROVIDER=deepseek python run.py

Ollama (fully local, no API key)

# 1. Install Ollama: https://ollama.com/
ollama pull llama3.2:3b   # or any model you prefer

# 2. Run
LLM_PROVIDER=ollama python run.py
# Custom model:
LLM_PROVIDER=ollama LLM_MODEL=mistral python run.py

Local Qwen Coder 7B for container testing

# 1. Pull the local coding model in WSL / host environment
ollama pull qwen2.5-coder:7b

# 2. Run RawLLM against the dedicated provider alias
LLM_PROVIDER=ollama-qwen-coder python run.py

If the orchestrator itself runs in a container and Ollama stays on the host, override the endpoint explicitly:

LLM_PROVIDER=ollama-qwen-coder \
LLM_BASE_URL=http://host.docker.internal:11434/v1 \
python run.py

CLI (rawllm)

Install the package in editable mode to get the rawllm command:

pip install -e .

Or invoke directly without installing:

python cli.py <command>

Orchestrator lifecycle

rawllm run                        # use default provider (anthropic)
rawllm run --provider groq        # use a specific provider

Plugin management

rawllm plugin list
rawllm plugin show my_plugin
rawllm plugin add my_plugin path/to/code.py
rawllm plugin rollback my_plugin

Plugin authoring contract

Every plugin should carry a module-level docstring, which RawLLM uses as a prompt. The docstring should describe the plugin's role, its input/output contract, operational constraints, and failure behavior. See plugins/http.py for a reference template.
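A hypothetical plugin following that contract (names and fields invented for illustration; plugins/http.py remains the authoritative template):

"""Word-count plugin (illustrative example, not shipped with RawLLM).

Role: count words in a piece of text.
Input: {"text": "<string>"} passed as input_data.
Output: {"words": <int>} on success.
Constraints: stdlib only; no network or filesystem access.
Failure: returns {"error": "<reason>"} instead of raising.
"""

def run(input_data):
    text = input_data.get("text")
    if not isinstance(text, str):
        return {"error": "missing or non-string 'text'"}
    return {"words": len(text.split())}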

Dependency approval

rawllm deps pending               # list modules awaiting approval
rawllm deps approve requests      # approve a module
rawllm deps reject requests       # reject a module
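The gate is driven by the imports found in submitted plugin code. A minimal sketch of that kind of check is shown below; the real logic lives in core/utils.py (extract_imports) and core/tool_executor.py and may differ.

# Minimal sketch of import-based dependency gating; illustrative only.
import ast

def extract_imports(code: str) -> set[str]:
    """Return top-level module names imported by the given source."""
    found = set()
    for node in ast.walk(ast.parse(code)):
        if isinstance(node, ast.Import):
            found.update(alias.name.split(".")[0] for alias in node.names)
        elif isinstance(node, ast.ImportFrom) and node.module:
            found.add(node.module.split(".")[0])
    return found

ALLOWED = {"json", "datetime", "requests"}          # mirrors ALLOWED_REQUIREMENTS
pending = extract_imports("import numpy\nimport json") - ALLOWED
print(pending)  # {'numpy'} -> would show up under `rawllm deps pending`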

Metrics & analytics

rawllm metrics show                          # all plugins, table format
rawllm metrics show --plugin my_plugin       # one plugin
rawllm metrics show --format json            # JSON output
rawllm metrics evolution my_plugin           # chronological timeline
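Metrics are plain JSON Lines written to metrics.jsonl (see core/metrics.py), so they can also be inspected without the CLI. The field names in this sketch are assumptions made for illustration.

# Read metrics.jsonl directly; the "plugin" field name is an illustrative assumption.
import json
from collections import Counter

events = Counter()
with open("metrics.jsonl", encoding="utf-8") as fh:
    for line in fh:
        event = json.loads(line)
        events[event.get("plugin", "<unknown>")] += 1

for plugin, count in events.most_common():
    print(f"{plugin}: {count} events")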

Configuration

rawllm config show
rawllm config set LLM_PROVIDER groq
rawllm config set ALLOWED_REQUIREMENTS "json,datetime,requests"

⚠️ Security Warning

Trusted in-process plugins and plugins executed through the legacy subprocess backend run with the same privileges as the orchestrator. Only when SANDBOX_BACKEND=docker is enabled do untrusted plugins run in a separate container with reduced privileges and isolated mounted volumes. Do not load plugins from untrusted sources in a production environment. This project is a research POC: run it only inside a hardened, isolated environment (sandbox, Docker, VM), and review the Docker runtime permissions before relying on the sandbox.

Architecture

rawllm/
├── core/
│   ├── llm/                    # LLM abstraction subpackage
│   │   ├── protocol.py         # LLMClientProtocol structural Protocol
│   │   ├── registry.py         # LLM_PROVIDERS — single source of truth
│   │   ├── factory.py          # get_llm_client(provider) factory
│   │   └── clients/
│   │       ├── anthropic.py     # AnthropicClient
│   │       └── openai_compat.py # OpenAICompatibleClient (Groq, Gemini, Ollama, …)
│   ├── plugin_manager.py       # Plugin loading, hot-reload, versioning, sandbox
│   ├── tool_executor.py        # Tool-call routing + dependency gating
│   ├── taor_loop.py            # Think → Act → Observe → Repeat loop
│   ├── config.py               # Settings: trusted_plugins, allowed_requirements
│   ├── metrics.py              # Event logging to metrics.jsonl
│   ├── sandbox_wrapper.py      # Isolated subprocess wrapper for untrusted plugins
│   └── utils.py                # Shared utilities + extract_imports
├── plugins/
│   └── http.py                 # HTTP transport plugin (port set via HTTP_PORT)
├── plugins_store/              # Versioned plugin storage (created automatically)
│   ├── current/                # Symlinks to active versions
│   └── archive/{name}/         # Previous versions with metrics snapshots
├── cli.py                      # CLI entry point (rawllm)
├── system_prompt.txt           # LLM system prompt
└── run.py                      # Unified entry point (Anthropic / Groq / Gemini / Ollama / …)
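For orientation, the Think → Act → Observe → Repeat loop in core/taor_loop.py boils down to the following shape. This is a hedged sketch, not the actual implementation; the llm.complete, reply.tool_calls, and tools.execute interfaces are assumptions.

# Hedged sketch of a Think-Act-Observe-Repeat loop; not the actual core/taor_loop.py.
def taor_loop(llm, tools, prompt, max_steps=10):
    messages = [{"role": "user", "content": prompt}]
    for _ in range(max_steps):
        reply = llm.complete(messages)                 # Think: assumed client interface
        messages.append({"role": "assistant", "content": reply.text})
        if not reply.tool_calls:                       # no tool call -> final answer
            return reply.text
        for call in reply.tool_calls:                  # Act: add_plugin / run_plugin / ...
            result = tools.execute(call.name, call.arguments)
            messages.append({"role": "tool", "content": str(result)})  # Observe
    return "max steps reached"                         # budget exhausted; otherwise Repeat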

License

MIT — use the ideas freely, fork, and improve.



Download files

Download the file for your platform.

Source Distribution

rawllm_evo-0.2.0.tar.gz (53.2 kB)


Built Distribution


rawllm_evo-0.2.0-py3-none-any.whl (38.5 kB)


File details

Details for the file rawllm_evo-0.2.0.tar.gz.

File metadata

  • Download URL: rawllm_evo-0.2.0.tar.gz
  • Upload date:
  • Size: 53.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for rawllm_evo-0.2.0.tar.gz
Algorithm      Hash digest
SHA256         94bbe9bd13d925ff666e4294ace72bf68d7e7332e3e1b601e68f2aae894817f6
MD5            628b8337e4e09261d93e6ba7af69e3bc
BLAKE2b-256    6fba713d5d8965197e3ced9efd388d7b9a160b5507de130b92a7ac89b279ca19


Provenance

The following attestation bundles were made for rawllm_evo-0.2.0.tar.gz:

Publisher: publish.yaml on cherninkiy/rawllm

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file rawllm_evo-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: rawllm_evo-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 38.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for rawllm_evo-0.2.0-py3-none-any.whl
Algorithm      Hash digest
SHA256         8e1f217294117a263a7d04221fda87426277be2e420664921f7a38696d4d9a5e
MD5            f37695ae3b122d09c556fb951a548a96
BLAKE2b-256    24218024608df309285c6f60917d4f9415f3f4118aab296cf8e0d358dec0b4a1


Provenance

The following attestation bundles were made for rawllm_evo-0.2.0-py3-none-any.whl:

Publisher: publish.yaml on cherninkiy/rawllm

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
