Python package + Typer CLI implementation of the RLM with Modal notebook

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

fleet-rlm

A Python package implementing Recursive Language Models (RLM) with DSPy and Modal for secure, cloud-based code execution. This project demonstrates how LLMs can treat long contexts as external environments, using programmatic code exploration in sandboxed environments.

Reference: Recursive Language Models (Zhang, Kraska, Khattab, 2025)

Overview

Recursive Language Models (RLM) represent an inference strategy where:

LLMs treat long contexts as an external environment rather than direct input
The model writes Python code to programmatically explore data
Code executes in a sandboxed environment (Modal cloud)
Only relevant snippets are sent to sub-LLMs for semantic analysis

This package provides both a comprehensive Jupyter notebook and a Typer CLI for running RLM workflows.

Using dspy.RLM with Claude Code

fleet-rlm is designed to work seamlessly with Claude Code (Claude's agentic coding capabilities). The bundled skills, agents, team templates, and hooks enable Claude to leverage dspy.RLM for complex, long-context tasks.

Why Use RLM with Claude Code?

Challenge	RLM Solution
Context window limits	Code explores data programmatically instead of loading everything
Complex multi-step analysis	Sandbox code can delegate semantic work via `llm_query()` / `llm_query_batched()` (and Claude subagents can orchestrate this workflow)
Reproducibility	All exploration steps are Python code — auditable and replayable
Secure execution	Modal sandboxes isolate untrusted code from your environment

Agent Architecture

┌─────────────────────────────────────────────────────────────┐
│ CLAUDE CODE (Orchestrator)                                  │
│  ┌─────────────────┐  ┌─────────────────┐                  │
│  │ rlm-orchestrator│  │ rlm-specialist  │                  │
│  │ (coordinates    │→ │ (executes       │                  │
│  │  multi-agent    │  │  complex RLM    │                  │
│  │  workflows)     │  │  tasks)         │                  │
│  └─────────────────┘  └─────────────────┘                  │
│           │                    │                            │
│           ▼                    ▼                            │
│  ┌─────────────────────────────────────────┐               │
│  │ ModalInterpreter (dspy.CodeInterpreter) │               │
│  │  • Manages sandbox lifecycle            │               │
│  │  • Bridges tools to/from sandbox        │               │
│  │  • Handles llm_query() sub-calls        │               │
│  └─────────────────────────────────────────┘               │
│                        │                                    │
└────────────────────────┼────────────────────────────────────┘
                         │ JSON protocol
                         ▼
              ┌──────────────────────┐
              │ MODAL SANDBOX        │
              │  • Python 3.12       │
              │  • Isolated exec()   │
              │  • Sub-LLM calls     │
              └──────────────────────┘

Quick Start with Claude Code

Install scaffold assets to your Claude configuration:
```
uv run fleet-rlm init
```
This installs skills, agents, teams, and hooks to ~/.claude/ by default.

Use the rlm skill in Claude Code for long-context tasks:

@rlm Analyze this 500KB log file and extract all error patterns

Use the rlm-orchestrator agent for multi-step workflows:

@rlm-orchestrator Process these 10 documentation files, 
extract API endpoints, and generate a summary report

Sub-LLM Patterns

The RLM approach enables semantic delegation from sandbox code:

# Inside Modal sandbox, the LLM-generated code can call:
result = llm_query("Extract all function signatures from this code snippet")

# Or batch multiple sub-queries in parallel:
results = llm_query_batched([
    "Summarize section 1",
    "Summarize section 2",
])

In Claude workflows, the packaged rlm-subcall agent can be used as an orchestration pattern around these calls.

This pattern allows you to:

Delegate semantic analysis to sub-LLMs while keeping orchestration logic in Python
Process chunks in parallel for large documents
Accumulate results across multiple iterations using stateful buffers

Features

Secure Cloud Execution: Code runs in Modal's isolated sandbox environment
DSPy Integration: Built on DSPy 3.1.3 with custom signatures for RLM tasks
CLI Interface: Typer-based CLI with multiple demo commands
Extensible Tools: Support for custom tools that bridge sandbox and host
Secret Management: Secure handling of API keys via Modal secrets

Technology Stack

Component	Technology
Language	Python >= 3.10
Package Manager	`uv` (modern Python package manager)
Core Framework	DSPy 3.1.3
Cloud Sandbox	Modal
CLI Framework	Typer >= 0.12
Testing	pytest >= 8.2
Linting/Formatting	ruff >= 0.8

Installation

# Clone the repository
git clone https://github.com/qredence/fleet-rlm.git
cd fleet-rlm

# Install dependencies with uv
uv sync

# For development (includes test tools)
uv sync --extra dev

Scaffold Installation

fleet-rlm includes custom Claude skills, agents, team templates, and hooks optimized for RLM workflows. Install them to your user directory for use across all projects:

# List available scaffold assets
uv run fleet-rlm init --list

# Install all scaffold assets to ~/.claude/
uv run fleet-rlm init

# Or install to a custom directory
uv run fleet-rlm init --target ~/.config/claude

# Force overwrite existing files
uv run fleet-rlm init --force

# Install only team templates
uv run fleet-rlm init --teams-only

# Install only hooks
uv run fleet-rlm init --hooks-only

# Install all except hooks
uv run fleet-rlm init --no-hooks

Note: Scaffold assets are workflow definitions only. You still need to configure Modal authentication and secrets separately (see Setup Modal above). Agent Teams are experimental in Claude Code and require setting CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS=1 in Claude settings or your environment. Team config lives under ~/.claude/teams/{team}/config.json; runtime task state is managed by Claude under ~/.claude/tasks/{team}/.

Available Skills:

dspy-signature - Generate and validate DSPy signatures
modal-sandbox - Manage Modal sandboxes
rlm - Run RLM for long-context tasks
rlm-batch - Execute parallel tasks
rlm-debug - Debug RLM execution
rlm-execute - Execute Python in sandboxes
rlm-long-context - (EXPERIMENTAL) Research implementation
rlm-memory - Long-term memory persistence
rlm-run - Run RLM tasks with proper configuration
rlm-test-suite - Test and evaluate workflows

Available Agents:

modal-interpreter-agent - Direct Modal sandbox interaction
rlm-orchestrator - Multi-agent RLM coordination (recommended for complex tasks)
rlm-specialist - Complex RLM task execution with full sandbox access
rlm-subcall - Lightweight sub-LLM calls for delegation patterns

Available Team Templates:

fleet-rlm - Preconfigured multi-agent team layout for RLM orchestration

Available Hooks:

hookify.fleet-rlm-document-process.local.md - Prompt hook for document processing workflows
hookify.fleet-rlm-large-file.local.md - Prompt hook for large-file workflow suggestions
hookify.fleet-rlm-llm-query-error.local.md - Prompt hook for llm_query troubleshooting guidance
hookify.fleet-rlm-modal-error.local.md - Prompt hook for Modal/sandbox troubleshooting guidance

Tip: Start with rlm-orchestrator for multi-file or multi-step tasks. It coordinates rlm-specialist and rlm-subcall automatically.

Quick Start

1. Configure Environment

Copy the environment template and fill in your values:

# Copy the template
cp .env.example .env

# Edit with your API keys and model configuration
# See .env.example for detailed documentation on each variable
vim .env

Required variables:

DSPY_LM_MODEL - LLM model identifier (e.g., openai/gpt-4, google/gemini-3-flash-preview)
DSPY_LLM_API_KEY - API key for your LLM provider

Optional variables:

DSPY_LM_API_BASE - Custom API endpoint (if using proxy or self-hosted)
DSPY_LM_MAX_TOKENS - Maximum response length (default: 8192)

⚠️ Security: The .env file is gitignored and will never be committed. Keep your API keys safe!

2. Setup Modal

Important: Modal authentication and secrets are per user. Each developer needs to run Modal setup in their own environment and create secrets in their own Modal account. Credentials are not shared or bundled with fleet-rlm.

# Authenticate with Modal
uv run modal setup

# Create a Modal volume for data
uv run modal volume create rlm-volume-dspy

# Create Modal secret for API keys
uv run modal secret create LITELLM \
  DSPY_LM_MODEL=... \
  DSPY_LM_API_BASE=... \
  DSPY_LLM_API_KEY=... \
  DSPY_LM_MAX_TOKENS=...

3. Run CLI Commands

# Show all available commands
uv run fleet-rlm --help

# Run a basic demo
uv run fleet-rlm run-basic --question "What are the first 12 Fibonacci numbers?"

# Doc-analysis commands require --docs-path
# Extract architecture from documentation
uv run fleet-rlm run-architecture \
    --docs-path rlm_content/dspy-knowledge/dspy-doc.txt \
    --query "Extract all modules and optimizers"

# Extract API endpoints
uv run fleet-rlm run-api-endpoints --docs-path rlm_content/dspy-knowledge/dspy-doc.txt

# Find error patterns
uv run fleet-rlm run-error-patterns --docs-path rlm_content/dspy-knowledge/dspy-doc.txt

# Inspect trajectory on a document sample
uv run fleet-rlm run-trajectory \
    --docs-path rlm_content/dspy-knowledge/dspy-doc.txt \
    --chars 5000

# Use custom regex tool
uv run fleet-rlm run-custom-tool \
    --docs-path rlm_content/dspy-knowledge/dspy-doc.txt \
    --chars 5000

# Analyze or summarize long-context docs with sandbox helpers
uv run fleet-rlm run-long-context \
    --docs-path rlm_content/dspy-knowledge/dspy-doc.txt \
    --query "What are the main design decisions?" \
    --mode analyze

# Check Modal secrets are configured
uv run fleet-rlm check-secret

CLI Commands

Command	Description
`init`	Install bundled Claude scaffold assets
`run-basic`	Basic code generation (Fibonacci example)
`run-architecture`	Extract DSPy architecture from documentation
`run-api-endpoints`	Extract API endpoints from documentation
`run-error-patterns`	Find and categorize error patterns in docs
`run-trajectory`	Examine RLM execution trajectory
`run-custom-tool`	Demo with custom regex tool
`run-long-context`	Analyze or summarize a long document
`check-secret`	Verify Modal secret presence
`check-secret-key`	Inspect specific secret key

Architecture

┌─────────────────────────────────────────────────────────────┐
│ LOCAL (Jupyter/CLI)                                         │
│  ┌─────────────┐  ┌──────────────┐  ┌──────────────────┐   │
│  │ Planner LM  │  │ RLM Module   │  │ ModalInterpreter │   │
│  │ (decides    │→ │ (builds      │→ │ (manages sandbox │   │
│  │  what code  │  │  signatures) │  │  lifecycle)      │   │
│  │  to write)  │  │              │  │                  │   │
│  └─────────────┘  └──────────────┘  └──────────────────┘   │
│           │                │                  │             │
│           │                │ JSON stdin       │ gRPC        │
│           │                ↓                  ↓             │
└───────────┼────────────────┼──────────────────┼─────────────┘
            │                │                  │
            │                │                  ▼
            │                │     ┌──────────────────────┐
            │                │     │ MODAL CLOUD          │
            │                │     │  ┌────────────────┐  │
            │                └────→│  │ Sandbox        │  │
            │                      │  │ - Python 3.12  │  │
            │                      │  │ - Volume /data │  │
            │                      │  │ - Secrets      │  │
            │                      │  └────────────────┘  │
            │                      │           │          │
            │                      │           ▼          │
            │                      │  ┌────────────────┐  │
            │                      │  │ Driver Process │  │
            │                      │  │ - exec() code  │  │
            │                      │  │ - tool bridging│  │
            │                      │  └────────────────┘  │
            │                      └──────────────────────┘
            │                                │
            └────────────────────────────────┘
                        tool_call requests
                        (llm_query, etc.)

Package Structure

src/fleet_rlm/
├── __init__.py      # Package exports
├── cli.py           # Typer CLI interface
├── config.py        # Environment configuration
├── driver.py        # Sandbox protocol driver
├── interpreter.py   # ModalInterpreter implementation
├── runners.py       # High-level RLM demo runners
├── signatures.py    # DSPy signatures for RLM tasks
└── tools.py         # Custom RLM tools

Module Descriptions

config.py: Loads environment variables, configures DSPy's planner LM, guards against Modal package shadowing
cli.py: Typer CLI with commands for running demos and checking secrets
driver.py: Runs inside Modal's sandbox as a long-lived JSON protocol driver
interpreter.py: DSPy-compatible CodeInterpreter managing Modal sandbox lifecycle
runners.py: High-level functions orchestrating complete RLM workflows
signatures.py: RLM task signatures (ExtractArchitecture, ExtractAPIEndpoints, etc.)
tools.py: Custom tools like regex_extract() for RLM use

RLM Patterns

Pattern 1: Navigate → Query → Synthesize

Code searches for headers in documentation
llm_query() extracts info from relevant sections
SUBMIT(modules=list, optimizers=list, principles=str) returns structured output

Pattern 2: Parallel Chunk Processing

Split documents into chunks by headers
llm_query_batched([chunk1, chunk2, ...]) executes in parallel
Aggregate results into final output

Pattern 3: Stateful Multi-Step

Search for keywords in documentation
Save matches to variable (persists across iterations)
Query LLM to categorize findings
Iterate with refined queries

Testing

# Run all tests
uv run pytest

# Or via Make
make test

Test File	Purpose
`test_cli_smoke.py`	CLI help display, command discovery, error handling
`test_config.py`	Environment loading, quoted values, fallback API keys
`test_driver_protocol.py`	SUBMIT output mapping, tool call round-trips
`test_tools.py`	Regex extraction, groups, flags

Development

# Install dev dependencies
make sync-dev

# Run linting
make lint

# Format code
make format

# Run all checks
make check

# Run release validation (lint, tests, build, twine check)
make release-check

# Install pre-commit hooks
make precommit-install
make precommit-run

Release process documentation is in RELEASING.md, including the TestPyPI-first workflow.

Jupyter Notebook

The original implementation is available as a Jupyter notebook:

# Launch Jupyter Lab
uv run jupyter lab notebooks/rlm-dspy-modal.ipynb

# Execute headlessly (for CI/validation)
uv run jupyter nbconvert \
  --to notebook \
  --execute \
  --inplace \
  --ExecutePreprocessor.timeout=3600 \
  notebooks/rlm-dspy-modal.ipynb

Security

Secrets Management: All credentials stored in Modal secrets, never in code
Sandbox Isolation: Code executes in Modal's isolated sandbox environment
Local .env: Contains API keys - is gitignored and should never be committed
Shadow Protection: config.py guards against modal.py naming conflicts

Troubleshooting

"Planner LM not configured"

Set DSPY_LM_MODEL and DSPY_LLM_API_KEY in .env, then restart your shell or kernel.

"Modal sandbox process exited unexpectedly"

uv run modal token set
uv run modal volume list

"No module named 'modal'"

uv sync

Modal package shadowing

Remove any modal.py file or __pycache__/modal.*.pyc in the working directory.

References

RLM Paper: Recursive Language Models (also available in rlm-volume-dspy at /data/rlm-knowledge/)
DSPy Docs: https://dspy-docs.vercel.app/
Modal Docs: https://modal.com/docs
UV Docs: https://docs.astral.sh/uv/
Project Docs: See docs/ directory for detailed guides and architecture documentation

License

This project is licensed under the MIT License.

Contributing

Contributions are welcome! Please see CONTRIBUTING.md for guidelines.

Acknowledgments

This project is based on research from the Recursive Language Models paper by Zhang, Kraska, and Khattab (2025).

Built with:

DSPy - Framework for programming with language models
Modal - Serverless computing for AI
Typer - CLI framework
Claude Code - Agentic coding with Claude

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

agenticfleet

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.5.40

May 23, 2026

0.5.31

May 17, 2026

0.5.3

May 17, 2026

0.5.2

Apr 29, 2026

0.5.1

Apr 27, 2026

0.5.0

Apr 25, 2026

0.4.99

Mar 26, 2026

0.4.98

Mar 17, 2026

0.4.97

Mar 15, 2026

0.4.95

Mar 5, 2026

0.4.94

Mar 3, 2026

0.4.93

Mar 3, 2026

0.4.92

Mar 1, 2026

0.4.9

Feb 27, 2026

0.4.8

Feb 25, 2026

0.4.7

Feb 22, 2026

0.4.6

Feb 20, 2026

0.4.5

Feb 18, 2026

0.4.4

Feb 15, 2026

0.4.3

Feb 15, 2026

0.4.2

Feb 15, 2026

0.4.1

Feb 12, 2026

0.4.0

Feb 12, 2026

This version

0.3.2

Feb 10, 2026

0.3.1

Feb 9, 2026

0.1.0

Feb 7, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fleet_rlm-0.3.2.tar.gz (114.3 kB view details)

Uploaded Feb 10, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

fleet_rlm-0.3.2-py3-none-any.whl (113.4 kB view details)

Uploaded Feb 10, 2026 Python 3

File details

Details for the file fleet_rlm-0.3.2.tar.gz.

File metadata

Download URL: fleet_rlm-0.3.2.tar.gz
Upload date: Feb 10, 2026
Size: 114.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for fleet_rlm-0.3.2.tar.gz
Algorithm	Hash digest
SHA256	`d28df7c1e7548e173490c40d59cbb64bb57e99749c3f5135c1a67c7062fdb668`
MD5	`9dfd89d4418ba959e8bda932e1f53179`
BLAKE2b-256	`8867500925b4f92c458fcea014af9a71f6d0583f435c1898d1e3ead3ee86ddfb`

See more details on using hashes here.

Provenance

The following attestation bundles were made for fleet_rlm-0.3.2.tar.gz:

Publisher: release.yml on Qredence/fleet-rlm

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: fleet_rlm-0.3.2.tar.gz
- Subject digest: d28df7c1e7548e173490c40d59cbb64bb57e99749c3f5135c1a67c7062fdb668
- Sigstore transparency entry: 937089900
- Sigstore integration time: Feb 10, 2026
Source repository:
- Permalink: Qredence/fleet-rlm@2e30103834804205ff5587f47c5428fdad7ae919
- Branch / Tag: refs/heads/main
- Owner: https://github.com/Qredence
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@2e30103834804205ff5587f47c5428fdad7ae919
- Trigger Event: workflow_dispatch

File details

Details for the file fleet_rlm-0.3.2-py3-none-any.whl.

File metadata

Download URL: fleet_rlm-0.3.2-py3-none-any.whl
Upload date: Feb 10, 2026
Size: 113.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for fleet_rlm-0.3.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1d46e3bd397f613cb6a670f0ed6bfbe1d6d02d8e763fedd8a529534f130c37cd`
MD5	`6ab88fbda453b09a81daf8950d2bdae7`
BLAKE2b-256	`d689099ba1d720046b4566804f1bebbdd5ea6721f059e6124b8bd644f7f9f1b1`

See more details on using hashes here.

Provenance

The following attestation bundles were made for fleet_rlm-0.3.2-py3-none-any.whl:

Publisher: release.yml on Qredence/fleet-rlm

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: fleet_rlm-0.3.2-py3-none-any.whl
- Subject digest: 1d46e3bd397f613cb6a670f0ed6bfbe1d6d02d8e763fedd8a529534f130c37cd
- Sigstore transparency entry: 937089904
- Sigstore integration time: Feb 10, 2026
Source repository:
- Permalink: Qredence/fleet-rlm@2e30103834804205ff5587f47c5428fdad7ae919
- Branch / Tag: refs/heads/main
- Owner: https://github.com/Qredence
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@2e30103834804205ff5587f47c5428fdad7ae919
- Trigger Event: workflow_dispatch

fleet-rlm 0.3.2

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

fleet-rlm

Overview

Using dspy.RLM with Claude Code

Why Use RLM with Claude Code?

Agent Architecture

Quick Start with Claude Code

Sub-LLM Patterns

Features

Technology Stack

Installation

Scaffold Installation

Quick Start

1. Configure Environment

2. Setup Modal

3. Run CLI Commands

CLI Commands

Architecture

Package Structure

Module Descriptions

RLM Patterns

Pattern 1: Navigate → Query → Synthesize

Pattern 2: Parallel Chunk Processing

Pattern 3: Stateful Multi-Step

Testing

Development

Jupyter Notebook

Security

Troubleshooting

"Planner LM not configured"

"Modal sandbox process exited unexpectedly"

"No module named 'modal'"

Modal package shadowing

References

License

Contributing

Acknowledgments

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance