Recursive Language Models with DSPy + Modal for secure long-context code execution

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

agenticfleet

These details have not been verified by PyPI

Project links

Documentation

Project description

fleet-rlm

Secure, cloud-sandboxed Recursive Language Models (RLM) with DSPy and Modal.

fleet-rlm provides a production-ready implementation of Recursive Language Modeling aligned with the DSPy RLM API. It gives your AI agent a secure "computer" in the cloud to read, search, and analyze massive datasets without local resource constraints.

Paper | Contributing | Docs

Architecture

graph TB
    subgraph entry ["🚪 Entry Points"]
        CLI["CLI (Typer)"]
        API["FastAPI<br/>(WS/REST)"]
        TUI["Ink TUI<br/>(stdio bridge)"]
        MCP["MCP Server"]
    end

    subgraph orchestration ["🧠 Orchestration Layer"]
        Agent["RLMReActChatAgent<br/>(dspy.Module)"]
        History["Chat History"]
        Memory["Core Memory<br/>(Persona/Human/Scratchpad)"]
        DocCache["Document Cache"]
    end

    subgraph tools ["🔧 ReAct Tools"]
        DocTools["📄 load_document<br/>read_file_slice<br/>chunk_by_*"]
        RecursiveTools["🔄 rlm_query<br/>llm_query<br/>(recursive delegation)"]
        ExecTools["⚡ execute_code<br/>edit_file<br/>search_code"]
    end

    subgraph execution ["⚙️ Execution Layer"]
        Interpreter["ModalInterpreter<br/>(JSON protocol)"]
        Profiles["Execution Profiles:<br/>ROOT | DELEGATE | MAINTENANCE"]
    end

    subgraph cloud ["☁️ Modal Cloud"]
        Sandbox["Sandbox Driver<br/>(Python REPL)"]
        Volume[("💾 Persistent Volume<br/>/data/<br/>• workspaces<br/>• artifacts<br/>• memory<br/>• session state")]
    end

    CLI --> Agent
    API --> Agent
    TUI --> Agent
    MCP --> Agent

    Agent --> History
    Agent --> Memory
    Agent --> DocCache

    Agent --> DocTools
    Agent --> RecursiveTools
    Agent --> ExecTools

    DocTools --> Interpreter
    RecursiveTools --> Interpreter
    ExecTools --> Interpreter

    Interpreter --> Profiles
    Interpreter -->|"stdin/stdout<br/>JSON commands"| Sandbox
    Sandbox -->|"read/write"| Volume

    style entry fill:#e3f2fd,stroke:#1976d2,stroke-width:2px
    style orchestration fill:#f3e5f5,stroke:#7b1fa2,stroke-width:2px
    style tools fill:#fff3e0,stroke:#f57c00,stroke-width:2px
    style execution fill:#e8f5e9,stroke:#388e3c,stroke-width:2px
    style cloud fill:#fce4ec,stroke:#c2185b,stroke-width:2px

Layers:

🚪 Entry Points → 🧠 Orchestration → 🔧 Tools → ⚙️ Execution → ☁️ Modal Cloud

Features

Interactive Agent: RLMReActChatAgent (a dspy.Module) combines fast, interactive chat with deep, recursive task execution via rlm_query.
DSPy Aligned: Implements dspy.RLM, dspy.Module, and dspy.Tool interfaces — compatible with DSPy optimizers (BootstrapFewShot, MIPROv2).
Secure Sandbox: Code runs in isolated Modal containers with persistent storage volumes, execution profiles, and sensitive data redaction.
Recursive Delegation: All delegate tools (rlm_query, analyze_long_document, grounded_answer, etc.) spawn true recursive sub-agents via spawn_delegate_sub_agent() with unified depth enforcement.
PDF Ingestion: Native document loading via MarkItDown with pypdf fallback; OCR guidance for scanned PDFs.
Session State: Per-workspace, per-user session persistence with manifests stored on Modal volumes.
MCP Server: Expose fleet-rlm capabilities as an MCP tool server via serve-mcp.
Observability: Real-time streaming of thoughts, tool execution, trajectory normalization, and structured logging.

Quick Start

1. Install

uv pip install fleet-rlm

Optional extras for server and MCP support:

uv pip install fleet-rlm[server]   # FastAPI server + WebSocket
uv pip install fleet-rlm[mcp]      # MCP server
uv pip install fleet-rlm[full]     # All extras

2. Configure

Set up your Modal and LLM credentials:

modal setup
modal volume create rlm-volume-dspy
modal secret create LITELLM DSPY_LM_MODEL=openai/gemini-3-pro-preview DSPY_LLM_API_KEY=sk-...

3. Run

Interactive Chat (OpenTUI):

# Requires OpenTUI / Bun
fleet-rlm code-chat --opentui

Standalone Interactive Chat (Ink):

# Prefers Ink UI; falls back to Python UI
fleet

# Force a specific runtime
fleet --ui ink
fleet --ui python

One-shot Tasks:

# Basic question
fleet-rlm run-basic --question "What are the first 12 Fibonacci numbers?"

# Document analysis
fleet-rlm run-architecture --docs-path docs/architecture.md --query "Extract all components"

Servers:

# API server (FastAPI + WebSocket)
uv run fleet-rlm serve-api --port 8000

# MCP server
fleet-rlm serve-mcp --transport stdio

fleet and fleet-rlm code-chat serve different interactive paths:

fleet = standalone bridge chat launcher (Ink preferred, Python fallback)
fleet-rlm code-chat = OpenTUI runtime (OpenTUI/Bun required)

Development Setup

# Clone and install
git clone https://github.com/qredence/fleet-rlm.git
cd fleet-rlm
uv sync --extra dev

# With server/MCP support
uv sync --extra dev --extra server --extra mcp

# Build Ink frontend bundle for `fleet --ui ink`
cd tui-ink
npm install
npm run build
npm run test
cd ..

# Copy environment template
cp .env.example .env

# Quality gate
uv run ruff check src tests
uv run ruff format --check src tests
uv run ty check src
uv run pytest -q

# Auto-fix formatting when needed
uv run ruff format src tests

Documentation

Concepts — Core architecture (Agent, RLM, Sandbox)
User Flows — Interaction diagrams (Chat, Tools, Delegation)
Architecture — System components and hierarchy
Tutorials — Step-by-step lessons
How-To Guides — Installation, deployment, troubleshooting
CLI Reference — Full CLI command reference
HTTP API Reference — Server endpoints and WebSocket protocol
Source Layout — Package structure guide

Contributing

We welcome contributions! Please see our Contribution Guide and run the quality gate before submitting:

uv run ruff check src tests
uv run ruff format --check src tests
uv run ty check src
uv run pytest -q

License

MIT License — see LICENSE.

Based on Recursive Language Modeling research by Alex L. Zhang (MIT CSAIL), Omar Khattab (Stanford), and Tim Kraska (MIT).

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

agenticfleet

These details have not been verified by PyPI

Project links

Documentation

Release history Release notifications | RSS feed

0.5.40

May 23, 2026

0.5.31

May 17, 2026

0.5.3

May 17, 2026

0.5.2

Apr 29, 2026

0.5.1

Apr 27, 2026

0.5.0

Apr 25, 2026

0.4.99

Mar 26, 2026

0.4.98

Mar 17, 2026

0.4.97

Mar 15, 2026

0.4.95

Mar 5, 2026

0.4.94

Mar 3, 2026

0.4.93

Mar 3, 2026

0.4.92

Mar 1, 2026

0.4.9

Feb 27, 2026

0.4.8

Feb 25, 2026

0.4.7

Feb 22, 2026

0.4.6

Feb 20, 2026

This version

0.4.5

Feb 18, 2026

0.4.4

Feb 15, 2026

0.4.3

Feb 15, 2026

0.4.2

Feb 15, 2026

0.4.1

Feb 12, 2026

0.4.0

Feb 12, 2026

0.3.2

Feb 10, 2026

0.3.1

Feb 9, 2026

0.1.0

Feb 7, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fleet_rlm-0.4.5.tar.gz (216.0 kB view details)

Uploaded Feb 18, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

fleet_rlm-0.4.5-py3-none-any.whl (274.9 kB view details)

Uploaded Feb 18, 2026 Python 3

File details

Details for the file fleet_rlm-0.4.5.tar.gz.

File metadata

Download URL: fleet_rlm-0.4.5.tar.gz
Upload date: Feb 18, 2026
Size: 216.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for fleet_rlm-0.4.5.tar.gz
Algorithm	Hash digest
SHA256	`43995c318305562f7f33dce7a5a225d4ba70ca1a8bf0eb2fca2ef8cd4307e9a2`
MD5	`ec4c3d1d5ea3118d1f9380a43c919d87`
BLAKE2b-256	`24436b801ea2584738ba1a9c2586ba353cc6d2948ea6a65affd7a2e2fb181687`

See more details on using hashes here.

Provenance

The following attestation bundles were made for fleet_rlm-0.4.5.tar.gz:

Publisher: release.yml on Qredence/fleet-rlm

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: fleet_rlm-0.4.5.tar.gz
- Subject digest: 43995c318305562f7f33dce7a5a225d4ba70ca1a8bf0eb2fca2ef8cd4307e9a2
- Sigstore transparency entry: 962548530
- Sigstore integration time: Feb 18, 2026
Source repository:
- Permalink: Qredence/fleet-rlm@6dcc2cc3c54f7563b15d1c22eda00b6704355718
- Branch / Tag: refs/heads/main
- Owner: https://github.com/Qredence
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@6dcc2cc3c54f7563b15d1c22eda00b6704355718
- Trigger Event: workflow_dispatch

File details

Details for the file fleet_rlm-0.4.5-py3-none-any.whl.

File metadata

Download URL: fleet_rlm-0.4.5-py3-none-any.whl
Upload date: Feb 18, 2026
Size: 274.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for fleet_rlm-0.4.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9d84fc16951485c6258bc1e571269dbb1a83b439e8e3a15448a2f8327c27f1e4`
MD5	`eb184b90eaa2417fb02c8dffe1071380`
BLAKE2b-256	`43630ea389be664467546bc509feb5ad5124e84af41d7a2a224c1518e53a60c5`

See more details on using hashes here.

Provenance

The following attestation bundles were made for fleet_rlm-0.4.5-py3-none-any.whl:

Publisher: release.yml on Qredence/fleet-rlm

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: fleet_rlm-0.4.5-py3-none-any.whl
- Subject digest: 9d84fc16951485c6258bc1e571269dbb1a83b439e8e3a15448a2f8327c27f1e4
- Sigstore transparency entry: 962548533
- Sigstore integration time: Feb 18, 2026
Source repository:
- Permalink: Qredence/fleet-rlm@6dcc2cc3c54f7563b15d1c22eda00b6704355718
- Branch / Tag: refs/heads/main
- Owner: https://github.com/Qredence
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@6dcc2cc3c54f7563b15d1c22eda00b6704355718
- Trigger Event: workflow_dispatch

fleet-rlm 0.4.5

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

fleet-rlm

Architecture

Features

Quick Start

1. Install

2. Configure

3. Run

Development Setup

Documentation

Contributing

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance