
MCP server for recursive LLM reasoning—load context, iterate with search/code/think tools, converge on answers


Aleph

License: MIT · Python 3.10+

Aleph is an MCP server and skill for Recursive Language Models (RLMs). It keeps working state — search indexes, code execution, evidence, recursion — in a Python process outside the prompt window, so the LLM reasons iteratively over repos, logs, documents, and data without burning context on raw content.

+-----------------+    tool calls     +-----------------------------+
|   LLM client    | ---------------> |  Aleph (Python process)     |
| (context budget)| <--------------- |  search / peek / exec / sub |
+-----------------+   small results  +-----------------------------+

Why Aleph:

  • Load once, reason many times. Data lives in Aleph memory, not the prompt.
  • Compute server-side. exec_python runs code over the full context and returns only derived results.
  • Recurse. Sub-queries and recipes split complex work across multiple reasoning passes.
  • Persist. Save sessions and resume long investigations later.

Quick Start

pip install "aleph-rlm[mcp]"
aleph-rlm install --profile claude   # or: codex, portable, api
aleph-rlm doctor                     # verify everything is wired up

Then restart your MCP client and confirm Aleph is available:

get_status()
list_contexts()

The optional /aleph (Claude Code) or $aleph (Codex) skill shortcut starts a structured RLM workflow. Install docs/prompts/aleph.md into your client's command/skill folder — see MCP_SETUP.md for exact paths.

Entry Points

  • aleph (aleph.mcp.local_server:main): MCP server. This is what MCP clients launch. Exposes 30+ tools for context management, search, code execution, reasoning, recursion, and actions.
  • aleph-rlm (aleph.cli:main): installer and CLI. install, configure, doctor, and uninstall set up MCP clients; also run (single query), shell (interactive REPL), and serve (start the MCP server manually).

Install Profiles

aleph-rlm install asks which sub-query profile to use. Profiles configure the nested backend that sub_query and sub_query_batch spawn for recursive reasoning.

  • portable: no nested backend; choose later or rely on auto-detection.
  • claude: Claude CLI with --model opus, --effort low, shared session enabled.
  • codex: Codex MCP with gpt-5.4, low reasoning effort, shared session enabled.
  • api: OpenAI-compatible API; set ALEPH_SUB_QUERY_API_KEY and ALEPH_SUB_QUERY_MODEL.

aleph-rlm install claude-code --profile claude
aleph-rlm configure --profile codex   # overwrite existing config

See docs/CONFIGURATION.md for all env vars, CLI flags, and runtime configure(...) options.

First Workflow

Aleph is best when you load data once, do the heavy work inside Aleph, and only pull back compact answers.

load_file(path="/absolute/path/to/large_file.log", context_id="doc")
search_context(pattern="ERROR|WARN", context_id="doc")
peek_context(start=1, end=60, unit="lines", context_id="doc")
exec_python(code="""
errors = [line for line in ctx.splitlines() if "error" in line.lower()]
result = {
    "error_count": len(errors),
    "first_error": errors[0] if errors else None,
}
""", context_id="doc")
get_variable(name="result", context_id="doc")
save_session(context_id="doc", path=".aleph/doc.json")

The important habit is to compute server-side. Do not treat get_variable("ctx") as the default path. Search, filter, chunk, or summarize first, then retrieve a small result.
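The exec_python habit above can be sketched in plain Python. The code runs next to the data, and only a compact summary travels back; the sample log lines here are made up for illustration:

```python
# Sketch of the exec_python pattern: the full context stays server-side;
# only the small derived `result` is retrieved afterwards via get_variable.
ctx = """\
2024-05-01 12:00:01 INFO  service started
2024-05-01 12:00:02 ERROR connection refused
2024-05-01 12:00:03 WARN  retrying
2024-05-01 12:00:04 ERROR timeout after 3 retries
"""

# Filter inside the process instead of pulling the raw log into the prompt.
errors = [line for line in ctx.splitlines() if "error" in line.lower()]
result = {
    "error_count": len(errors),
    "first_error": errors[0] if errors else None,
}
print(result["error_count"])  # the compact answer, not the raw log
```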

If you want terminal-only mode instead of MCP, use:

aleph run "Summarize this log" --provider cli --model codex --context-file app.log

Local Models (llama.cpp)

Aleph can use a local model instead of a cloud API. This runs the full RLM loop — search, code execution, convergence — entirely on your machine with zero API cost.

Prerequisites: llama.cpp and a GGUF model file.

# Install llama.cpp
brew install llama.cpp          # Mac
winget install ggml.LlamaCpp    # Windows

# Start the server with your model
llama-server -m /path/to/model.gguf -c 16384 -ngl 99 --port 8080

Point Aleph at the running server:

export ALEPH_PROVIDER=llamacpp
export ALEPH_LLAMACPP_URL=http://127.0.0.1:8080
export ALEPH_MODEL=local
aleph

Or let Aleph start the server automatically:

export ALEPH_PROVIDER=llamacpp
export ALEPH_LLAMACPP_MODEL=/path/to/model.gguf
export ALEPH_LLAMACPP_CTX=16384
export ALEPH_MODEL=local
aleph

Tested with Qwen 3.5 9B (Q8_0, ~9 GB). Any GGUF model works — larger models give better results in the RLM loop. Models with reasoning/thinking support (Qwen 3.5, QwQ, etc.) are handled automatically. See CONFIGURATION.md for all ALEPH_LLAMACPP_* variables.
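Before pointing Aleph at a manually started server, it can help to confirm the server is actually reachable. llama-server exposes a /health endpoint; the helper below is a small convenience sketch, not part of Aleph:

```python
import urllib.request
import urllib.error

def server_is_up(base_url: str, timeout: float = 2.0) -> bool:
    """Return True if llama-server answers its /health endpoint."""
    try:
        with urllib.request.urlopen(f"{base_url}/health", timeout=timeout) as resp:
            return resp.status == 200
    except (urllib.error.URLError, OSError):
        # Connection refused, DNS failure, or timeout: treat as not up.
        return False

print(server_is_up("http://127.0.0.1:8080"))
```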

Common Workloads

  • Large log analysis: load big files, trace patterns, correlate events.
  • Codebase navigation: search symbols, inspect routes, trace behavior.
  • Data exploration: analyze JSON, CSV, and mixed text with Python helpers.
  • Long document review: load PDFs, Word docs, HTML, and compressed logs.
  • Recursive investigations: split work into sub-queries instead of one giant prompt.
  • Long-running sessions: save and resume memory packs across sessions.
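The data-exploration workload is the same server-side pattern applied to structured text: parse inside the process, send back only an aggregate. A minimal sketch with made-up JSON-lines data (in Aleph this would be the loaded ctx):

```python
import json
from collections import Counter

# Hypothetical JSON-lines context, standing in for a loaded ctx.
ctx = "\n".join(json.dumps(rec) for rec in [
    {"service": "api", "status": 500},
    {"service": "api", "status": 200},
    {"service": "worker", "status": 500},
])

# Aggregate server-side; only the counts travel back to the model.
by_service = Counter(
    json.loads(line)["service"]
    for line in ctx.splitlines()
    if json.loads(line)["status"] >= 500
)
print(dict(by_service))  # → {'api': 1, 'worker': 1}
```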

Core Tools

  • Load context (load_context, load_file, list_contexts, diff_contexts): put data into Aleph memory and inspect what is loaded.
  • Navigate (search_context, semantic_search, peek_context, chunk_context, rg_search): find the relevant slice before asking for an answer.
  • Compute (exec_python, get_variable): run code over the full context and retrieve only the derived result.
  • Reason (think, evaluate_progress, get_evidence, finalize): structure progress and close out with evidence.
  • Orchestrate (configure, validate_recipe, estimate_recipe, run_recipe, run_recipe_code): switch backends and automate repeated reasoning patterns.
  • Persist (save_session, load_session): keep long investigations outside the prompt window.

Inside exec_python, Aleph also exposes helpers such as search(...), chunk(...), lines(...), sub_query(...), sub_query_batch(...), and sub_aleph(...). Recursive helpers live inside the REPL, not as top-level MCP tools.
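As a rough mental model of chunking plus batched sub-queries, the sketch below splits a context and fans a question out per chunk. simple_chunks and fake_sub_query are illustrative stand-ins, not Aleph's actual helper signatures:

```python
# Mental model of chunk(...) + sub_query_batch(...) inside exec_python.
# Both helpers below are hypothetical stand-ins, NOT Aleph's API.

def simple_chunks(text: str, size: int) -> list[str]:
    """Split text into fixed-size character chunks."""
    return [text[i:i + size] for i in range(0, len(text), size)]

def fake_sub_query(prompt: str, chunk: str) -> str:
    """Stand-in for a nested model call; here it just reports chunk length."""
    return f"{prompt}: {len(chunk)} chars"

ctx = "x" * 2500
answers = [fake_sub_query("summarize", c) for c in simple_chunks(ctx, 1000)]
print(answers)  # three per-chunk results instead of one giant prompt
```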

Safety Model

Aleph is built to keep raw context out of the model window unless you explicitly pull it back:

  • Tool responses are capped and truncated.
  • get_variable("ctx") is policy-aware and should not be your default path.
  • exec_python stdout, stderr, and return values are bounded independently.
  • ALEPH_CONTEXT_POLICY=isolated adds stricter session export/import rules and more defensive defaults.

The safest pattern is always:

  1. Load the large context into Aleph memory.
  2. Search or compute inside Aleph.
  3. Retrieve only the small result you need.
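The capping behavior behind the first bullet can be pictured as a simple truncation guard. The 2000-character limit below is an arbitrary illustration, not Aleph's actual bound:

```python
def cap_output(text: str, limit: int = 2000) -> str:
    """Truncate tool output to a fixed budget, noting how much was dropped."""
    if len(text) <= limit:
        return text
    dropped = len(text) - limit
    return text[:limit] + f"\n... [truncated {dropped} chars]"

big = "x" * 5000
capped = cap_output(big)
print(len(capped))  # bounded regardless of input size
```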

Development

git clone https://github.com/Hmbown/aleph.git
cd aleph
pip install -e ".[dev,mcp]"
pytest tests/ -v
ruff check aleph/ tests/

License

MIT
