Information-theoretic context optimization for AI coding agents. Knapsack-optimal token budgeting, Shannon entropy scoring, SimHash dedup, predictive pre-fetch. MCP server.
Project description
Entroly
Your AI coding tool wastes 40–60% of its context window on irrelevant files. Entroly fixes that.
When you ask Cursor to "fix the SQL injection bug," it stuffs your context window with README.md, CSS, changelogs, and duplicate boilerplate — then drops the actual database code because it ran out of room.
Entroly is an MCP server that selects the mathematically optimal subset of context for every query. Your AI sees the right code, not all the code.
https://github.com/juyterman1000/entroly/raw/main/docs/assets/entroly_demo.mp4
Live engine metrics from entroly demo — real Rust engine, zero mocks.
What You Get
| Metric | Without Entroly | With Entroly |
|---|---|---|
| Context relevance | ~50% (FIFO truncation) | 91% (knapsack-optimized) |
| Cost per API call | $0.0115 | $0.0044 (62% savings) |
| Duplicate detection | None | Automatic (SimHash) |
| Selection speed | N/A | 320µs (sub-millisecond) |
| Crash recovery | Lost | Checkpoint & resume |
At scale (100K+ files, 128K token budget), these savings compound: the more noise there is to filter, the bigger the improvement.
Quick Start (30 seconds)
pip install entroly
cd your-project
entroly init # auto-detects your AI tool, writes mcp.json
# Restart your AI tool — done.
🐳 Mac / Windows users: Entroly auto-detects Docker Desktop and runs the Rust engine inside a container — zero compilation, zero Rust toolchain, just works. Make sure Docker Desktop is running when you start your AI tool.
entroly init auto-detects Cursor, VS Code, Windsurf, and Claude Code and writes the correct MCP config for you. For other MCP clients:
Manual MCP configuration
Claude Code:
claude mcp add entroly -- entroly serve
Any MCP client (mcp.json):
{
"mcpServers": {
"entroly": {
"command": "entroly",
"args": ["serve"]
}
}
}
npm (for tools that prefer npx):
npx -y entroly-mcp
Tip: Run entroly demo to see a side-by-side before/after comparison using the real Rust engine.
How It Works
Entroly sits between your AI tool and the LLM as an MCP server. When your agent asks for context, Entroly:
- Scores every code fragment on 4 dimensions (recency, frequency, semantic similarity, information density)
- Deduplicates via 64-bit SimHash fingerprints — catches near-identical code in O(1)
- Solves the 0/1 Knapsack Problem to select the optimal subset within your token budget
- Learns from feedback — fragments that lead to good outputs get boosted next time
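The selection step above can be sketched in a few lines of Python. This is a simplified stand-in, not the real API: the `Fragment` type and the greedy density heuristic are illustrative (the actual engine runs in Rust and uses exact DP for small inputs).

```python
from dataclasses import dataclass

@dataclass
class Fragment:
    """Hypothetical stand-in for Entroly's internal fragment type."""
    text: str
    relevance: float  # combined 4-dimension score in [0, 1]
    tokens: int

def select_context(fragments, budget):
    """Greedy knapsack by relevance density: pick the fragments that
    deliver the most relevance per token until the budget is spent."""
    picked, used = [], 0
    for f in sorted(fragments, key=lambda f: f.relevance / f.tokens, reverse=True):
        if used + f.tokens <= budget:
            picked.append(f)
            used += f.tokens
    return picked

frags = [
    Fragment("db.py: raw SQL query builder", 0.9, 400),
    Fragment("README.md intro", 0.1, 300),
    Fragment("models.py: User table", 0.7, 500),
]
chosen = select_context(frags, budget=900)
# the high-relevance database code fits; the README is left out
```

The point of the density ordering is that a low-relevance file can still win under naive FIFO truncation simply by being read first; scoring per token removes that bias.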
All computation runs in 100% Rust via PyO3. The Python layer only handles MCP protocol and I/O.
Your AI Tool → MCP (JSON-RPC) → Python (FastMCP) → Rust Engine → Optimal Context
↑
Selection in 320µs
MCP Tools
Once entroly serve is running, your AI agent has access to these tools:
| Tool | What it does |
|---|---|
| optimize_context | Select the optimal context subset for a token budget. The core tool. |
| remember_fragment | Store context with auto-dedup, entropy scoring, and security scanning |
| recall_relevant | Semantic search over stored fragments via multi-probe LSH |
| record_outcome | Feed the RL loop: mark fragments as helpful or unhelpful |
| explain_context | See exactly why each fragment was included or excluded |
| prefetch_related | Predict what files will be needed next (import analysis + co-access) |
| checkpoint_state | Save full session state for crash recovery |
| resume_state | Restore from the latest checkpoint |
| entroly_dashboard | Live ROI metrics — cost saved, latency, compression ratio |
| get_stats | Comprehensive session statistics |
| scan_fragment | Security scan (SQL injection, hardcoded secrets, unsafe patterns) |
| analyze_health | Codebase health report (clone detection, dead code, god files) |
Example: optimize_context
optimize_context(token_budget=128000, query="fix payment bug")
→ {
"selected_fragments": [...], // The good stuff
"tokens_saved_this_call": 42000,
"sufficiency": 0.91, // 91% of referenced symbols included
"hallucination_risk": "low",
"optimization_stats": {"method": "exact_dp", "budget_utilization": 0.73}
}
Example: entroly_dashboard
entroly_dashboard()
→ {
"money": {
"cost_per_call_without_entroly": "$0.0115",
"cost_per_call_with_entroly": "$0.0044",
"savings_pct": "62%"
},
"performance": {"avg_optimize_latency": "320µs"},
"bloat_prevention": {"context_compression": "39%", "duplicates_caught": 12}
}
Try It (See the Value in 5 Seconds)
pip install entroly
entroly demo
This runs a simulated coding session (fixing a SQL injection bug) and shows you side-by-side what happens with and without Entroly. Uses the real Rust engine — zero mocks.
Architecture
Hybrid Rust + Python. CPU-intensive math runs in Rust via PyO3 for 50-100x speedup. MCP protocol and orchestration run in Python via FastMCP.
| Component | What it does | How |
|---|---|---|
| Knapsack Optimizer | Selects optimal context subset | Exact DP (N ≤ 2000) or greedy (N > 2000) |
| Entropy Scorer | Measures information density | Shannon entropy + boilerplate + cross-fragment redundancy |
| SimHash Dedup | Catches near-duplicates in O(1) | 64-bit fingerprints, Hamming threshold ≤ 3 |
| Multi-Probe LSH | Sub-linear semantic recall | 12-table LSH with multi-probe queries |
| Dependency Graph | Pulls related code together | Symbol table + import/type/call linking |
| PRISM Optimizer | Adapts weights to your codebase | Anisotropic spectral optimization (4×4 covariance) |
| Feedback Loop | Learns from outcomes | Wilson score confidence intervals |
| Predictive Pre-fetch | Pre-loads likely context | Import analysis + co-access patterns |
| Long-Term Memory | Cross-session recall | Ebbinghaus decay + salience boosting |
| Security Scanner | Finds vulnerabilities | Pattern-based SAST (SQL injection, secrets, unsafe) |
| Health Analyzer | Codebase quality metrics | Clone detection, dead symbols, god files |
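As a rough illustration of the SimHash dedup row above, here is a toy 64-bit SimHash in Python. The whitespace tokenization and BLAKE2b hashing are assumptions for this sketch; the Rust engine's tokenizer and hash function are its own.

```python
import hashlib

def simhash64(text: str) -> int:
    """Toy 64-bit SimHash: each token votes +1/-1 per bit position,
    and the fingerprint keeps the sign of each bit's total."""
    v = [0] * 64
    for tok in text.split():
        h = int.from_bytes(hashlib.blake2b(tok.encode(), digest_size=8).digest(), "big")
        for i in range(64):
            v[i] += 1 if (h >> i) & 1 else -1
    return sum(1 << i for i in range(64) if v[i] > 0)

def is_near_duplicate(a: int, b: int, threshold: int = 3) -> bool:
    # Hamming distance between fingerprints; <= 3 differing bits counts as a near-duplicate
    return bin(a ^ b).count("1") <= threshold

# Same token multiset in a different order produces the identical fingerprint,
# which is why trivially reshuffled boilerplate is caught.
fp1 = simhash64("user = db.query(User).get(uid) # fetch user")
fp2 = simhash64("# fetch user\nuser = db.query(User).get(uid)")
```

Comparing two fingerprints is a single XOR and popcount, which is what makes the O(1) duplicate check possible once fingerprints are stored.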
The Math (For the Curious)
Click to expand the mathematical foundations
Multi-Dimensional Relevance Scoring
Each fragment is scored across four dimensions:
r(f) = (w_rec · recency + w_freq · frequency + w_sem · semantic + w_ent · entropy)
/ (w_rec + w_freq + w_sem + w_ent)
× feedback_multiplier
Default weights: recency 0.30, frequency 0.25, semantic 0.25, entropy 0.20.
- Recency: Ebbinghaus forgetting curve — exp(-ln(2) × Δt / half_life), half_life = 15 turns
- Frequency: Normalized access count (spaced repetition boost)
- Semantic similarity: SimHash Hamming distance to query, normalized to [0, 1]
- Information density: Shannon entropy + boilerplate + redundancy
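Putting the formula and the default weights together, a minimal sketch of r(f) looks like this. The frequency normalization (capping at 10 accesses) is an assumption for illustration; the other inputs are taken as already normalized to [0, 1].

```python
import math

def relevance(recency_turns, access_count, semantic_sim, entropy,
              half_life=15, feedback_multiplier=1.0,
              w_rec=0.30, w_freq=0.25, w_sem=0.25, w_ent=0.20):
    """Sketch of the weighted relevance score r(f) from the formula above."""
    # Ebbinghaus decay: half the score after one half-life (15 turns)
    recency = math.exp(-math.log(2) * recency_turns / half_life)
    frequency = min(access_count / 10, 1.0)  # illustrative normalization, not from source
    score = (w_rec * recency + w_freq * frequency
             + w_sem * semantic_sim + w_ent * entropy)
    return score / (w_rec + w_freq + w_sem + w_ent) * feedback_multiplier

# A fragment one half-life old, accessed 5 times, fairly on-topic:
r = relevance(recency_turns=15, access_count=5, semantic_sim=0.8, entropy=0.6)
```

Because the weighted sum is divided by the weight total, custom weights do not need to sum to 1.0; the feedback multiplier then scales the normalized score up or down based on past outcomes.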
Knapsack Context Selection
Maximize: Σ r(fᵢ) · x(fᵢ) for selected fragments
Subject to: Σ c(fᵢ) · x(fᵢ) ≤ B (token budget)
- N ≤ 2000: Exact DP with budget quantization into 1000 bins — O(N × 1000)
- N > 2000: Greedy density sort — O(N log N), Dantzig 0.5-optimality guarantee
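The exact-DP branch can be sketched as a standard 0/1 knapsack over a quantized budget. This is an illustrative Python version (the engine's Rust implementation will differ in detail); `fragments` is a list of `(relevance, token_cost)` pairs.

```python
def knapsack_exact(fragments, budget, bins=1000):
    """0/1 knapsack by DP over a budget quantized into `bins` units,
    giving O(N x bins) time regardless of the raw token budget."""
    scale = max(budget / bins, 1)
    cap = int(budget / scale)
    dp = [0.0] * (cap + 1)
    keep = [[False] * (cap + 1) for _ in fragments]
    for i, (value, cost) in enumerate(fragments):
        c = max(1, round(cost / scale))
        if c > cap:
            continue
        for b in range(cap, c - 1, -1):  # iterate downward so each item is used once
            if dp[b - c] + value > dp[b]:
                dp[b] = dp[b - c] + value
                keep[i][b] = True
    # backtrack to recover which fragments were selected
    sel, b = [], cap
    for i in range(len(fragments) - 1, -1, -1):
        if keep[i][b]:
            sel.append(i)
            b -= max(1, round(fragments[i][1] / scale))
    return sel[::-1], dp[cap]

sel, val = knapsack_exact([(0.9, 400), (0.1, 300), (0.7, 500)], budget=900)
# picks the 400- and 500-token fragments: total value 1.6 within the 900 budget
```

Quantization keeps the DP table bounded at 1000 columns even for a 128K-token budget, at the cost of slight rounding in fragment sizes; above N = 2000 the greedy density sort takes over.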
Task-Aware Budget Multipliers
Bug tracing / debugging → 1.5× budget
Exploration / understanding → 1.3× budget
Refactoring / code review → 1.0× budget
Code generation → 0.7× budget
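Applied to a base budget, the multipliers above work out as follows. The task labels here are hypothetical keys for illustration; per the source, the classification is task-aware rather than user-supplied.

```python
MULTIPLIERS = {  # from the table above; keys are illustrative labels
    "debug": 1.5, "explore": 1.3, "refactor": 1.0, "generate": 0.7,
}

def effective_budget(base_tokens: int, task: str) -> int:
    """Scale the token budget by task type (hypothetical helper)."""
    return round(base_tokens * MULTIPLIERS.get(task, 1.0))

effective_budget(128_000, "debug")     # -> 192000
effective_budget(128_000, "generate")  # -> 89600
```

Debugging gets extra headroom because tracing a bug usually needs more surrounding code, while generation needs less context to stay focused.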
PRISM Spectral Optimizer
Tracks a 4×4 covariance matrix with EMA updates (β=0.95). Jacobi eigendecomposition finds principal axes. Anisotropic spectral gain dampens noisy dimensions — automatic learning rate adaptation without hyperparameter tuning.
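A pure-Python sketch of that tracking step follows. Power iteration stands in for the engine's Jacobi eigendecomposition (it recovers only the dominant axis), and the synthetic samples below deliberately make dimension 0 the noisy one.

```python
import math
import random

def ema_covariance(samples, beta=0.95):
    """Track a 4x4 covariance of score-dimension samples with EMA updates."""
    mean = [0.0] * 4
    cov = [[0.0] * 4 for _ in range(4)]
    for x in samples:
        mean = [beta * m + (1 - beta) * xi for m, xi in zip(mean, x)]
        d = [xi - m for xi, m in zip(x, mean)]
        cov = [[beta * cov[i][j] + (1 - beta) * d[i] * d[j] for j in range(4)]
               for i in range(4)]
    return cov

def dominant_axis(cov, iters=100):
    """Power iteration: returns the noisiest principal direction and its variance
    (a stand-in for the full Jacobi eigendecomposition)."""
    v = [1.0, 0.0, 0.0, 0.0]
    for _ in range(iters):
        w = [sum(cov[i][j] * v[j] for j in range(4)) for i in range(4)]
        n = math.sqrt(sum(wi * wi for wi in w))
        v = [wi / n for wi in w]
    lam = sum(v[i] * sum(cov[i][j] * v[j] for j in range(4)) for i in range(4))
    return v, lam

rng = random.Random(0)
# dimension 0 is noisy (sigma = 1.0), the others are tight (sigma = 0.1)
samples = [[rng.gauss(0, 1.0), rng.gauss(0, 0.1),
            rng.gauss(0, 0.1), rng.gauss(0, 0.1)] for _ in range(500)]
cov = ema_covariance(samples)
axis, variance = dominant_axis(cov)  # axis points mostly along dimension 0
```

Once the noisy axis is identified, an anisotropic gain proportional to 1/sqrt(variance) along each principal direction dampens updates in noisy dimensions while leaving stable ones responsive.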
Configuration
EntrolyConfig(
default_token_budget=128_000, # GPT-4 Turbo equivalent
max_fragments=10_000, # session fragment cap
weight_recency=0.30, # scoring weights (sum to 1.0)
weight_frequency=0.25,
weight_semantic_sim=0.25,
weight_entropy=0.20,
decay_half_life_turns=15, # Ebbinghaus half-life
min_relevance_threshold=0.05, # auto-evict below this
dedup_similarity_threshold=0.92,
prefetch_depth=2,
auto_checkpoint_interval=5, # checkpoint every N tool calls
)
Build from Source
If you want the Rust engine locally (instead of Docker):
git clone https://github.com/juyterman1000/entroly
cd entroly
pip install maturin
cd entroly-core && maturin develop --release && cd ..
pip install -e ".[native]"
entroly init
References
Shannon (1948) • Charikar (2002) SimHash • Ebbinghaus (1885) Forgetting Curve • Dantzig (1957) Greedy Knapsack • Wilson (1927) Score Intervals • ICPC (2025) Prompt Compression • Proximity (2025) LSH Caching • RCC (ICLR 2025) Context Compression
Part of the Ebbiforge Ecosystem
Entroly integrates with hippocampus-sharp-memory for persistent memory and Ebbiforge for TF embeddings. Both are optional — Entroly works standalone.
License
MIT
Download files
Source Distribution
Built Distribution
File details
Details for the file entroly-0.4.1.tar.gz.
File metadata
- Download URL: entroly-0.4.1.tar.gz
- Size: 2.8 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | ad30e032603d6f1d76654765f53257e3be1195fb60724d1895daaaeae9e5c814 |
| MD5 | cef459aa5b67888df1c94feb76ef3574 |
| BLAKE2b-256 | 1a6391257f8b1e3cfbdc821d300971cae8182053a7f31f1ed18a34bc654d6ba2 |
Provenance
The following attestation bundles were made for entroly-0.4.1.tar.gz:
Publisher: entroly-publish.yml on juyterman1000/entroly
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: entroly-0.4.1.tar.gz
- Subject digest: ad30e032603d6f1d76654765f53257e3be1195fb60724d1895daaaeae9e5c814
- Sigstore transparency entry: 1095466617
- Permalink: juyterman1000/entroly@26f60cb4ebd36b00a7722ab5477ca40e33f8de78
- Branch / Tag: refs/heads/main
- Owner: https://github.com/juyterman1000
- Access: public
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: entroly-publish.yml@26f60cb4ebd36b00a7722ab5477ca40e33f8de78
- Trigger Event: workflow_dispatch
File details
Details for the file entroly-0.4.1-py3-none-any.whl.
File metadata
- Download URL: entroly-0.4.1-py3-none-any.whl
- Size: 117.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 20745a61fa65b6dd5bf87f6a01025f7ec457b715a873e9ffa52ae07240d85262 |
| MD5 | ea181e8ae4cf4c3be98be28c0353edc5 |
| BLAKE2b-256 | 0b53d18da4eebec86788e00d283f05f59f68067ff3205fac32be0bd0a1561140 |
Provenance
The following attestation bundles were made for entroly-0.4.1-py3-none-any.whl:
Publisher: entroly-publish.yml on juyterman1000/entroly
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: entroly-0.4.1-py3-none-any.whl
- Subject digest: 20745a61fa65b6dd5bf87f6a01025f7ec457b715a873e9ffa52ae07240d85262
- Sigstore transparency entry: 1095466650
- Permalink: juyterman1000/entroly@26f60cb4ebd36b00a7722ab5477ca40e33f8de78
- Branch / Tag: refs/heads/main
- Owner: https://github.com/juyterman1000
- Access: public
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: entroly-publish.yml@26f60cb4ebd36b00a7722ab5477ca40e33f8de78
- Trigger Event: workflow_dispatch