Skip to main content

Asynchronous Self-Healing KV Cache for Silicon-Native LLMs by GDI Nexus

Project description


title: MLX-ASH-KV emoji: ⚡ colorFrom: green colorTo: gray sdk: gradio sdk_version: 5.16.0 app_file: app.py pinned: false license: apache-2.0

ASH-KV: The Self-Healing Middleware for LLMs

Hardware License Version Company

ASH-KV is a high-performance, hardware-aware middleware layer designed for High-Assurance Inference. Developed by GDI Nexus, it surgically intercepts and corrects the KV cache at the silicon level, preventing logical drift and clinical hallucinations with zero detectable latency.


🏛️ Core Value Pillars

⚡ Zero-Latency Integrity

Surgical KV cache mutation at Metal (Apple Silicon) and CUDA (NVIDIA) speeds. Our Fused Kernels ensure that the "Immune System" adds virtually 0% overhead to inference throughput.

🔌 Hardware Agnostic (Universal HAL)

The Hardware Abstraction Layer (HAL) automatically detects your silicon and hot-swaps between MLX and PyTorch backends. The same code runs on an M4 MacBook or an NVIDIA H100 server.

🛡️ Zero-Shot Healing (Universal Tensor Math)

No hardcoded rules. ASH-KV monitors Attention Manifold Entropy (Varentropy) in real-time. By detecting mathematical uncertainty at the tensor level, it prunes logical drift across any domain—coding, medicine, or creative writing.

♾️ Infinite Horizon (NVMe Paging)

Break the VRAM ceiling. ASH-KV dynamically offloads "Cold" context chunks to NVMe storage, allowing for 100k+ token windows on consumer-grade hardware without OOM crashes.


🚀 Quick Start

1. Installation

pip install mlx-ash-kv

2. Corporate Integration (3 Lines of Code)

Integrate ASH-KV into any production pipeline to add an immediate safety layer.

from mlx_ash_kv.api import protect

# Wrap your existing model with the ASH-KV shield
protected_model, cache, shield, proxies = protect(model, sensitivity=0.85)

# Inference continues normally, but with real-time surgical healing

🛠️ Command Center (CLI)

ASH-KV comes with a professional CLI for systems verification and benchmarking.

  • ash-kv install: Verify hardware drivers, silicon backend, and NVMe Paging Stress Test.
  • ash-kv benchmark: Run the 100-case "Hard Truth" evaluation suite.
  • ash-kv monitor: Launch the Live Diagnostic TUI to see layer-wise health and [HOT/WARM] memory distribution.
  • ash-kv demo: Launch the Gradio B2B Reliability Playground.

🔬 About GDI Nexus

GDI Nexus is a premier AI infrastructure firm. We are the architects of the AI-first era, blending deep data science with elite cloud orchestration. Our mission is to empower global enterprises with autonomous, reliable, and structurally resilient AI ecosystems.

Locations

  • USA (HQ): Woodbridge, VA 22191
  • India: Fingerpost Kandal, Udagamandalam, Tamil Nadu 643001

Contact: contactus@gdinexus.com | www.gdinexus.com


⚠️ DISCLAIMER

ASH-KV is a hardware-level reliability layer designed to assist professionals. It is NOT a substitute for professional medical or legal judgment. All AI-generated outputs, even those "healed" by ASH-KV, must be verified by qualified human professionals before making clinical or legal decisions.


© 2026 GDI Nexus Software Solutions LLP. All rights reserved.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mlx_ash_kv-8.2.2.tar.gz (15.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mlx_ash_kv-8.2.2-py3-none-any.whl (16.9 kB view details)

Uploaded Python 3

File details

Details for the file mlx_ash_kv-8.2.2.tar.gz.

File metadata

  • Download URL: mlx_ash_kv-8.2.2.tar.gz
  • Upload date:
  • Size: 15.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for mlx_ash_kv-8.2.2.tar.gz
Algorithm Hash digest
SHA256 be039af4c45659ba30fc32e5788f7fed23194ebc4581eccbb56a6156cc0a9bca
MD5 a5f71315e095435b12f334f33791185d
BLAKE2b-256 eb1a2531d6d9a879836abecb4e3d06fa1ee00681c181c05666559997cfa5c024

See more details on using hashes here.

File details

Details for the file mlx_ash_kv-8.2.2-py3-none-any.whl.

File metadata

  • Download URL: mlx_ash_kv-8.2.2-py3-none-any.whl
  • Upload date:
  • Size: 16.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for mlx_ash_kv-8.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 6ccd94ea8d5d72b246be7b3428b541bce50084ad1bdc122b6d9cda767f5e2fa1
MD5 8051ff67ee4180c7482b0ba3d20a6db2
BLAKE2b-256 f056d4dce01c36035fe51b1cc750e45408b756fda416126383ff62f58c0b4423

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page