Skip to main content

No project description provided

Project description

mypy and pytests black-lint Cumulative Clones

dgxarley

Tooling for the DGX Arley K3s inference cluster — integration tests, streaming utilities, and CLI entry points for SGLang, Ollama, and OpenWebUI services.

What's included

CLI tools

Command Description
sglang-raw Interactive SSE stream viewer with dual-panel Rich display (interpreted output + raw JSON chunks)
sglang-test Direct SGLang client with sequential and parallel load testing (live Rich TUI)
openwebui-test OpenWebUI / LLM client with preset management and streaming
ollama-test Ollama API health, model, embedding, and chat completions tests

Libraries

Module Description
dgxarley.integration.repetition_detector Offline n-gram, sentence, and loop repetition analysis for completed LLM outputs
dgxarley.integration.streaming_repetition_guard Real-time repetition detection for token streams with configurable thresholds

Installation

pip install dgxarley

Quick start

from dgxarley.integration.repetition_detector import detect_repetition

report = detect_repetition(llm_output)
print(report.summary())
# [LOW] score=0.12 — N-Gram 'this is a test' x2
from dgxarley.integration.streaming_repetition_guard import RepetitionGuard

guard = RepetitionGuard()
for chunk in llm_stream:
    token = chunk.choices[0].delta.content or ""
    result = guard.feed(token)
    if result.should_stop:
        print(f"STOP: {result.reason}")
        break

Requirements

  • Python >= 3.14

Source & documentation

Full documentation, network architecture, and Ansible playbooks: GitHub

License

This project is licensed under the LGPL where applicable/possible — see LICENSE.md. Some files/parts may use other licenses: MIT | GPL | LGPL. Always check per‑file headers/comments.

Authors

  • Repo owner (primary author)
  • Additional attributions are noted inline in code comments

Acknowledgments

  • Inspirations and snippets are referenced in code comments where appropriate.

⚠️ Note

This is a development/experimental project. For production use, review security settings, customize configurations, and test thoroughly in your environment. Provided "as is" without warranty of any kind, express or implied, including but not limited to the warranties of merchantability, fitness for a particular purpose and noninfringement. In no event shall the authors or copyright holders be liable for any claim, damages or other liability, whether in an action of contract, tort or otherwise, arising from, out of or in connection with the software or the use or other dealings in the software. Use at your own risk.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dgxarley-0.0.4.tar.gz (55.3 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dgxarley-0.0.4-py3-none-any.whl (60.0 kB view details)

Uploaded Python 3

File details

Details for the file dgxarley-0.0.4.tar.gz.

File metadata

  • Download URL: dgxarley-0.0.4.tar.gz
  • Upload date:
  • Size: 55.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: Hatch/1.16.5 cpython/3.14.3 HTTPX/0.28.1

File hashes

Hashes for dgxarley-0.0.4.tar.gz
Algorithm Hash digest
SHA256 2e5b7324b5e25ac74a69cc5b08e910f2b9315c09484c3b04c59ef5ff2effc0d8
MD5 42bb033acac0d991fddcaa384ac2fd2c
BLAKE2b-256 4b534d2bc702bdcb010252b19842b303bb5f9118450810202a31f4734203fac9

See more details on using hashes here.

File details

Details for the file dgxarley-0.0.4-py3-none-any.whl.

File metadata

  • Download URL: dgxarley-0.0.4-py3-none-any.whl
  • Upload date:
  • Size: 60.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: Hatch/1.16.5 cpython/3.14.3 HTTPX/0.28.1

File hashes

Hashes for dgxarley-0.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 f77edfa2295b8ce8df9e3a4c6a9ea49657fb215eaec6b2dc9fe2bc81db898142
MD5 d3521cda516b19d24453d8b24a6c9fdd
BLAKE2b-256 0793ac4a0c1f62401aef6b4e118b4ab63d3fdae84e6d51205a5aa53e2ee54562

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page