Unified platform for self-hosted LLM inference + enterprise safety governance

Project description

TurboPrivate AI

Features

  • Inference Engine — vLLM/llama.cpp backends with auto hardware detection, TurboQuant-v3 INT4 quantization, QoS routing, dynamic batching, KV/prompt caching
  • Safety Gate — Pre-flight and post-flight verification with 7 verifiers: anti-hacking, PII detection, prompt injection, overengineering, patch analysis, vulnerability scanning, hallucination scoring
  • Memory & RAG — Vector store with cosine similarity search, multi-format document parser (PDF, DOCX, HTML, CSV, JSON, XML, YAML, Markdown)
  • Infrastructure — K3s provisioning via SSH/Terraform, age-encrypted backups, Helm chart deployment
  • Frontend — React SPA dashboard with model management, safety governance, memory search, and settings
  • Auth — JWT-based authentication with RBAC
  • Observability — Prometheus metrics, OpenTelemetry tracing, structured logging
  • CLI — 20+ commands for model management, safety policies, infrastructure, and backups
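The cosine similarity search mentioned under Memory & RAG can be illustrated with a minimal, dependency-free sketch. This is not the package's own code or API: the names `cosine_similarity`, `top_k`, and the in-memory `store` dictionary are hypothetical and exist only to show the ranking idea.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        # A zero vector has no direction; treat it as unrelated.
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(query, store, k=2):
    """Rank stored vectors by similarity to the query, highest first."""
    scored = [(doc_id, cosine_similarity(query, vec)) for doc_id, vec in store.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical toy embeddings, three dimensions each.
store = {
    "doc-a": [1.0, 0.0, 0.0],
    "doc-b": [0.7, 0.7, 0.0],
    "doc-c": [0.0, 0.0, 1.0],
}
results = top_k([1.0, 0.1, 0.0], store, k=2)
```

A brute-force scan like this is linear in the number of stored vectors. Whether the package uses an exact scan or an approximate index is not stated here.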

Quick Start

pip install turboprivate-ai
turbo serve --reload

Changelog

0.1.1 (2026-05-11)

  • Migrated to hatchling build system
  • Fixed missing InferenceEngine import in turbo.inference
  • Fixed TracerProvider bug in OpenTelemetry instrumentation
  • Added structured logging to all exception handlers
  • Consolidated Celery workers into shared worker.celery_app
  • Added CI workflow with ruff linting + pytest
  • Improved graceful shutdown (audit trail flush)
  • Updated dependencies (replaced unstructured with the libraries actually used)

Documentation

  • turbo/cli.py — CLI entry point and command reference
  • turbo/api/main.py — FastAPI application
  • turbo/safety/gate.py — Safety gate configuration
  • frontend/ — React SPA
  • helm/turboprivate/ — Kubernetes deployment

Project details


Download files

Source Distribution

turboprivate_ai-0.1.1.tar.gz (47.0 kB)

Uploaded Source

Built Distribution

turboprivate_ai-0.1.1-py3-none-any.whl (44.0 kB)

Uploaded Python 3

File details

Details for the file turboprivate_ai-0.1.1.tar.gz.

File metadata

  • Download URL: turboprivate_ai-0.1.1.tar.gz
  • Upload date:
  • Size: 47.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.10

File hashes

Hashes for turboprivate_ai-0.1.1.tar.gz
Algorithm Hash digest
SHA256 faa5457043f2742dfd95494c2dc04d14c46801dc98562cb28685380449bb19ce
MD5 033c6f11ee4d95ee661768b9dfc4f09c
BLAKE2b-256 47ea10b4ccd13a26033f12ef894a64b2a444e01a23a9716c8d250bb02e7584a5

File details

Details for the file turboprivate_ai-0.1.1-py3-none-any.whl.

File hashes

Hashes for turboprivate_ai-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 1f89ca1b5a3e5f3ff4d2bfaaa2647d7db14baff4fbfd56326b5bdebef8c74a22
MD5 5e3360a00177ee7364070bcc89cc689a
BLAKE2b-256 98d2ac0a1d54056bef8c949c9ff27f11d7cb666758b77390001892746d3584a2
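The published digests can be checked against a downloaded file before installing it. The sketch below uses only the Python standard library. The helper names `sha256_of` and `verify` are hypothetical, and it assumes the downloaded file keeps the file name listed above.

```python
import hashlib
import hmac
from pathlib import Path

# SHA256 digests copied from the listings above.
EXPECTED_SHA256 = {
    "turboprivate_ai-0.1.1.tar.gz":
        "faa5457043f2742dfd95494c2dc04d14c46801dc98562cb28685380449bb19ce",
    "turboprivate_ai-0.1.1-py3-none-any.whl":
        "1f89ca1b5a3e5f3ff4d2bfaaa2647d7db14baff4fbfd56326b5bdebef8c74a22",
}


def sha256_of(path, chunk_size=65536):
    """Hash a file in chunks so large archives need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path):
    """Return True if the file's SHA256 matches the published digest."""
    expected = EXPECTED_SHA256.get(Path(path).name)
    if expected is None:
        raise KeyError(f"no published digest for {Path(path).name}")
    return hmac.compare_digest(sha256_of(path), expected)
```

For example, `verify("turboprivate_ai-0.1.1.tar.gz")` returns `True` only when the local file matches the published SHA256. pip's hash-checking mode (`--require-hashes` with a requirements file) performs an equivalent check at install time.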
