Skip to main content

InferiaLLM: distributed inference, filtration, and compute orchestration platform

Project description

InferiaLLM

The Operating System for LLMs in Production

LicensePythonStatusPRs Welcome

InferiaLLM acts as the authoritative execution layer between your applications and your AI infrastructure. It governs how LLMs are accessed, secured, routed, and run on compute.


Installation

pip install inferiallm

Quick Start

InferiaLLM requires a .env file in your current working directory to configure connections to PostgreSQL and Redis.

# 1. Initialize your environment
# Create a .env file with your DATABASE_URL and Redis settings
cp .env.sample .env

# 2. Bootstrap the platform
# This creates the database, roles, and applying schemas
inferiallm init

# 3. Launch all services
# Starts API, Orchestration, Inference, and the Dashboard in one go
inferiallm start all

Configuration

The CLI manages configuration through environment variables. The most critical settings are:

1. Database & Security

Variable Description Default
DATABASE_URL Primary database connection string postgresql://inferia:inferia@localhost:5432/inferia
PG_ADMIN_USER Postgres admin user (required for init) postgres
PG_ADMIN_PASSWORD Postgres admin password (required for init) -
JWT_SECRET_KEY Secret for signing access tokens -
INTERNAL_API_KEY Secret for service-to-service auth -
SECRET_ENCRYPTION_KEY 32-byte base64 key for encrypting credentials -

CLI Reference

inferiallm init

Bootstraps the unified database environment. Output:

[inferia:init] Connecting as admin to bootstrap inferia
[inferia:init] Creating role: inferia
[inferia:init] Creating database: inferia
[inferia:init] Applying schema: global_schema
[inferia:init] Bootstrap complete

For existing databases, apply incremental schema updates manually:

psql "$DATABASE_URL" -f db/migrations/20260212_add_inference_logs_ip.sql

inferiallm start

Starts all InferiaLLM gateways (API, Orchestration, Inference) and the Dashboard.

You can also start specific services:

  • inferiallm start all: Starts all core services (default).
  • inferiallm start api-gateway: Starts only the API Gateway.
  • inferiallm start orchestration: Starts the Orchestration stack (API, Sidecar, Worker).
  • inferiallm start inference: Starts the Inference engine.

Package Structure

The inferia package is a monorepo-style library that contains all backend services:

package/src/inferia/
├── cli.py                  # Entry point for the CLI
├── services/
│   ├── api_gateway/        # Security & Policy Service (Port 8000)
│   ├── inference/          # Inference Proxy Service (Port 8001)
│   ├── orchestration/      # Compute Control Plane (Port 8080)
│   ├── data/               # Data Engine (Port 8003)
│   └── guardrail/          # Guardrail Engine (Port 8002)

Core Capabilities

  • Unified Control Plane: Orchestrate LLMs across heterogeneous compute (K8s, DePIN, VPS).
  • Policy Enforcement: Centralized RBAC, safety guardrails, and budget controls.
  • Execution Boundary: Authority-based routing between applications and infrastructure.

For full documentation, architecture diagrams, and deployment guides, visit the main repository.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

inferiallm-0.1.0b1.tar.gz (830.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

inferiallm-0.1.0b1-py3-none-any.whl (1.0 MB view details)

Uploaded Python 3

File details

Details for the file inferiallm-0.1.0b1.tar.gz.

File metadata

  • Download URL: inferiallm-0.1.0b1.tar.gz
  • Upload date:
  • Size: 830.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for inferiallm-0.1.0b1.tar.gz
Algorithm Hash digest
SHA256 4fb6ab7c7701f278a362c2b4109557e82a362118c9ab970a8fdd96767c0af4af
MD5 359ad68f51fed31da0584b90be1f74cb
BLAKE2b-256 f82e095d33e1e06d0bb8679a7098b3eea04bf2eb7f94306ab427401e5efcf6f0

See more details on using hashes here.

File details

Details for the file inferiallm-0.1.0b1-py3-none-any.whl.

File metadata

  • Download URL: inferiallm-0.1.0b1-py3-none-any.whl
  • Upload date:
  • Size: 1.0 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for inferiallm-0.1.0b1-py3-none-any.whl
Algorithm Hash digest
SHA256 9d6726a61f3485432305d121019212cc0ead346c700f97eda104a04503e44d45
MD5 5e3d849d61ae4f649f195926f58284e3
BLAKE2b-256 a4fb23d1524e4424eb53a8ae8126ccb75c533f49470afcf4a00a4b78085e2769

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page