InferiaLLM: distributed inference, filtration, and compute orchestration platform
Project description
InferiaLLM acts as the authoritative execution layer between your applications and your AI infrastructure. It governs how LLMs are accessed, secured, routed, and run on compute.
Installation
pip install inferiallm
Quick Start
InferiaLLM requires a .env file in your current working directory to configure connections to PostgreSQL and Redis.
# 1. Initialize your environment
# Create a .env file with your DATABASE_URL and Redis settings
cp .env.sample .env
# 2. Bootstrap the platform
# This creates the database, roles, and applying schemas
inferiallm init
# 3. Launch all services
# Starts Orchestration, Inference, and Filtration gateways in a single process
inferiallm start
Configuration
The CLI manages configuration through environment variables. The most critical settings are:
1. Database & Security
| Variable | Description | Default |
|---|---|---|
DATABASE_URL |
Primary database connection string | postgresql://inferia:inferia@localhost:5432/inferia |
PG_ADMIN_USER |
Postgres admin user (required for init) |
postgres |
PG_ADMIN_PASSWORD |
Postgres admin password (required for init) |
- |
JWT_SECRET_KEY |
Secret for signing access tokens | - |
INTERNAL_API_KEY |
Secret for service-to-service auth | - |
SECRET_ENCRYPTION_KEY |
32-byte base64 key for encrypting credentials | - |
CLI Reference
inferiallm init
Bootstraps the unified database environment. Output:
[inferia:init] Connecting as admin to bootstrap inferia
[inferia:init] Creating role: inferia
[inferia:init] Creating database: inferia
[inferia:init] Applying schema: global_schema
[inferia:init] Bootstrap complete
For existing databases, apply incremental schema updates manually:
psql "$DATABASE_URL" -f db/migrations/20260212_add_inference_logs_ip.sql
inferiallm start
Starts all InferiaLLM gateways (Orchestration, Inference, Filtration) and the Dashboard in one command.
You can also start specific services:
inferiallm start orchestration: Starts the Orchestration Gateway stack.inferiallm start inference: Starts the Inference Gateway standalone.inferiallm start filtration: Starts the Filtration Gateway standalone.
Package Structure
The inferia package is a monorepo-style library that contains all backend services:
package/src/inferia/
├── cli.py # Entry point for the CLI
├── gateways/
│ ├── filtration_gateway # Security & Policy Service (Port 8000)
│ ├── inference_gateway # Inference Proxy Service (Port 8001)
│ └── orchestration_gateway # Compute Control Plane (Port 8080)
└── services/ # Shared business logic
├── filtration/ # Guardrails, RBAC, Audit logic
└── orchestration/ # Compute Adapters, Scheduling logic
Core Capabilities
- Unified Control Plane: Orchestrate LLMs across heterogeneous compute (K8s, DePIN, VPS).
- Policy Enforcement: Centralized RBAC, safety guardrails, and budget controls.
- Execution Boundary: Authority-based routing between applications and infrastructure.
For full documentation, architecture diagrams, and deployment guides, visit the main repository.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file inferiallm-1.1.0.tar.gz.
File metadata
- Download URL: inferiallm-1.1.0.tar.gz
- Upload date:
- Size: 582.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3b56cd31aad48cc31ed905688d37d85b3f939375d7f7e91feb308dd38dccfe23
|
|
| MD5 |
04b54d4e44eee1231b34c5e101ca99d4
|
|
| BLAKE2b-256 |
ec8fc29a337e5797b8ea70680657e4f6af0b1db82a9c25083635e20f2009b8e0
|
File details
Details for the file inferiallm-1.1.0-py3-none-any.whl.
File metadata
- Download URL: inferiallm-1.1.0-py3-none-any.whl
- Upload date:
- Size: 705.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5f484b409ab11b09ca7a3422186d4b3f18c83407fc842b9dbc6573e100d6ef8c
|
|
| MD5 |
7cfee09b4daa050e53da521575fdac44
|
|
| BLAKE2b-256 |
7c34137d0e5983a2f1fdce880b19f22b1918e7fcc89cf6e6075bdf9bfe0560f2
|