
InferiaLLM: distributed inference, filtration, and compute orchestration platform


InferiaLLM CLI

The Operating System for LLMs in Production


InferiaLLM provides a unified CLI to manage the platform's control plane, initialize infrastructure, and orchestrate gateways (Orchestration, Inference, and Filtration).


Installation

pip install inferiallm

Quick Start

InferiaLLM requires a .env file in your current working directory to configure connections to PostgreSQL and Redis.

# 1. Initialize your environment
# Create a .env file with your DATABASE_URL and Redis settings
cp .env.sample .env

# 2. Bootstrap the platform
# Creates the database and roles, then applies the schemas
inferiallm init

# 3. Launch all services
# Starts Orchestration, Inference, and Filtration gateways in a single process
inferiallm api-start
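
Before running `inferiallm init`, it can help to confirm that the .env file actually defines the settings the CLI expects. The following is a minimal sketch, not part of InferiaLLM: the helper names `parse_env`, `missing_keys`, and `check_env_file` are hypothetical, the parser assumes plain KEY=VALUE lines, and treating the settings listed without a default in the Configuration section as mandatory is an assumption.

```python
from pathlib import Path

# Settings that the Configuration section lists without a default value.
# Treating all of them as mandatory is an assumption of this sketch.
REQUIRED_KEYS = (
    "PG_ADMIN_PASSWORD",
    "JWT_SECRET_KEY",
    "INTERNAL_API_KEY",
    "SECRET_ENCRYPTION_KEY",
)


def parse_env(text: str) -> dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(env: dict[str, str], required=REQUIRED_KEYS) -> list[str]:
    """Return the required keys that are absent or empty, in declared order."""
    return [key for key in required if not env.get(key)]


def check_env_file(path: str = ".env") -> list[str]:
    """Read a .env file and return the required settings it does not define."""
    env_path = Path(path)
    if not env_path.exists():
        raise FileNotFoundError(f"{path} not found in {Path.cwd()}")
    return missing_keys(parse_env(env_path.read_text()))
```

Calling `check_env_file()` from the directory where you run the CLI returns an empty list when every listed setting is present.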

Configuration

The CLI manages configuration through environment variables. The most critical settings are:

1. Database & Security

Variable | Description | Default
DATABASE_URL | Primary database connection string | postgresql://inferia:inferia@localhost:5432/inferia
PG_ADMIN_USER | Postgres admin user (required for init) | postgres
PG_ADMIN_PASSWORD | Postgres admin password (required for init) | -
JWT_SECRET_KEY | Secret for signing access tokens | -
INTERNAL_API_KEY | Secret for service-to-service auth | -
SECRET_ENCRYPTION_KEY | 32-byte base64 key for encrypting credentials | -
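
The three secrets above have no default, so they need to be generated. One way to do that with the Python standard library is sketched below. The function names are hypothetical, the token length of 48 bytes is an arbitrary choice, and whether InferiaLLM expects standard or URL-safe base64 for SECRET_ENCRYPTION_KEY is not stated here, so the standard alphabet is an assumption.

```python
import base64
import secrets


def new_encryption_key() -> str:
    """Return 32 random bytes encoded as standard base64 text."""
    # Assumption: the platform accepts the standard base64 alphabet.
    return base64.b64encode(secrets.token_bytes(32)).decode("ascii")


def new_token(num_bytes: int = 48) -> str:
    """Return a random URL-safe string for use as a signing or API secret."""
    return secrets.token_urlsafe(num_bytes)


def env_lines() -> list[str]:
    """Build .env lines for the three secrets that have no default."""
    return [
        f"JWT_SECRET_KEY={new_token()}",
        f"INTERNAL_API_KEY={new_token()}",
        f"SECRET_ENCRYPTION_KEY={new_encryption_key()}",
    ]
```

Printing `"\n".join(env_lines())` gives three lines that can be pasted into the .env file. Generate them once and keep them stable, since rotating an encryption key usually makes previously encrypted values unreadable.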

2. Provider Specifics (Optional)

Required for provisioning compute from external providers:

  • AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY, AWS_REGION
  • AKASH_MNEMONIC
  • NOSANA_WALLET_PRIVATE_KEY
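
To see at a glance which providers are fully configured, the variables listed above can be checked as groups. This is a rough sketch: the provider labels and the function name `missing_provider_vars` are hypothetical, and requiring all three AWS variables together is an assumption.

```python
import os

# Variable names come from the list above; the dictionary keys are
# illustrative labels, not identifiers used by InferiaLLM.
PROVIDER_VARS = {
    "aws": ("AWS_ACCESS_KEY_ID", "AWS_SECRET_ACCESS_KEY", "AWS_REGION"),
    "akash": ("AKASH_MNEMONIC",),
    "nosana": ("NOSANA_WALLET_PRIVATE_KEY",),
}


def missing_provider_vars(environ=None) -> dict[str, list[str]]:
    """Map each provider to the variables that are unset or empty."""
    environ = os.environ if environ is None else environ
    return {
        provider: [name for name in names if not environ.get(name)]
        for provider, names in PROVIDER_VARS.items()
    }
```

A provider whose list comes back empty has every variable set; since these settings are optional, a non-empty list only matters for providers you intend to use.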

CLI Reference

inferiallm init

Bootstraps the unified database environment: creates the role and database, then applies the schema. Example output:

[inferia:init] Connecting as admin to bootstrap inferia
[inferia:init] Creating role: inferia
[inferia:init] Creating database: inferia
[inferia:init] Applying schema: global_schema
[inferia:init] Bootstrap complete
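
In a deployment script, the final log line shown above can serve as a success check. The sketch below assumes that the log is written to standard output, that a successful run exits with code 0, and that the completion message stays exactly as shown; none of these is confirmed here. The function names are hypothetical.

```python
import subprocess

# Final line of the sample output above.
SUCCESS_MARKER = "[inferia:init] Bootstrap complete"


def bootstrap_succeeded(output: str) -> bool:
    """True when the last non-empty output line is the completion message."""
    lines = [line.strip() for line in output.splitlines() if line.strip()]
    return bool(lines) and lines[-1] == SUCCESS_MARKER


def run_init() -> bool:
    """Run `inferiallm init` and report whether it appears to have finished."""
    # Assumption: logs go to stdout and success is signalled by exit code 0.
    result = subprocess.run(["inferiallm", "init"], capture_output=True, text=True)
    return result.returncode == 0 and bootstrap_succeeded(result.stdout)
```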

inferiallm api-start

Starts all InferiaLLM gateways (Orchestration, Inference, Filtration) and the Dashboard in one command.

inferiallm orchestration-gateway

Starts the Orchestration Gateway standalone (manages compute and routing).

inferiallm inference-gateway

Starts the Inference Gateway standalone (handles data-plane ingress).

inferiallm filtration-gateway

Starts the Filtration Gateway standalone (enforces RBAC, quotas, and guardrails).
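
When the gateways run standalone, each one is a separate command. A small launcher could look like the sketch below, which assumes the three subcommands need no extra arguments (none are shown above) and leaves port assignment, logging, and restarts out of scope. The names `GATEWAY_COMMANDS`, `gateway_argv`, and `start_gateways` are hypothetical.

```python
import subprocess

# Subcommand names are taken from the CLI reference above.
GATEWAY_COMMANDS = {
    "orchestration": "orchestration-gateway",
    "inference": "inference-gateway",
    "filtration": "filtration-gateway",
}


def gateway_argv(name: str) -> list[str]:
    """Build the command line for one standalone gateway."""
    try:
        return ["inferiallm", GATEWAY_COMMANDS[name]]
    except KeyError:
        raise ValueError(f"unknown gateway: {name}") from None


def start_gateways(names) -> dict[str, subprocess.Popen]:
    """Start each named gateway as its own child process."""
    return {name: subprocess.Popen(gateway_argv(name)) for name in names}
```

For example, `start_gateways(["inference", "filtration"])` would launch two of the gateways and return their process handles, which the caller can later `wait()` on or `terminate()`.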


Core Capabilities

  • Unified Control Plane: Orchestrate LLMs across heterogeneous compute (K8s, DePIN, VPS).
  • Policy Enforcement: Centralized RBAC, safety guardrails, and budget controls.
  • Execution Boundary: Authority-based routing between applications and infrastructure.

For full documentation, architecture diagrams, and deployment guides, visit the main repository.
