Open-source, OpenAI-compatible API server with pluggable providers for any model and any infrastructure

OGX

[!IMPORTANT] Llama Stack is now OGX. The name changed, and so did the mission — model-agnostic, multi-SDK, production-grade. Read the full announcement →

Open-source agentic API server for building AI applications. OpenAI-compatible. Any model, any infrastructure.

OGX is a drop-in replacement for the OpenAI API that you can run anywhere — your laptop, your datacenter, or the cloud. Use any OpenAI-compatible client or agentic framework. Swap between Llama, GPT, Gemini, Mistral, or any model without changing your application code.

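from openai import OpenAI

# Point the standard OpenAI client at a locally running OGX server.
# The api_key value here is only a placeholder.
client = OpenAI(base_url="http://localhost:8321/v1", api_key="fake")
response = client.chat.completions.create(
    model="llama-3.3-70b",
    messages=[{"role": "user", "content": "Hello"}],
)
# Print the assistant's reply text.
print(response.choices[0].message.content)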

What you get

  • Chat Completions & Embeddings: standard /v1/chat/completions, /v1/completions, and /v1/embeddings endpoints, compatible with any OpenAI client
  • Responses API: server-side agentic orchestration with tool calling, MCP server integration, and built-in file search (RAG) in a single API call
  • Vector Stores & Files: /v1/vector_stores and /v1/files for managed document storage and search
  • Batches: /v1/batches for offline batch processing
  • Open Responses conformant: the Responses API implementation passes the Open Responses conformance test suite
  • Multi-SDK support: use the Anthropic SDK (/v1/messages) or Google GenAI SDK (/v1alpha/interactions) natively alongside the OpenAI API
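
As a rough illustration of the endpoints listed above, the sketch below builds raw HTTP requests using only the Python standard library. The server address and chat model name come from the example earlier in this README. The `build_request` and `send` helpers, the embedding model name, the bearer-token header, and the payload shapes (which follow the public OpenAI API) are assumptions made for illustration, not confirmed OGX behavior.

```python
import json
import urllib.request

BASE_URL = "http://localhost:8321"  # server address used earlier in this README


def build_request(path: str, payload: dict, api_key: str = "fake") -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for an OpenAI-style endpoint."""
    return urllib.request.Request(
        url=BASE_URL + path,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Assumption: OpenAI-style bearer authentication.
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> dict:
    """Send a prepared request and decode the JSON response body."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)


# Chat completion request, mirroring the client example above.
chat_request = build_request(
    "/v1/chat/completions",
    {"model": "llama-3.3-70b", "messages": [{"role": "user", "content": "Hello"}]},
)

# Embedding request; the model name is a hypothetical placeholder.
embedding_request = build_request(
    "/v1/embeddings",
    {"model": "example-embedding-model", "input": ["Hello"]},
)
```

With a server running, `send(chat_request)` would return the decoded JSON response. Nothing is sent over the network until `send` is called.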

Use any model, use any infrastructure

OGX has a pluggable provider architecture. Develop locally with Ollama, deploy to production with vLLM, or connect to a managed service — the API stays the same.

See the provider documentation for the full list.
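
One way to keep application code unchanged while switching providers is to read the server address and model name from configuration. The sketch below is a minimal illustration of that idea. The environment variable names `OGX_BASE_URL` and `OGX_MODEL`, the `InferenceSettings` class, and the example production URL and model name are all hypothetical, chosen for this example; they are not settings that OGX itself defines.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class InferenceSettings:
    base_url: str
    model: str


def load_settings(env: Optional[Mapping[str, str]] = None) -> InferenceSettings:
    """Read connection settings from an environment mapping, with local defaults."""
    env = os.environ if env is None else env
    return InferenceSettings(
        # Hypothetical variable names; defaults match the example earlier in this README.
        base_url=env.get("OGX_BASE_URL", "http://localhost:8321/v1"),
        model=env.get("OGX_MODEL", "llama-3.3-70b"),
    )


# Local development: no variables set, so the defaults apply.
local = load_settings({})

# Production: same code path, different server and model (placeholder values).
production = load_settings(
    {"OGX_BASE_URL": "https://ogx.example.internal/v1", "OGX_MODEL": "example-production-model"}
)
```

The resulting `base_url` can be passed to `OpenAI(base_url=...)` and `model` to `client.chat.completions.create(...)`, so only configuration changes between environments.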

Get started

Install and run an OGX server:

# One-line install
curl -LsSf https://github.com/ogx-ai/ogx/raw/main/scripts/install.sh | bash

# Or install via uv (the quotes keep shells such as zsh from expanding the brackets)
uv pip install "ogx[starter]"

# Start the server (uses the starter distribution with Ollama)
uv run ogx stack run starter

Then connect with any OpenAI, Anthropic, or Google GenAI client — Python, TypeScript, curl, or any framework that speaks these APIs.

See the Quick Start guide for detailed setup.
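
Before connecting a client, it can help to confirm that something is listening on the server port. The following is a minimal sketch using only the standard library. The port 8321 is taken from the client example earlier in this README, and the `is_listening` helper is a name introduced here for illustration. A successful TCP connection only shows that the port is open, not that the API is fully ready.

```python
import socket


def is_listening(host: str = "localhost", port: int = 8321, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and name resolution failures.
        return False
```

For example, a startup script could poll `is_listening()` a few times with a short sleep between attempts before issuing its first request.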

Resources

Client SDKs:

OGX provides official client SDKs for Python and TypeScript:

Language     SDK                     Package
Python       ogx-client-python       PyPI version
TypeScript   ogx-client-typescript   NPM version

Alternative Python SDK:

For users who prefer an OpenAPI Generator-based SDK, an alternative Python client is available:

  • ogx-open-client — Auto-generated from OpenAPI spec, provides similar functionality with a different generation approach
  • Usage Examples — End-to-end code examples for all major features
  • Strategy & Rationale — Why two SDKs, when to use which, and long-term plans

The official ogx_client SDK is recommended for most use cases. The ogx_open_client package offers an alternative for teams with specific OpenAPI tooling requirements.

Community

We hold regular community calls every Thursday at 09:00 AM PST — see the Community Event on Discord for details.

Thanks to all our amazing contributors!
