Open-source, OpenAI-compatible API server with pluggable providers for any model and any infrastructure
Project description
OGX
Quick Start | Documentation | OpenAI API Compatibility | Discord
[!IMPORTANT] Llama Stack is now OGX. The name changed, and so did the mission — model-agnostic, multi-SDK, production-grade. Read the full announcement →
Open-source agentic API server for building AI applications. OpenAI-compatible. Any model, any infrastructure.
OGX is a drop-in replacement for the OpenAI API that you can run anywhere — your laptop, your datacenter, or the cloud. Use any OpenAI-compatible client or agentic framework. Swap between Llama, GPT, Gemini, Mistral, or any model without changing your application code.
from openai import OpenAI
client = OpenAI(base_url="http://localhost:8321/v1", api_key="fake")
response = client.chat.completions.create(
model="llama-3.3-70b",
messages=[{"role": "user", "content": "Hello"}],
)
What you get
- Chat Completions & Embeddings — standard
/v1/chat/completions,/v1/completions, and/v1/embeddingsendpoints, compatible with any OpenAI client - Responses API — server-side agentic orchestration with tool calling, MCP server integration, and built-in file search (RAG) in a single API call (learn more)
- Vector Stores & Files —
/v1/vector_storesand/v1/filesfor managed document storage and search - Batches —
/v1/batchesfor offline batch processing - Open Responses conformant — the Responses API implementation passes the Open Responses conformance test suite
- Multi-SDK support — use the Anthropic SDK (
/v1/messages) or Google GenAI SDK (/v1alpha/interactions) natively alongside the OpenAI API
Use any model, use any infrastructure
OGX has a pluggable provider architecture. Develop locally with Ollama, deploy to production with vLLM, or connect to a managed service — the API stays the same.
See the provider documentation for the full list.
Get started
Install and run a OGX server:
# One-line install
curl -LsSf https://github.com/ogx-ai/ogx/raw/main/scripts/install.sh | bash
# Or install via uv
uv pip install ogx[starter]
# Start the server (uses the starter distribution with Ollama)
uv run ogx stack run starter
Then connect with any OpenAI, Anthropic, or Google GenAI client — Python, TypeScript, curl, or any framework that speaks these APIs.
See the Quick Start guide for detailed setup.
Resources
- Documentation — full reference
- OpenAI API Compatibility — endpoint coverage and provider matrix
- Getting Started Notebook — text and vision inference walkthrough
- Contributing — how to contribute
Client SDKs:
| Language | SDK | Package |
|---|---|---|
| Python | ogx-client-python | |
| TypeScript | ogx-client-typescript |
Community
We hold regular community calls every Thursday at 09:00 AM PST — see the Community Event on Discord for details.
Thanks to all our amazing contributors!
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file ogx-1.0.1.tar.gz.
File metadata
- Download URL: ogx-1.0.1.tar.gz
- Upload date:
- Size: 16.9 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d79f9969c7fda764f94d826ab1a8986ceb657c02d4d06b8d9c550169234981c3
|
|
| MD5 |
6f48cf58050f8d0664e30fa90db19c74
|
|
| BLAKE2b-256 |
23f0f412d36987b27fb4abc69894098b89ba760e0a775fd0fef5758d1d81d31c
|
Provenance
The following attestation bundles were made for ogx-1.0.1.tar.gz:
Publisher:
pypi.yml on ogx-ai/ogx
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
ogx-1.0.1.tar.gz -
Subject digest:
d79f9969c7fda764f94d826ab1a8986ceb657c02d4d06b8d9c550169234981c3 - Sigstore transparency entry: 1525266440
- Sigstore integration time:
-
Permalink:
ogx-ai/ogx@83e155871e556c0e825353eec60a8894851a0dc7 -
Branch / Tag:
refs/tags/v1.0.1 - Owner: https://github.com/ogx-ai
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
pypi.yml@83e155871e556c0e825353eec60a8894851a0dc7 -
Trigger Event:
release
-
Statement type:
File details
Details for the file ogx-1.0.1-py3-none-any.whl.
File metadata
- Download URL: ogx-1.0.1-py3-none-any.whl
- Upload date:
- Size: 696.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
81fe1f712e540b5b8058aae116b88c111cf9b8f53a977a5ff5b31b1172fe1563
|
|
| MD5 |
3366b1a45a80bcc5d3f697164afb78ae
|
|
| BLAKE2b-256 |
8c89dba23c3f16080207e286339b31990bd234731643ffac9db00e889a4f29b1
|
Provenance
The following attestation bundles were made for ogx-1.0.1-py3-none-any.whl:
Publisher:
pypi.yml on ogx-ai/ogx
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
ogx-1.0.1-py3-none-any.whl -
Subject digest:
81fe1f712e540b5b8058aae116b88c111cf9b8f53a977a5ff5b31b1172fe1563 - Sigstore transparency entry: 1525266518
- Sigstore integration time:
-
Permalink:
ogx-ai/ogx@83e155871e556c0e825353eec60a8894851a0dc7 -
Branch / Tag:
refs/tags/v1.0.1 - Owner: https://github.com/ogx-ai
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
pypi.yml@83e155871e556c0e825353eec60a8894851a0dc7 -
Trigger Event:
release
-
Statement type: