Skip to main content

Fully OpenAI-compatible API proxy for Kilo

Project description

Kilo Proxy

A fully OpenAI-compatible API proxy for Kilo.

Features

  • Full OpenAI API compatibility - chat completions, completions, embeddings, models
  • All OpenAI parameters supported - max_tokens, temperature, tools, stream, response_format, etc.
  • Streaming SSE support - real-time streaming responses
  • Background server mode - run as a daemon process
  • Token authentication management - easy auth configuration
  • Cross-platform - Windows, macOS, Linux

Installation

pip install -e .

Or install from source:

pip install .

CLI Commands

Authentication

Set your Kilo authentication token:

kilo-proxy auth [token]

If no token is provided, anonymous access is used by default.

Start Server (Background)

Start the server in background mode:

kilo-proxy start [--host HOST] [--port PORT]

Options:

  • --host - Host to bind to (default: 127.0.0.1)
  • --port - Port to listen on (default: 5380)

Stop Server

Stop the background server:

kilo-proxy stop

Run Server (Foreground)

Run the server in foreground mode:

kilo-proxy proxy [--host HOST] [--port PORT]

Options:

  • --host - Host to bind to (default: 127.0.0.1)
  • --port - Port to listen on (default: 5380)

Server Status

Check the current server status:

kilo-proxy status

List Models

List all available models:

kilo-proxy models

Broke Mode (Toggle Free Models)

Toggle broke mode (only show free models):

kilo-proxy broke          # Toggle broke mode
kilo-proxy broke on       # Enable broke mode
kilo-proxy broke off      # Disable broke mode
kilo-proxy broke --list   # List all free models

Configuration Wizard

Interactive configuration:

kilo-proxy config         # Run configuration wizard
kilo-proxy config --show  # Show current config

Install OpenCode Integration

Install and configure OpenCode to use Kilo models:

kilo-proxy install opencode

This interactive wizard will:

  1. Find or create your OpenCode config
  2. Ask if you're broke (free models only)
  3. Let you select models for all or per agent/category
  4. Optionally install Oh My OpenCode support
  5. Map Oh My OpenCode models to Kilo models
  6. Write the configuration files

Show Configuration

Display the current configuration:

kilo-proxy config-show

Autolaunch (Run on Boot)

Install the proxy to start automatically on system boot:

kilo-proxy autolaunch

Remove the proxy from boot:

kilo-proxy unautolaunch

Works on Windows (Task Scheduler), macOS (LaunchAgents), and Linux (systemd user services).

API Endpoints

Method Endpoint Description
GET /v1/models List all available models
GET /v1/models/{model_id} Get a specific model
POST /v1/chat/completions Create a chat completion
POST /v1/completions Create a text completion
POST /v1/embeddings Create embeddings
GET /v1/engines List engines (legacy)
GET /health Health check endpoint

Usage Examples

List Models

curl http://localhost:5380/v1/models

Chat Completion (Non-streaming)

curl http://localhost:5380/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "z-ai/glm-5:free",
    "messages": [
      {"role": "user", "content": "Hello, how are you?"}
    ],
    "max_tokens": 100,
    "temperature": 0.7
  }'

Chat Completion (Streaming)

curl http://localhost:5380/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "z-ai/glm-5:free",
    "messages": [
      {"role": "user", "content": "Tell me a short story"}
    ],
    "stream": true,
    "max_tokens": 500
  }'

Configuration

File Location
Config file ~/.kilo-proxy/config.json
PID file ~/.kilo-proxy/server.pid

Supported Models

Popular free models available through Kilo:

  • z-ai/glm-5:free
  • minimax/minimax-m2.5:free
  • deepseek/deepseek-chat:free
  • meta-llama/llama-3-8b-instruct:free
  • qwen/qwen-2-7b-instruct:free

Run kilo-proxy models to see all available models.

Requirements

  • Python 3.9+

License

MIT

Author

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kilo_proxy-1.1.0.tar.gz (23.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

kilo_proxy-1.1.0-py3-none-any.whl (23.8 kB view details)

Uploaded Python 3

File details

Details for the file kilo_proxy-1.1.0.tar.gz.

File metadata

  • Download URL: kilo_proxy-1.1.0.tar.gz
  • Upload date:
  • Size: 23.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for kilo_proxy-1.1.0.tar.gz
Algorithm Hash digest
SHA256 de17de9ae5516d6905bc04356aac67c868dcff3c92489331a21a480867870304
MD5 57cc92c9e070d49707fcd6b27509f25a
BLAKE2b-256 476e6680025a9aa4358e3a85d033fd18f880713ac45f32b4c17229b51f4cbc67

See more details on using hashes here.

File details

Details for the file kilo_proxy-1.1.0-py3-none-any.whl.

File metadata

  • Download URL: kilo_proxy-1.1.0-py3-none-any.whl
  • Upload date:
  • Size: 23.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for kilo_proxy-1.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 f09bb4c6b3ed3808776750b608ee78547f41e089ec944355d7d3cc92259612a1
MD5 1804e85d391e779514d7f9767e19cf3f
BLAKE2b-256 3b62669911cad493f33d7646eccea042ff0b5ad6990fc3f0e0263a5a10ab609a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page