CLI for setting up a self-hosted alternative to OpenAI's Responses API
Project description
Open Responses CLI
A CLI tool for setting up a self-hosted alternative to OpenAI's Responses API. This API lets you create and manage open-ended AI responses for your applications, similar to OpenAI's Responses API, but fully under your control.
Features
- Easy setup with Docker Compose
- Compatible API endpoints with OpenAI's Responses API
- Management UI for creating, viewing, and managing responses
- Local data storage with PostgreSQL
- Customizable authentication and timeout settings
Installation
You can install this CLI using either npm or pip:
Using npm
npx open-responses
Or install it globally:
npm install -g open-responses
open-responses
Using pip
pipx open-responses
Or install it globally:
pip install open-responses
open-responses
Usage
First-time Setup
Before using any commands, you must run the setup command:
open-responses setup
This will:
- Ask for configuration settings with default values:
- Host (default: 127.0.0.1)
- Port (default: 8080)
- Docker tag (default: latest_responses)
- Base Docker Compose URI (default: https://u.julep.ai/responses-compose.yaml)
- Environment file location (default: .env in Git root or current directory)
- API version (default: 0.0.1)
- Ask for API configuration values (port, authentication key, timeout)
- Create a .env file with your settings
- Download or generate a docker-compose.yml file with the necessary services:
- API server
- Database
- Management UI
- Create a configuration file (open-responses.json) to track your settings
The CLI will automatically check for this configuration before running any other commands. If the configuration file doesn't exist, it will prompt you to run the setup command first.
Configuration File
The CLI stores its configuration in open-responses.json, which can be located in:
- The current directory
- The parent directory
- The Git repository root directory
The configuration file tracks:
- All user-defined settings
- Environment variable values
- Creation and update timestamps (both
camelCaseandsnake_caseformats are supported) - File locations and version information
When you run setup again with an existing configuration, it will let you update your settings while preserving your previous values as defaults. If timestamps are missing from an existing configuration, they'll be added automatically when the configuration is updated.
API Configuration
The API service includes the following configuration options with sensible defaults:
Basic Settings
HOST: Host address for the API (default:127.0.0.1)PORT: Port for the UI service (default:8080)RESPONSES_API_PORT: Port for the API service (default:8080)DOCKER_TAG: Docker image tag (default:latest_responses)API_VERSION: API version (default:0.0.1)
Performance & Limits
NODE_ENV: Node.js environment (default:production)LOG_LEVEL: Logging level (default:info)REQUEST_TIMEOUT: API request timeout in ms (default:120000- 2 minutes)MAX_PAYLOAD_SIZE: Maximum request payload size (default:10mb)RATE_LIMIT_WINDOW: Rate limit window in ms (default:60000- 1 minute)RATE_LIMIT_MAX: Maximum requests per rate limit window (default:100)
Resource Allocation
The Docker Compose configuration also includes resource limits to ensure stable operation:
- API Service: 1 CPU, 2GB memory (min: 0.25 CPU, 512MB)
- Database: 1 CPU, 1GB memory (min: 0.1 CPU, 256MB)
- Redis: 0.5 CPU, 768MB memory (min: 0.1 CPU, 128MB)
- UI: 0.5 CPU, 512MB memory (min: 0.1 CPU, 128MB)
These settings provide a good balance for most deployments, but you can adjust them in the docker-compose.yml file if needed.
User-Friendly Commands
The CLI provides easy-to-use commands for common operations:
Starting the service
open-responses start
This user-friendly command is an alias for open-responses compose up -d that:
- Starts all services in detached mode (background)
- Shows the status of services after startup
- Displays access URLs for the API and admin UI
Stopping the service
open-responses stop
This command stops all services and performs cleanup (alias for open-responses compose down).
Checking service status
open-responses status
Shows detailed information about all services, including:
- Running state and health status
- Uptime information
- Resource usage summary
- Access URLs
Viewing logs
open-responses logs [SERVICE]
Shows logs from services with sensible defaults:
- Follows logs in real-time
- Shows colorized output
- Displays last 100 lines by default
- Can target specific services
Examples:
# View logs from all services:
open-responses logs
# View logs from a specific service:
open-responses logs api
Initializing a new project
open-responses init
Creates a new project structure with guided setup:
- Creates directory structure (data, config, logs)
- Generates helpful documentation files
- Runs interactive configuration
- Sets up Docker Compose with best practices
Managing API keys
open-responses key <action>
Manages API keys for the Responses API service:
# List all API keys (masked):
open-responses key list
# Generate a new API key:
open-responses key generate [type]
# Update an API key:
open-responses key set <type> [value]
Updating components
open-responses update
Updates all components to the latest version:
- Updates Docker Compose configuration
- Pulls latest Docker images
- Backs up your configuration
Advanced Docker Compose Commands
For more advanced operations, use the compose command group:
open-responses compose <command> [args...]
Available commands include:
# Start services with additional options:
open-responses compose up [flags]
# Stop and clean up services:
open-responses compose down [flags]
# View logs with custom options:
open-responses compose logs [flags] [SERVICE...]
# List containers:
open-responses compose ps [flags]
# Build services:
open-responses compose build [flags] [SERVICE...]
# Restart services:
open-responses compose restart [flags] [SERVICE...]
# Pull service images:
open-responses compose pull [flags] [SERVICE...]
# Execute commands in containers:
open-responses compose exec [flags] SERVICE COMMAND [ARGS...]
# Run one-off commands:
open-responses compose run [flags] SERVICE COMMAND [ARGS...]
# Validate Docker Compose configuration:
open-responses compose config [flags]
# View processes in containers:
open-responses compose top [SERVICE...]
# Monitor resource usage:
open-responses compose stats [SERVICE...]
Each compose command is a direct proxy to the equivalent Docker Compose command and accepts all the same flags and arguments. This provides full access to Docker Compose functionality when needed.
For detailed examples of each command, use the --help flag:
open-responses compose up --help
open-responses compose logs --help
Or for general help:
open-responses --help
API Endpoints
Once your service is running, the following endpoints will be available:
POST /v1/responses- Create a new responseGET /v1/responses/{id}- Retrieve a responseGET /v1/responses- List all responsesDELETE /v1/responses/{id}- Delete a response
You can access the management UI at http://localhost:8080 (or your configured port).
Requirements
- Docker must be installed on your system
- Docker Compose must be installed (either as a standalone binary or integrated plugin)
- Docker Compose V2 (≥ 2.21.0) is recommended for best compatibility
- Docker Compose V1 is supported but with limited functionality
- No other runtime dependencies required (no Node.js or Python needed for running the service)
The CLI will check Docker and Docker Compose requirements and provide helpful instructions if they're not met.
How it works
This CLI is built with Go and compiled to native binaries for Windows, macOS, and Linux. When installed via npm or pip, the appropriate binary for your platform is used automatically.
The service itself runs in Docker containers, providing a compatible alternative to OpenAI's Responses API.
Development
Project Structure
main.go: Core CLI functionality built with Goopen_responses/__init__.py: Python wrapper for binary distributionscripts/postinstall.js: Node.js script for platform detection and setupbin/: Directory for compiled binaries
Building from Source
Build for your current platform:
npm run build
Build for all platforms:
npm run build:all
This will generate binaries in the bin/ directory:
bin/open-responses-linuxbin/open-responses-macosbin/open-responses-win.exe
Installing for Development
For Python:
pip install -e .
For npm:
npm link
Development Guidelines
This project follows strict formatting and linting guidelines to maintain code quality. We use:
- Go: Standard Go formatting with
go fmt - Python: Ruff for linting and formatting
- JavaScript: ESLint and Prettier for linting and formatting
Setting Up Development Environment
- Install development dependencies:
# Install JavaScript dependencies
npm install
# Install Python dependencies
uv pip install ruff
# Install git hooks for automatic linting/formatting
npm run install:hooks
- The git hooks will automatically run formatting and linting checks before each commit.
Code Formatting and Linting
You can manually run code formatting and linting using these commands:
# Format all code
npm run format:all
# Lint all code
npm run lint:all
# Format/lint individual languages
npm run format # JavaScript/JSON/Markdown files
npm run py:format # Python files
npm run go:format # Go files
npm run lint # JavaScript files
npm run py:lint # Python files
License
Apache-2.0
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file open_responses-0.1.4.tar.gz.
File metadata
- Download URL: open_responses-0.1.4.tar.gz
- Upload date:
- Size: 75.0 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.6.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
539599b236fdd70fd758de43cd883e2b3aab10eab044c1bb58cdf1a0e4a45e26
|
|
| MD5 |
2bb7dbe5c058f88ab3501afb230f37a7
|
|
| BLAKE2b-256 |
a45d60bcdd24a0f51d707ab614c8ef4152333db99ad420f150a3557934ee25bb
|
File details
Details for the file open_responses-0.1.4-py3-none-any.whl.
File metadata
- Download URL: open_responses-0.1.4-py3-none-any.whl
- Upload date:
- Size: 21.1 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.6.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
056260d3882ae0888e35dd287e55580d0042bb148ed2a5e3d5bf7adec7d179c2
|
|
| MD5 |
af599fd41f28544242ce0f0eeb55b389
|
|
| BLAKE2b-256 |
6f3ba0417a9db59219dd62d91280048961c0c9807fffb0b5633b7816c4e586b1
|