A lightweight proxy server that converts Anthropic Messages API to OpenAI API

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

dongfangzan

These details have not been verified by PyPI

Project description

local-openai2anthropic

English | 中文

A lightweight proxy that lets applications built with Claude SDK talk to locally-hosted OpenAI-compatible LLMs.

What Problem This Solves

Many local LLM tools (vLLM, SGLang, etc.) provide an OpenAI-compatible API. But if you've built your app using Anthropic's Claude SDK, you can't use them directly.

This proxy translates Claude SDK calls to OpenAI API format in real-time, enabling:

Local LLM inference with Claude-based apps
Offline development without cloud API costs
Privacy-first AI - data never leaves your machine
Seamless model switching between cloud and local

Supported Local Backends

Currently tested and supported:

Backend	Description	Status
vLLM	High-throughput LLM inference	✅ Fully supported
SGLang	Fast structured language model serving	✅ Fully supported

Other OpenAI-compatible backends may work but are not fully tested.

Quick Start

1. Install

pip install local-openai2anthropic

2. Start Your Local LLM Server

Example with vLLM:

vllm serve meta-llama/Llama-2-7b-chat-hf
# vLLM starts OpenAI-compatible API at http://localhost:8000/v1

Or with SGLang:

sglang launch --model-path meta-llama/Llama-2-7b-chat-hf --port 8000
# SGLang starts at http://localhost:8000/v1

3. Start the Proxy

Option A: Run in background (recommended)

export OA2A_OPENAI_BASE_URL=http://localhost:8000/v1  # Your local LLM endpoint
export OA2A_OPENAI_API_KEY=dummy  # Any value, not used by local backends

oa2a start              # Start server in background
# Server starts at http://localhost:8080

# View logs
oa2a logs               # Show last 50 lines of logs
oa2a logs -f            # Follow logs in real-time (Ctrl+C to exit)

# Check status
oa2a status             # Check if server is running

# Stop server
oa2a stop               # Stop background server

# Restart server
oa2a restart            # Restart with same settings

Option B: Run in foreground

export OA2A_OPENAI_BASE_URL=http://localhost:8000/v1
export OA2A_OPENAI_API_KEY=dummy

oa2a                    # Run server in foreground (blocking)
# Press Ctrl+C to stop

4. Use in Your App

import anthropic

client = anthropic.Anthropic(
    base_url="http://localhost:8080",  # Point to proxy
    api_key="dummy-key",  # Not used
)

message = client.messages.create(
    model="meta-llama/Llama-2-7b-chat-hf",  # Your local model name
    max_tokens=1024,
    messages=[{"role": "user", "content": "Hello!"}],
)

print(message.content[0].text)

Using with Claude Code

You can configure Claude Code to use your local LLM through this proxy.

Configuration Steps

Create or edit Claude Code config file at ~/.claude/CLAUDE.md:

# Claude Code Configuration

## API Settings

- Claude API Base URL: http://localhost:8080
- Claude API Key: dummy-key

## Model Settings

Use model: meta-llama/Llama-2-7b-chat-hf  # Your local model name

Alternatively, set environment variables before running Claude Code:

export ANTHROPIC_BASE_URL=http://localhost:8080
export ANTHROPIC_API_KEY=dummy-key

claude

Or use the --api-key and --base-url flags:

claude --api-key dummy-key --base-url http://localhost:8080

Complete Workflow Example

Terminal 1 - Start your local LLM:

vllm serve meta-llama/Llama-2-7b-chat-hf

Terminal 2 - Start the proxy:

export OA2A_OPENAI_BASE_URL=http://localhost:8000/v1
export OA2A_OPENAI_API_KEY=dummy
export OA2A_TAVILY_API_KEY="tvly-your-tavily-api-key"  # Optional: enable web search

oa2a

Terminal 3 - Launch Claude Code with local LLM:

export ANTHROPIC_BASE_URL=http://localhost:8080
export ANTHROPIC_API_KEY=dummy-key

claude

Now Claude Code will use your local LLM instead of the cloud API.

Features

✅ Streaming responses - Real-time token streaming via SSE
✅ Tool calling - Local LLM function calling support
✅ Vision models - Multi-modal input for vision-capable models
✅ Web Search - Give your local LLM internet access (see below)
✅ Thinking mode - Supports reasoning/thinking model outputs

Web Search Capability 🔍

Bridge the gap: Give your local LLM the web search power that Claude Code users enjoy!

When using locally-hosted models with Claude Code, you lose access to the built-in web search tool. This proxy fills that gap by providing a server-side web search implementation powered by Tavily.

The Problem

Scenario	Web Search Available?
Using Claude (cloud) in Claude Code	✅ Built-in
Using local vLLM/SGLang in Claude Code	❌ Not available
Using this proxy + local LLM	✅ Enabled via Tavily

How It Works

Claude Code → Anthropic SDK → This Proxy → Local LLM
                                      ↓
                                 Tavily API (Web Search)

The proxy intercepts web_search_20250305 tool calls and handles them directly, regardless of whether your local model supports web search natively.

Setup Tavily Search

Get a free API key at tavily.com - generous free tier available
Configure the proxy:

export OA2A_OPENAI_BASE_URL=http://localhost:8000/v1
export OA2A_OPENAI_API_KEY=dummy
export OA2A_TAVILY_API_KEY="tvly-your-tavily-api-key"  # Enable web search

oa2a

Use in your app:

import anthropic

client = anthropic.Anthropic(
    base_url="http://localhost:8080",
    api_key="dummy-key",
)

message = client.messages.create(
    model="meta-llama/Llama-2-7b-chat-hf",
    max_tokens=1024,
    tools=[
        {
            "name": "web_search_20250305",
            "description": "Search the web for current information",
            "input_schema": {
                "type": "object",
                "properties": {
                    "query": {"type": "string", "description": "Search query"},
                },
                "required": ["query"],
            },
        }
    ],
    messages=[{"role": "user", "content": "What happened in AI today?"}],
)

if message.stop_reason == "tool_use":
    tool_use = message.content[-1]
    print(f"Searching: {tool_use.input}")
    # The proxy automatically calls Tavily and returns results

Tavily Configuration Options

Variable	Default	Description
`OA2A_TAVILY_API_KEY`	-	Your Tavily API key (get free at tavily.com)
`OA2A_TAVILY_MAX_RESULTS`	5	Number of search results to return
`OA2A_TAVILY_TIMEOUT`	30	Search timeout in seconds
`OA2A_WEBSEARCH_MAX_USES`	5	Max search calls per request

Configuration

Variable	Required	Default	Description
`OA2A_OPENAI_BASE_URL`	✅	-	Your local LLM's OpenAI-compatible endpoint
`OA2A_OPENAI_API_KEY`	✅	-	Any value (local backends usually ignore this)
`OA2A_PORT`	❌	8080	Proxy server port
`OA2A_HOST`	❌	0.0.0.0	Proxy server host
`OA2A_TAVILY_API_KEY`	❌	-	Enable web search (tavily.com)

Architecture

Your App (Claude SDK)
         │
         ▼
┌─────────────────────┐
│  local-openai2anthropic  │  ← This proxy
│  (Port 8080)        │
└─────────────────────┘
         │
         ▼
Your Local LLM Server
(vLLM / SGLang)
(OpenAI-compatible API)

Development

git clone https://github.com/dongfangzan/local-openai2anthropic.git
cd local-openai2anthropic
pip install -e ".[dev]"

pytest

License

Apache License 2.0

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

dongfangzan

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.6.6

May 6, 2026

0.6.5

May 6, 2026

0.6.4

May 6, 2026

0.6.3

Apr 21, 2026

0.6.2

Apr 15, 2026

0.6.1

Mar 18, 2026

0.6.0

Mar 16, 2026

0.5.9

Feb 28, 2026

0.5.8

Feb 26, 2026

0.5.7

Feb 26, 2026

0.5.6

Feb 23, 2026

0.5.5

Feb 23, 2026

0.5.4

Feb 23, 2026

0.5.3

Feb 23, 2026

0.5.2

Feb 23, 2026

0.5.0

Feb 23, 2026

0.3.11

Feb 14, 2026

0.3.10

Feb 14, 2026

0.3.9

Feb 14, 2026

0.3.8

Feb 3, 2026

0.3.7

Feb 3, 2026

0.3.6

Feb 3, 2026

0.3.5

Feb 3, 2026

0.3.4

Feb 2, 2026

0.3.3

Feb 2, 2026

0.3.2

Feb 2, 2026

0.3.1

Feb 2, 2026

0.3.0

Feb 2, 2026

0.2.9

Feb 2, 2026

0.2.8

Feb 2, 2026

0.2.7

Feb 2, 2026

0.2.5

Jan 29, 2026

0.2.4

Jan 29, 2026

0.2.3

Jan 29, 2026

0.2.2

Jan 29, 2026

This version

0.2.0

Jan 29, 2026

0.1.1

Jan 29, 2026

0.1.0

Jan 29, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

local_openai2anthropic-0.2.0.tar.gz (42.6 kB view details)

Uploaded Jan 29, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

local_openai2anthropic-0.2.0-py3-none-any.whl (37.5 kB view details)

Uploaded Jan 29, 2026 Python 3

File details

Details for the file local_openai2anthropic-0.2.0.tar.gz.

File metadata

Download URL: local_openai2anthropic-0.2.0.tar.gz
Upload date: Jan 29, 2026
Size: 42.6 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for local_openai2anthropic-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`72a0ae87fe3a53746fa17f87b5dc16e3d1b45d9aeb3f8684baaf48e0ec0a10e1`
MD5	`27eb6bc800d0ce53f3e3d367171b109b`
BLAKE2b-256	`0867ef79cd269c19fdfa4548ac1e7b26f28f59d9ade4b6984332d127f82f5cd3`

See more details on using hashes here.

Provenance

The following attestation bundles were made for local_openai2anthropic-0.2.0.tar.gz:

Publisher: publish.yml on dongfangzan/local-openai2anthropic

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: local_openai2anthropic-0.2.0.tar.gz
- Subject digest: 72a0ae87fe3a53746fa17f87b5dc16e3d1b45d9aeb3f8684baaf48e0ec0a10e1
- Sigstore transparency entry: 869776266
- Sigstore integration time: Jan 29, 2026
Source repository:
- Permalink: dongfangzan/local-openai2anthropic@f44f18ef97ea14c26a91cd1e9ad907e49e217f7b
- Branch / Tag: refs/tags/v0.2.0
- Owner: https://github.com/dongfangzan
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@f44f18ef97ea14c26a91cd1e9ad907e49e217f7b
- Trigger Event: push

File details

Details for the file local_openai2anthropic-0.2.0-py3-none-any.whl.

File metadata

Download URL: local_openai2anthropic-0.2.0-py3-none-any.whl
Upload date: Jan 29, 2026
Size: 37.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for local_openai2anthropic-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`dc678a0039d99d3a6bcc10a25cfd33b61856b7e0a3ca6740678dbae9f5924efe`
MD5	`0dd71a7d6d95c46c23ec68afdeff4c1e`
BLAKE2b-256	`bc33bfdf5f156e1654fd6532f62605d65bb623afa1cca68aef3932b3b6bc9fdd`

See more details on using hashes here.

Provenance

The following attestation bundles were made for local_openai2anthropic-0.2.0-py3-none-any.whl:

Publisher: publish.yml on dongfangzan/local-openai2anthropic

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: local_openai2anthropic-0.2.0-py3-none-any.whl
- Subject digest: dc678a0039d99d3a6bcc10a25cfd33b61856b7e0a3ca6740678dbae9f5924efe
- Sigstore transparency entry: 869776287
- Sigstore integration time: Jan 29, 2026
Source repository:
- Permalink: dongfangzan/local-openai2anthropic@f44f18ef97ea14c26a91cd1e9ad907e49e217f7b
- Branch / Tag: refs/tags/v0.2.0
- Owner: https://github.com/dongfangzan
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@f44f18ef97ea14c26a91cd1e9ad907e49e217f7b
- Trigger Event: push

local-openai2anthropic 0.2.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

local-openai2anthropic

What Problem This Solves

Supported Local Backends

Quick Start

1. Install

2. Start Your Local LLM Server

3. Start the Proxy

4. Use in Your App

Using with Claude Code

Configuration Steps

Complete Workflow Example

Features

Web Search Capability 🔍

The Problem

How It Works

Setup Tavily Search

Tavily Configuration Options

Configuration

Architecture

Development

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance