Lightweight Anthropic Messages API proxy with LiteLLM-style config — load your middleware from any project

These details have not been verified by PyPI

Project description

litelitellm

A small, easy-to-run proxy for LiteLLM local development. No LiteLLM stack, no database — just a small FastAPI server that reads a LiteLLM-style config and loads your project’s middleware from the current directory using the same callbacks that LiteLLM provides. Optional Langfuse observability is built in.

Use it when you want the same callback/middleware contract as LiteLLM without running the full proxy. Run it from any repo via uvx.

Quick start

From a project that has a config and middleware:

In your project, add a config file (same format as LiteLLM):

config.yaml (or proxy_config.yaml, or set LITELITELLM_CONFIG / LITELLM_CONFIG_PATH):
```
litellm_settings:
  callbacks: ["my_middleware_loader"]
```
Ensure your project has a Python file my_middleware_loader.py (or whatever name you put in callbacks) that exports an object with an async_pre_call_hook method (LiteLLM CustomLogger-style).
From your project directory:
```
uvx litelitellm
```
Set ANTHROPIC_API_KEY (env or .env). Then point your client at the proxy, e.g.:
```
ANTHROPIC_BASE_URL=http://localhost:4000 claude
```

From this repo (no middleware):

uv sync
cp .env.example .env   # set ANTHROPIC_API_KEY
uv run python -m litelitellm

Runs as passthrough only (no config callbacks in this repo).

Local testing (before PyPI): From another folder you can run the GitHub version with uvx --from git+https://github.com/elliottcarlson/litelitellm litelitellm. For editable local dev, run uv tool install --editable . from this repo, then run litelitellm from any directory; uninstall with uv tool uninstall litelitellm.

Example: instruction-injection middleware

Here’s a minimal project that injects an instruction so the model always responds like a pirate.

1. Project layout

my_project/
  config.yaml
  pirate_middleware.py
  .env          # ANTHROPIC_API_KEY=sk-ant-...

2. config.yaml

litellm_settings:
  callbacks: ["pirate_middleware"]

3. pirate_middleware.py

The middleware must expose an object with async_pre_call_hook(user_api_key_dict, cache, data, call_type). It receives the request data (e.g. model, messages, system, max_tokens) and returns the modified body.

class PirateMiddleware:
    async def async_pre_call_hook(self, user_api_key_dict, cache, data, call_type):
        # Inject a system instruction so the model responds like a pirate
        prefix = "You are a pirate. Respond only in pirate speak, with 'arr' and nautical terms. "
        existing = data.get("system")
        if existing is None:
            data["system"] = prefix.strip()
        elif isinstance(existing, str):
            data["system"] = prefix + existing
        else:
            # Anthropic can send system as a list of content blocks
            data["system"] = [{"type": "text", "text": prefix}] + list(existing)
        return data

middleware = PirateMiddleware()

4. Run

cd my_project
uvx litelitellm

Then point your client at http://localhost:4000 (e.g. ANTHROPIC_BASE_URL=http://localhost:4000). Every request will get the pirate instruction applied before being sent to the API.

Config

Config is read from the directory you run from (your project). This repo does not ship a config.

Config path: LITELITELLM_CONFIG or LITELLM_CONFIG_PATH, then ./config.yaml, ./config.yml, ./proxy_config.yaml, or ./litellm_config.yaml.
Callbacks: litellm_settings.callbacks — list of strings. Each entry is a module name (e.g. my_middleware_loader) or module.attribute. The module is loaded from the directory containing the config file (project root when you run from the project). The first callback that has async_pre_call_hook is used as the middleware.

Environment

Variable	Default	Description
`ANTHROPIC_API_KEY`	(none)	Used for outbound requests when middleware modifies the body.
`LITELITELLM_PORT`	`4000`	Port the proxy listens on.
`ANTHROPIC_API_URL`	`https://api.anthropic.com`	Upstream API.
`LANGFUSE_PUBLIC_KEY`	(none)	With `LANGFUSE_SECRET_KEY`, send traces to Langfuse.
`LANGFUSE_SECRET_KEY`	(none)	Langfuse secret key.
`LANGFUSE_BASE_URL`	`https://cloud.langfuse.com`	Langfuse server (e.g. `http://localhost:3000` for self-hosted).

Observability

One JSON line per request to stdout (latency, tokens, etc.).
Optional Langfuse: set LANGFUSE_PUBLIC_KEY and LANGFUSE_SECRET_KEY (env or .env); if both are set, the proxy sends traces to Langfuse. No extra install. See LANGFUSE.md for setup with Langfuse Cloud or a self-hosted instance.

Publishing (maintainers)

Manual: Bump version in pyproject.toml, then run uv build and uv publish (set UV_PUBLISH_TOKEN or use uv publish and enter token when prompted).
GitHub Action: The Publish to PyPI workflow is manually triggerable (Actions → Publish to PyPI → Run workflow). Add a repository secret PYPI_API_TOKEN with your PyPI API token (pypi.org → Account → API tokens).

License

MIT. See LICENSE.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.3

Feb 9, 2026

0.1.2

Feb 9, 2026

This version

0.1.1

Feb 7, 2026

0.1.0

Feb 7, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

litelitellm-0.1.1.tar.gz (12.3 kB view details)

Uploaded Feb 7, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

litelitellm-0.1.1-py3-none-any.whl (14.6 kB view details)

Uploaded Feb 7, 2026 Python 3

File details

Details for the file litelitellm-0.1.1.tar.gz.

File metadata

Download URL: litelitellm-0.1.1.tar.gz
Upload date: Feb 7, 2026
Size: 12.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.10.0 {"installer":{"name":"uv","version":"0.10.0","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for litelitellm-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`9a7eafb8e4875b6c5f11f93abce7dd3e424ade26b8eeb381f654cbd1bbbe04c1`
MD5	`7b2ddc1e26f1ca1f95a6965c8641bb7a`
BLAKE2b-256	`2afc465a9c0dd00b2de95bb2ee242af8091bd93b296a6e9deefd94eaed414a48`

See more details on using hashes here.

File details

Details for the file litelitellm-0.1.1-py3-none-any.whl.

File metadata

Download URL: litelitellm-0.1.1-py3-none-any.whl
Upload date: Feb 7, 2026
Size: 14.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.10.0 {"installer":{"name":"uv","version":"0.10.0","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for litelitellm-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`604e41fefdb4090a837bed4afe3bae4fe09cf85babe122707f65aff2988de2e7`
MD5	`52e0d05b2899bb5d600e92faaa9334cf`
BLAKE2b-256	`79a25183ceac303c6aa57960922eabb0c1bc39fc716913e7ad6299a8fe2d5135`

See more details on using hashes here.

litelitellm 0.1.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

litelitellm

Quick start

Example: instruction-injection middleware

Config

Environment

Observability

Publishing (maintainers)

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes