Ollama for free cloud inference. Local OpenAI-compatible gateway routing across OpenRouter, Groq, NVIDIA NIM, Cloudflare Workers AI, HuggingFace, Cerebras, and your own Ollama with automatic failover.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

shaivpidadi

These details have not been verified by PyPI

Project links

Homepage

Project description

FreeRide

Ollama for free cloud inference.

A local OpenAI-compatible gateway that routes across every free-tier provider you have a key for — OpenRouter, Groq, NVIDIA NIM, Cloudflare Workers AI, HuggingFace, Cerebras, and your own Ollama. Hits a rate limit, fails over. Your agent never knows.

Install

macOS / Linux:

curl -sSL https://api.free-ride.xyz/install.sh | sh

Windows (PowerShell):

powershell -ExecutionPolicy ByPass -c "irm https://api.free-ride.xyz/install.ps1 | iex"

Then:

freeride init           # interactive — collects keys, writes ~/.freeride/.env
freeride serve          # gateway listens on localhost:11343

Point any OpenAI-shaped client at http://localhost:11343/v1 with OPENAI_API_KEY=any. That's it.

The installer bootstraps uv if missing, then uv tool installs freeride-gateway. Binary lands at ~/.local/bin/freeride (Linux/macOS) or %USERPROFILE%\.local\bin\freeride.exe (Windows). Same shape as the bun.sh and astral.sh installers.

Or install manually

# uv (what the installer does)
uv tool install --prerelease=allow freeride-gateway

# pipx
pipx install --pip-args=--pre freeride-gateway

# pip + venv (the venv only — re-activate per shell)
python3 -m venv .venv && source .venv/bin/activate
pip install --pre freeride-gateway

# from source
git clone https://github.com/Shaivpidadi/FreeRideV3 && cd FreeRideV3
pip install -e .

PyPI distribution: freeride-gateway. CLI: freeride. Python ≥ 3.10.

Get keys (any one is enough; more = better failover)

Provider	Where	Env var
OpenRouter	https://openrouter.ai/keys	`OPENROUTER_API_KEY`
Groq	https://console.groq.com/keys	`GROQ_API_KEY`
NVIDIA NIM	https://build.nvidia.com	`NVIDIA_API_KEY`
Cloudflare Workers AI	https://dash.cloudflare.com/profile/api-tokens	`CLOUDFLARE_API_TOKEN` + `CLOUDFLARE_ACCOUNT_ID`
HuggingFace	https://huggingface.co/settings/tokens	`HF_TOKEN`
Cerebras	https://cloud.cerebras.ai/platform	`CEREBRAS_API_KEY`
Ollama (local)	https://ollama.com/download	`OLLAMA_BASE_URL=http://localhost:11434`

Set whichever you have, then freeride serve. The gateway picks them up and rotates between them.

Or use the wizard: freeride init writes ~/.freeride/.env for you. The gateway auto-loads that file at startup — no manual source needed.

Wire your agent

The fastest way is a binder:

freeride bind aider       # writes ~/.aider.conf.yml
freeride bind continue    # writes ~/.continue/config.yaml
freeride bind hermes      # writes ~/.hermes/config.yaml
freeride bind openclaw    # writes ~/.openclaw/openclaw.json

Or set the OpenAI vars yourself:

export OPENAI_API_BASE=http://localhost:11343/v1
export OPENAI_API_KEY=any

Anything OpenAI-shaped works. Tested with the openai-python SDK, Aider, Continue, Hermes, OpenClaw.

Multi-key rotation

Got several free keys for the same provider? Pass them as a JSON array:

export OPENROUTER_API_KEY='["sk-or-v1-key1","sk-or-v1-key2","sk-or-v1-key3"]'

When key 1 hits 429 it goes on cooldown for 120s; key 2 takes the next request. Cooldowns persist across restarts (~/.freeride/cooldown.json).

How failover works

Per request, FreeRide walks (provider, key) pairs in order:

RATE_LIMIT or AUTH → mark this key cooling, try the next key.
MODEL_NOT_FOUND → skip this provider, try the next provider.
Anything 5xx-ish → next pair.
First successful response → ship it; stamp X-FreeRide-Provider header (or _freeride_provider field on JSON) so you can tell who actually served it.

Streaming uses buffer-first-chunk failover: hold the first SSE event until upstream confirms the stream is real. If it fails before the first chunk, retry. After the first chunk has shipped, mid-stream errors propagate (rare; documented).

Recommended: run `freeride audit-models` after install

Providers list models they can't always serve. NVIDIA NIM lists Gemma-3-27B but sometimes returns 500. HuggingFace lists models that need PRO credits. The smart-router doesn't know which entries are real until it tries.

freeride audit-models                  # probe every catalog model, ~30s
freeride audit-models --provider groq  # one provider only

This writes ~/.freeride/cache/model_health.json that the smart-router reads at request time, so model: "auto" skips known-broken upstream models without paying a failover-attempt cost. Re-run after big provider changes or if you start seeing surprising 503s.

Stale cache (older than 24h) is auto-refreshed on the next request, but a manual audit-models run is faster than discovering staleness mid-request.

Telemetry

On by default. Hourly POST to https://telemetry.free-ride.xyz/v1/beacon:

{
  "installation_id": "random-uuid-v4",
  "version": "0.3.0",
  "os": "darwin",
  "tokens_served": 412034,
  "request_count": 187,
  "providers_active": ["openrouter", "groq"],
  "uptime_hours": 8
}

Prompts, completions, model IDs, API keys, hostnames, IPs — never sent. The Worker doesn't log cf-connecting-ip. The first time you run any freeride command a banner prints the exact payload.

freeride telemetry off    # turn it off
freeride telemetry        # show what would be sent

Embeddings

Same endpoint shape as OpenAI's /v1/embeddings. Failover across the 4 providers that support embeddings (Groq doesn't):

curl http://localhost:11343/v1/embeddings \
  -H 'Content-Type: application/json' \
  -d '{"model": "text-embedding-3-small", "input": "hello world"}'

The same X-FreeRide-Provider header tells you which provider served the embedding. Same multi-key rotation, same per-provider failover.

See what FreeRide is doing

freeride watch

Tails live failover events from a running gateway. Every request, every provider attempt, every rate-limit, every retry. Useful for seeing failover happen in real time, debugging "is my agent actually using FreeRide", or just demoing.

[14:23:01.412] req_a3f8e2c1  ▶ request model=openrouter/free stream
[14:23:01.421] req_a3f8e2c1  → openrouter[k0] openrouter/free
[14:23:01.833] req_a3f8e2c1  ← openrouter[k0] 412ms RATE_LIMIT ✗ (retry-after 47s)
[14:23:01.835] req_a3f8e2c1  → groq[k0] openrouter/free
[14:23:02.153] req_a3f8e2c1  ← groq[k0] 318ms OK ✓ first-chunk
[14:23:02.154] req_a3f8e2c1  ■ complete via groq

Events are written to ~/.freeride/events.jsonl. Opt out with FREERIDE_EVENTS=0 if you don't want them. File caps at 1 MiB with single-backup rotation.

Commands

freeride serve                  start the gateway
freeride bind <agent>           write gateway URL into agent config
freeride watch                  tail live failover events
freeride bench                  per-provider latency comparison (needs serve running)
freeride reload                 refresh provider registry from env vars (no restart)
freeride providers              live provider health from a running gateway
freeride doctor                 diagnose common setup issues (env vars, PATH, port)
freeride upgrade                bump installed package to latest PyPI release
freeride init                   interactive setup wizard — prompts for keys, writes ~/.freeride/.env
freeride keys                   show which provider keys are available vs cooling
freeride telemetry [on|off]     manage telemetry
freeride list                   list available free models
freeride status                 show OpenClaw config + cache age (v2)
freeride auto                   auto-configure OpenClaw (v2)
freeride rotate                 swap primary if it fails (v2)
freeride-watcher                background daemon that rotates on failure

freeride bench example output:

$ freeride bench
Benchmarking 5 providers, 3 requests each via http://localhost:11343/v1...

provider              ok    p50      p95      tok/s
─────────────────────────────────────────────────────
groq                  3/3   142ms    287ms    98
cloudflare_wai        3/3   284ms    410ms    81
nvidia_nim            3/3   389ms    502ms    72
openrouter            3/3   412ms    721ms    63
huggingface           2/3   612ms    1840ms   41

Fastest: groq (142ms p50)

The v2 commands keep working for existing OpenClaw users.

Providers

Provider	Status	Notes
OpenRouter	shipped	full surface — chat, streaming, tools, vision, structured outputs
NVIDIA NIM	shipped	curated free-model allowlist; `NVIDIA_NIM_FREE_MODELS_OVERRIDE` to expand
Groq	shipped	hardcoded allowlist (Llama 3.x, Gemma 2, Mixtral, DeepSeek-R1-distill); `GROQ_FREE_MODELS_OVERRIDE` to expand
Cloudflare Workers AI	shipped	curated allowlist of cheap-per-neuron chat models; needs `CLOUDFLARE_ACCOUNT_ID`
HuggingFace Inference	shipped	full HF router catalog; budget governs access ($0.10/mo Free, $2/mo PRO)
Cerebras	shipped	fastest Llama / Qwen inference; chat-only (no embeddings). `CEREBRAS_FREE_MODELS_OVERRIDE` to restrict catalog.
Ollama (local)	shipped	local-only; mix with remote providers in the same failover chain. Set `OLLAMA_BASE_URL` to opt in.

Adding a sixth: implement freeride.core.provider.Provider (api_version=1) in freeride/providers/<name>.py, register it in the conformance suite, done. See CONTRIBUTING.md.

Agents

Agent	`freeride bind`	Hot reload
OpenClaw	yes	needs restart
Aider	yes (`--scope home/cwd/git`)	needs restart
Continue	yes	yes
Hermes (NousResearch/hermes-agent)	yes	needs restart

Or anything else: OPENAI_API_BASE=http://localhost:11343/v1 + OPENAI_API_KEY=any.

Claude Code

Two ways FreeRide plays with Claude Code:

1. `freeride run claude` — companion mode (the main path)

freeride run claude

Wraps a Claude Code session so free providers are available alongside your subscription. Your Pro/Max OAuth (or ANTHROPIC_API_KEY) is preserved. Inside the session, flip per request via /model:

You type	What happens
`/model claude-opus-4-7`	Your subscription answers (passthrough to `api.anthropic.com`).
`/model freeride/free`	Free provider answers via smart-routing.
`/model freeride/fast`	Free, prefers groq (low TTFT).
`/model freeride/quality`	Free, prefers OpenRouter (widest catalog).
`/model freeride/coding`	Free, prefers code-tuned models (Qwen-Coder, DeepSeek).

Plain claude (no wrapper) goes direct to Anthropic — FreeRide is invisible. The wrapper sets ANTHROPIC_BASE_URL for the child process only; nothing system-wide changes.

Probe the setup: freeride doctor --claude-code.

Full guide: docs/claude-code.md.

2. Skill / plugin install (in-Claude awareness)

If you want Claude itself to know about FreeRide (detect it running, suggest the wrapper, help troubleshoot):

/plugin install https://github.com/Shaivpidadi/FreeRideV3

See skills/README.md for manual-install instructions.

Docs

docs/providers/SURVEY.md — Provider Protocol fit per provider (auth shape, free-tier semantics, error mapping)
docs/providers/nvidia_nim.md — NVIDIA NIM specifics (free-model allowlist, 403=AUTH quirk)
docs/agent-binders.md — per-agent bind reference (config locations, hot-reload behavior, edge cases)
docs/hermes.md — Hermes identification + bind plan
CONTRIBUTING.md — adding a provider or binder

License

MIT.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

shaivpidadi

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.4.0a20 pre-release

May 29, 2026

0.4.0a19 pre-release

May 29, 2026

0.4.0a18 pre-release

May 14, 2026

0.4.0a17 pre-release

May 14, 2026

0.4.0a16 pre-release

May 14, 2026

0.4.0a15 pre-release

May 13, 2026

0.4.0a14 pre-release

May 12, 2026

0.4.0a13 pre-release

May 12, 2026

0.4.0a12 pre-release

May 12, 2026

0.4.0a11 pre-release

May 12, 2026

0.4.0a10 pre-release

May 12, 2026

0.4.0a9 pre-release

May 12, 2026

This version

0.4.0a8 pre-release

May 12, 2026

0.4.0a7 pre-release

May 12, 2026

0.4.0a6 pre-release

May 12, 2026

0.4.0a5 pre-release

May 12, 2026

0.4.0a4 pre-release

May 10, 2026

0.4.0a3 pre-release

May 9, 2026

0.4.0a2 pre-release

May 8, 2026

0.4.0a1 pre-release

May 7, 2026

0.3.0a8 pre-release

May 7, 2026

0.3.0a7 pre-release

May 7, 2026

0.3.0a6 pre-release

May 7, 2026

0.3.0a5 pre-release

May 7, 2026

0.3.0a4 pre-release

May 7, 2026

0.3.0a3 pre-release

May 7, 2026

0.3.0a2 pre-release

May 7, 2026

0.3.0a1 pre-release

May 7, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

freeride_gateway-0.4.0a8.tar.gz (297.2 kB view details)

Uploaded May 12, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

freeride_gateway-0.4.0a8-py3-none-any.whl (170.1 kB view details)

Uploaded May 12, 2026 Python 3

File details

Details for the file freeride_gateway-0.4.0a8.tar.gz.

File metadata

Download URL: freeride_gateway-0.4.0a8.tar.gz
Upload date: May 12, 2026
Size: 297.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for freeride_gateway-0.4.0a8.tar.gz
Algorithm	Hash digest
SHA256	`0819f350c14788f27afffb7ef1fc5d588960d9207efc058461feec9c81f75b4e`
MD5	`952045cf523664fae49a8d6933709198`
BLAKE2b-256	`0f6233858529ac74f3991dea0e59779e2ece8fb28fb7a1e7be8ce0b6bfeb8bff`

See more details on using hashes here.

Provenance

The following attestation bundles were made for freeride_gateway-0.4.0a8.tar.gz:

Publisher: release.yml on Shaivpidadi/FreeRideV3

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: freeride_gateway-0.4.0a8.tar.gz
- Subject digest: 0819f350c14788f27afffb7ef1fc5d588960d9207efc058461feec9c81f75b4e
- Sigstore transparency entry: 1519993979
- Sigstore integration time: May 12, 2026
Source repository:
- Permalink: Shaivpidadi/FreeRideV3@03d6529a78fff59aa2568d22ca145ab6269e7e95
- Branch / Tag: refs/tags/v0.4.0a8
- Owner: https://github.com/Shaivpidadi
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@03d6529a78fff59aa2568d22ca145ab6269e7e95
- Trigger Event: push

File details

Details for the file freeride_gateway-0.4.0a8-py3-none-any.whl.

File metadata

Download URL: freeride_gateway-0.4.0a8-py3-none-any.whl
Upload date: May 12, 2026
Size: 170.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for freeride_gateway-0.4.0a8-py3-none-any.whl
Algorithm	Hash digest
SHA256	`baae2b59ac56f6e301a3d38c3f50a42ed39dc96d07a9976c68b8a7ce78fa484a`
MD5	`576193135669e8aca45e1a91a0be7754`
BLAKE2b-256	`414725f22c83d68dd791e09f829d0b3c1a1ef40fa7450195f115f179044708f1`

See more details on using hashes here.

Provenance

The following attestation bundles were made for freeride_gateway-0.4.0a8-py3-none-any.whl:

Publisher: release.yml on Shaivpidadi/FreeRideV3

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: freeride_gateway-0.4.0a8-py3-none-any.whl
- Subject digest: baae2b59ac56f6e301a3d38c3f50a42ed39dc96d07a9976c68b8a7ce78fa484a
- Sigstore transparency entry: 1519993996
- Sigstore integration time: May 12, 2026
Source repository:
- Permalink: Shaivpidadi/FreeRideV3@03d6529a78fff59aa2568d22ca145ab6269e7e95
- Branch / Tag: refs/tags/v0.4.0a8
- Owner: https://github.com/Shaivpidadi
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@03d6529a78fff59aa2568d22ca145ab6269e7e95
- Trigger Event: push

freeride-gateway 0.4.0a8

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

FreeRide

Install

Get keys (any one is enough; more = better failover)

Wire your agent

Multi-key rotation

How failover works

Recommended: run freeride audit-models after install

Telemetry

Embeddings

See what FreeRide is doing

Commands

Providers

Agents

Claude Code

1. freeride run claude — companion mode (the main path)

2. Skill / plugin install (in-Claude awareness)

Docs

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

Recommended: run `freeride audit-models` after install

1. `freeride run claude` — companion mode (the main path)