Skip to main content

Launch Claude Code through inferroute — typed less, routed smarter

Project description

ir — Claude Code through inferroute

Tiny launcher: one command (ir) spawns claude with the model you pick and your inferroute key, so you don't have to set env vars every time. Your normal claude is never touchedir only configures the subprocess it spawns.

You choose the model per session — there's no auto-routing. Optionally, an on-device recorder logs your choices privately (see Local recording).

Install

pipx install inferroute
ir setup

That's it. ir setup walks you through it: log in with your key (saved to ~/.config/inferroute/credentials, mode 600) and optionally turn on private on-device recording. Update any time with pipx upgrade inferroute.

Sign up for a key at https://inferroute.ai if you don't have one. (Prefer the steps separately? ir login then ir add recording do the same thing.)

Commands

ir setup            first-time onboarding: log in + optionally enable recording
ir                  open the model picker, then launch (same as `ir choose`)
ir --model NAME     pin a model — short alias or canonical id; any claude flag passes through
ir choose           interactive picker — pick a model, then launch
ir anthropic        escape hatch — plain Claude, your own setup
ir status           personal usage view (TUI)
ir login            save / refresh your inferroute API key
ir help             one-screen explainer

ir add recording    turn on private on-device recording (see below)
ir data show        what's been recorded (counts, models, size)
ir data export DIR  copy the metadata layer  ·  ir data wipe  delete it all
ir remove recording stop recording

NAME is a short alias (minimax, minimax-m3, kimi, glm) or any canonical id (e.g. claude-opus-4-8), which passes through verbatim.

Every command bakes in --dangerously-skip-permissions, which is the right default for an agentic workflow. If you want the prompts back, just use plain claude.

What does each command do under the hood?

ir --model NAME      →  ANTHROPIC_BASE_URL=<recorder daemon if running,         \\
                                            else https://api.inferroute.ai>      \\
                        ANTHROPIC_AUTH_TOKEN=$saved_key                          \\
                        exec claude --dangerously-skip-permissions              \\
                                    --model <resolved-id>
ir anthropic         →  exec claude --dangerously-skip-permissions
                        (no env mutation — pure pass-through)

The env vars are scoped to the spawned claude process. Your shell doesn't see them. When the local recorder daemon is running it sits in front of api.inferroute.ai — it records locally, then forwards there (the cloud still does its own backend fallbacks).

Local recording (optional)

ir add recording runs a small daemon on localhost:5005 that records — on this machine only — which model you pick per task and how the turn went, so you (or a future personal router) can learn your preferences. It is never uploaded; we never see it.

  • Default is full — choices, outcomes, and prompt text. The prompt text is what actually lets it learn your preferences, and it never leaves this machine. Pick minimal at install for choices + outcomes only (no prompt text), which is lighter but can't train a personal router.
  • Inspect it any time with ir data show, copy the metadata layer with ir data export DIR (never raw prompt text), or delete everything with ir data wipe.
  • It lives under ~/.inferroute. Remove the daemon with ir remove recording (--purge also deletes the recorded data).

Where your config lives

Path Contents
~/.config/inferroute/credentials Your inferroute API key + base URL (mode 600). Edit by hand or re-run ir login.

Privacy

The CLI talks to:

  • https://api.inferroute.ai for your traffic (everything except ir anthropic)
  • https://inferroute.ai only on ir help — to print the signup URL

It does not phone home, no telemetry. If you enable ir add recording, that data is written only to ~/.inferroute on your machine and is never uploaded — inspect or delete it any time with ir data show / ir data wipe. Whatever your claude --model … sends to whatever upstream is on you.

ir anthropic is literally exec claude plus one flag — the inferroute service sees nothing.

Source

https://github.com/InferRoute/inferroute

Issues / feature requests / PRs welcome.

License

MIT — see LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

inferroute-0.3.3.tar.gz (50.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

inferroute-0.3.3-py3-none-any.whl (60.1 kB view details)

Uploaded Python 3

File details

Details for the file inferroute-0.3.3.tar.gz.

File metadata

  • Download URL: inferroute-0.3.3.tar.gz
  • Upload date:
  • Size: 50.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for inferroute-0.3.3.tar.gz
Algorithm Hash digest
SHA256 ab9955519bfbf030ed8f14c8e73a7eab885cedfe7ee53de6b41767f579358291
MD5 bf0ca3d80daa44ecbbfabf1a0dc26454
BLAKE2b-256 9c488326e892812b88139bc96d64b6f6f9db7a96b2c132e57d7472bdb1a0f371

See more details on using hashes here.

Provenance

The following attestation bundles were made for inferroute-0.3.3.tar.gz:

Publisher: release.yml on InferRoute/inferroute

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file inferroute-0.3.3-py3-none-any.whl.

File metadata

  • Download URL: inferroute-0.3.3-py3-none-any.whl
  • Upload date:
  • Size: 60.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for inferroute-0.3.3-py3-none-any.whl
Algorithm Hash digest
SHA256 269a36f23fbe9161c01a71765112cdece446be2465952e1a8c70e3afa9724cd3
MD5 2273826de4892d10ce00474870aabbf5
BLAKE2b-256 215467f72e883577663ccb1a6054fc71e36c983cbc7978376f21d5dc0ecd4035

See more details on using hashes here.

Provenance

The following attestation bundles were made for inferroute-0.3.3-py3-none-any.whl:

Publisher: release.yml on InferRoute/inferroute

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page