Skip to main content

Launch Claude Code through inferroute — typed less, routed smarter

Project description

ir — Claude Code through inferroute

Tiny launcher: one command (ir) spawns claude with the model you pick and your inferroute key, so you don't have to set env vars every time. Your normal claude is never touchedir only configures the subprocess it spawns.

You choose the model per session — there's no auto-routing. Optionally, an on-device recorder logs your choices privately (see Local recording).

Install

pipx install inferroute
ir setup

That's it. ir setup logs you in (key saved to ~/.config/inferroute/credentials, mode 600) and enables private on-device recording — full by default, since it never leaves your machine; change it with ir add recording --level minimal or turn it off with ir remove recording. Update any time with pipx upgrade inferroute.

Sign up for a key at https://inferroute.ai if you don't have one. (Prefer the steps separately? ir login then ir add recording do the same thing.)

Commands

ir setup            first-time onboarding: log in + optionally enable recording
ir                  open the model picker, then launch (same as `ir choose`)
ir --model NAME     pin a model — short alias or canonical id; any claude flag passes through
ir choose           interactive picker — pick a model, then launch
ir anthropic        escape hatch — plain Claude, your own setup
ir status           personal usage view (TUI)
ir login            save / refresh your inferroute API key
ir help             one-screen explainer

ir add recording    turn on private on-device recording (see below)
ir data show        what's been recorded (counts, models, size)
ir data export DIR  copy the metadata layer  ·  ir data wipe  delete it all
ir remove recording stop recording

NAME is a short alias (minimax, minimax-m3, kimi, glm) or any canonical id (e.g. claude-opus-4-8), which passes through verbatim.

Every command bakes in --dangerously-skip-permissions, which is the right default for an agentic workflow. If you want the prompts back, just use plain claude.

What does each command do under the hood?

ir --model NAME      →  ANTHROPIC_BASE_URL=<recorder daemon if running,         \\
                                            else https://api.inferroute.ai>      \\
                        ANTHROPIC_AUTH_TOKEN=$saved_key                          \\
                        exec claude --dangerously-skip-permissions              \\
                                    --model <resolved-id>
ir anthropic         →  exec claude --dangerously-skip-permissions
                        (no env mutation — pure pass-through)

The env vars are scoped to the spawned claude process. Your shell doesn't see them. When the local recorder daemon is running it sits in front of api.inferroute.ai — it records locally, then forwards there (the cloud still does its own backend fallbacks).

Local recording (optional)

ir add recording runs a small daemon on localhost:5005 that records — on this machine only — which model you pick per task and how the turn went, so you (or a future personal router) can learn your preferences. It is never uploaded; we never see it.

  • Default is full — choices, outcomes, and prompt text. The prompt text is what actually lets it learn your preferences, and it never leaves this machine. Pick minimal at install for choices + outcomes only (no prompt text), which is lighter but can't train a personal router.
  • Inspect it any time with ir data show, copy the metadata layer with ir data export DIR (never raw prompt text), or delete everything with ir data wipe.
  • It lives under ~/.inferroute. Remove the daemon with ir remove recording (--purge also deletes the recorded data).

Where your config lives

Path Contents
~/.config/inferroute/credentials Your inferroute API key + base URL (mode 600). Edit by hand or re-run ir login.

Privacy

The CLI talks to:

  • https://api.inferroute.ai for your traffic (everything except ir anthropic)
  • https://inferroute.ai only on ir help — to print the signup URL

It does not phone home, no telemetry. If you enable ir add recording, that data is written only to ~/.inferroute on your machine and is never uploaded — inspect or delete it any time with ir data show / ir data wipe. Whatever your claude --model … sends to whatever upstream is on you.

ir anthropic is literally exec claude plus one flag — the inferroute service sees nothing.

Source

https://github.com/InferRoute/inferroute

Issues / feature requests / PRs welcome.

License

MIT — see LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

inferroute-0.3.4.tar.gz (50.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

inferroute-0.3.4-py3-none-any.whl (60.3 kB view details)

Uploaded Python 3

File details

Details for the file inferroute-0.3.4.tar.gz.

File metadata

  • Download URL: inferroute-0.3.4.tar.gz
  • Upload date:
  • Size: 50.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for inferroute-0.3.4.tar.gz
Algorithm Hash digest
SHA256 24daba8fe93710af1d037609b0f6aced127945b2e571fca6ab2ebf7dfa4a5ce2
MD5 d45b4a39ab61fb73cb46383d7348145b
BLAKE2b-256 6a913aaee19d04f9ffa91c5be40d129c24b50bd52c0e30180df42f65f1c00e36

See more details on using hashes here.

Provenance

The following attestation bundles were made for inferroute-0.3.4.tar.gz:

Publisher: release.yml on InferRoute/inferroute

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file inferroute-0.3.4-py3-none-any.whl.

File metadata

  • Download URL: inferroute-0.3.4-py3-none-any.whl
  • Upload date:
  • Size: 60.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for inferroute-0.3.4-py3-none-any.whl
Algorithm Hash digest
SHA256 ff6ffb31f561a0eec79fe44e555fefdeb1566ae7872e2ca4f07ac399b22d3bea
MD5 a0e4e2af0e5483cc357d64a16056202a
BLAKE2b-256 b2af0c19a452b5e6aaa2922af13d229a0ad57a8eec564d900ff895a926413e10

See more details on using hashes here.

Provenance

The following attestation bundles were made for inferroute-0.3.4-py3-none-any.whl:

Publisher: release.yml on InferRoute/inferroute

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page