Skip to main content

Moon for Claude: run Claude Code on external LLMs via LiteLLM

Project description

MoonClaude 🌙

Moon for Claude: Run Claude Code fully autonomously on external LLMs (e.g. Qwen, Gemini, MiniMax, OpenAI) through a LiteLLM proxy, while enjoying zero-downtime hot reloading and an Antigravity-grade persistent AI memory engine.

🌐 Visit the Official Landing Page for interactive setup guides and documentation.


🚀 Features

  • Lean & Stable Routing: MoonClaude now uses a "Lazy Injection" strategy—it only registers the models you select. This prevents rate-limit spikes and "no deployments available" errors by keeping the LiteLLM router lean and stable.
  • True Zero-Downtime Hot Reloading: Press Alt+M directly in the proxy terminal to instantaneously swap the active model. MoonClaude dynamically intercepts the routing under the hood—no proxy restarts, no lost context in the CLI, no downtime.
  • Antigravity-Grade Project Memory:
    • Auto-discovers and enforces MOONCLAUDE.md global project instructions into Claude Code's system prompts.
    • Generates cross-session, highly structured summarization context representing your recent AI coding sessions, intelligently managing your AI's working context without maxing out open files.
    • Easily browse transcripts natively with moon history.

📦 Installation & Setup

1. Prerequisites

If you haven't already, you need Node.js installed to run Anthropic's official Claude Code CLI.

# Install Claude Code globally via npm
npm install -g @anthropic-ai/claude-code

2. Install MoonClaude

MoonClaude is available directly on PyPI!

pip install moonclaude

3. Initialize Settings

Run the interactive setup wizard to map your API keys (OpenRouter, Gemini, Groq) and select your preferred chat models.

moon setup

🏃 Quick Start (Dual-Terminal Mode)

MoonClaude works best when you can see the API logs in real-time. We've automated this into a seamless Dual-Terminal workflow.

The Recommended Path: Simply run the following command in your project directory:

moon launch

What happens next?

  1. This terminal transforms into the MoonClaude Proxy log viewer. You'll see every API call, token count, and latency metric here. You can also press Alt+M here to swap models instantly.
  2. A new terminal window will automatically pop up and launch claude (Claude Code) linked to the proxy.

You're now ready to code with full visibility!


🎛️ Core Commands

moon setup                  Interactive wizard for API keys and preferred models
moon launch                 Quick launch (starts proxy in background, then launches Claude)
moon chat                   Launch Claude connected to the proxy
moon start                  Start LiteLLM proxy in the foreground with live token monitoring
moon switch                 Fast CLI model switcher 
moon history                Interactive rich browser for your past conversation logs
moon memory                 Detailed control over the project memory context engine
moon status                 Display system health, active models, and paths
moon                        Show CLI help and commands

Compatibility: MoonClaude preserves the old claude-ext ... legacy command aliases. Your legacy ~/.claude-ext config will automatically migrate to ~/.moonclaude/ at launch!


♻️ Zero-Downtime Hot Reloading

When you are deep in a Claude Code debug session, discovering your chosen model is failing can be frustrating.

Instead of dropping context:

  1. Keep the proxy running in the foreground (moon start).
  2. Press Alt+M.
  3. Select a new LLM from your interactive UI overlay.
  4. MoonClaude dynamically pre-warms its internal router injection without dropping the proxy connection. Your next prompt in the Claude Code terminal instantly bridges over to the new model!

🧠 Memory Engine

MoonClaude injects long-term memory into Claude Code.

  1. MOONCLAUDE.md: Place this file in your project root. MoonClaude automatically reads it and appends it to your AI's persistent context across all sessions.
  2. Context Compression: When you boot moon chat, MoonClaude evaluates your last 20 sessions and constructs an elegant, compressed timeline of files you've modified, goals you've accomplished, and context you shouldn't forget.
  3. Session Replay: Type moon history to explore Rich-formatted logs of all of your previous AI code iterations.

🛡️ Bypassing Claude Restrictions

Claude Code by default only supports Anthropic models and requires an active initial authentication to their servers. To use MoonClaude, you must authenticate once via the official CLI:

claude login

Follow the browser prompt to log into your Anthropic Developer account. Don't worry—MoonClaude will route the actual requests away from Anthropic to your selected free providers instead.

Routing Through MoonClaude

To ensure you don't hit Anthropic's paid API, never run claude directly. Always launch your environment with:

moon chat
# OR
moon launch

MoonClaude sets local HTTP proxy environmental variables (under the hood) to seamlessly override Anthropic's endpoints. Your prompts instantly hit Gemini, Llama, or Qwen instances safely and securely!


License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

moonclaude-2.1.4.tar.gz (40.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

moonclaude-2.1.4-py3-none-any.whl (42.6 kB view details)

Uploaded Python 3

File details

Details for the file moonclaude-2.1.4.tar.gz.

File metadata

  • Download URL: moonclaude-2.1.4.tar.gz
  • Upload date:
  • Size: 40.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for moonclaude-2.1.4.tar.gz
Algorithm Hash digest
SHA256 a2fa5273e6574d0166826ebd075e582f34f9e5aed8958474bd1a3003fd0fd549
MD5 e3ec40387129183fbeddb4bc20710b67
BLAKE2b-256 2c4c2783f0853fb452c1f32710e3545bdf3cdfc4377e2f31887cc7b152c6fe16

See more details on using hashes here.

File details

Details for the file moonclaude-2.1.4-py3-none-any.whl.

File metadata

  • Download URL: moonclaude-2.1.4-py3-none-any.whl
  • Upload date:
  • Size: 42.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for moonclaude-2.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 00187c3e6914e21b00843890948a74bf74af4eb2e8cdbbdc0d4c66f09d71715f
MD5 d4a82f018942490173a39eee87bd5b65
BLAKE2b-256 c4bd64e63bd33f303375f16236646116bdddf9dcf4534cad3842bd4e65d57ec7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page