NVHive — Multi-LLM orchestration platform with intelligent routing, hive consensus, and auto-agent generation
Project description
nvHive
One command. Every AI model. Your GPU or the cloud.
- Ask once, get the best model. nvHive routes your question to the right LLM across 22 providers and 63 models — automatically, based on task type, cost, and privacy.
- Run it free on your GPU. NVIDIA Nemotron models run locally via Ollama with no API keys, no cloud costs, no data leaving your machine.
- Council mode. When one model isn't enough, multiple LLMs debate your question and synthesize a consensus answer.
Quick Start
pip install nvhive
nvh "What is machine learning?"
No API keys needed — works immediately with free providers. Run nvh setup to add more.
Platform-specific installers
Linux (NVIDIA GPU):
curl -fsSL https://raw.githubusercontent.com/thatcooperguy/nvHive/main/install.sh | bash
macOS:
curl -fsSL https://raw.githubusercontent.com/thatcooperguy/nvHive/main/install-mac.sh | bash
Windows (PowerShell):
iwr -useb https://raw.githubusercontent.com/thatcooperguy/nvHive/main/install.ps1 | iex
Auto-detects GPU, downloads the right Nemotron model, configures everything. Supports Linux (NVIDIA CUDA), macOS (Apple Silicon Metal), and Windows.
How It Works
- You type:
nvh "Should I use Redis or Postgres for sessions?" - The action detector checks if this is a system command. If so, it executes directly.
- The smart router classifies the task, scores all advisors on relevance, cost, and speed.
- Local-first: simple queries stay on Nemotron (free, private, no network).
- Cloud when needed: complex queries route to the best cloud model.
Core Commands
| Command | What It Does |
|---|---|
nvh "question" |
Smart route to the best available model |
nvh convene "question" |
Council of AI experts debate and synthesize |
nvh throwdown "question" |
Two-pass deep analysis with critique |
nvh safe "question" |
Local only — nothing leaves your machine |
nvh code / write / research |
Task-optimized routing |
nvh setup |
Interactive provider setup wizard |
nvh webui |
Launch the web dashboard at http://nvhive |
nvh integrate |
Auto-detect and connect all platforms |
nvh status |
Providers, GPU, budget at a glance |
Providers
22 providers. 63 models. 25 free — no credit card required.
Ollama (local), OpenAI, Anthropic, Google Gemini, Groq, NVIDIA NIM, DeepSeek, GitHub Models, LLM7, Mistral, Cohere, Cerebras, SambaNova, and more. The smart router picks the best one. Or go direct: nvh groq "question".
Integrations
nvHive connects to your existing AI platforms. Auto-detect everything with one command:
nvh integrate --auto
Or set up each platform individually:
NVIDIA NemoClaw
nvHive works as an inference provider inside NemoClaw, giving NemoClaw agents access to smart routing, council consensus, and throwdown analysis.
nvh nemoclaw --start # start proxy
nvh nemoclaw --test # verify connectivity
graph LR
subgraph NemoClaw Sandbox
A[OpenClaw Agent] --> B[inference.local]
end
B -->|OpenShell Gateway| C[nvHive Proxy :8000]
C --> D[Smart Router]
D --> E[Ollama / Nemotron]
D --> F[22 Cloud Providers]
D --> G[Council / Throwdown]
style C fill:#76B900,color:#000
style E fill:#76B900,color:#000
Virtual models: auto, safe, council, council:N, throwdown. Set x-nvhive-privacy: local-only for sensitive queries.
Anthropic Claude Code
Register nvHive as an MCP tool server — Claude Code gets access to multi-model routing, council consensus, and provider management:
pip install "nvhive[mcp]"
claude mcp add nvhive nvh mcp
graph LR
A[Claude Code] -->|MCP stdio| B[nvHive MCP Server]
B --> C[ask / ask_safe]
B --> D[council / throwdown]
B --> E[status / list_advisors]
C --> F[nvHive Engine]
D --> F
F --> G[22 LLM Providers]
MCP tools: ask, ask_safe, council, throwdown, status, list_advisors, list_cabinets.
OpenClaw
Register nvHive tools with any OpenClaw agent:
nvh openclaw --config # generates openclaw.json
nvh openclaw --start # start MCP server
graph LR
A[OpenClaw Agent] -->|MCP stdio| B[nvHive MCP Server]
B --> C[ask / council / throwdown]
C --> D[Smart Router]
D --> E[Local + Cloud Providers]
Cursor
nvh integrate --auto # auto-detects Cursor and configures
Or manually add to ~/.cursor/mcp.json:
{ "mcpServers": { "nvhive": { "command": "nvhive-mcp" } } }
OpenAI-Compatible Proxy
Any tool that speaks the OpenAI API can use nvHive as a backend:
from openai import OpenAI
client = OpenAI(base_url="http://nvhive:8000/v1/proxy", api_key="nvhive")
response = client.chat.completions.create(
model="auto", # nvHive picks the best model
messages=[{"role": "user", "content": "Hello"}]
)
Integration Architecture
graph TB
subgraph Clients
CLI[nvh CLI]
WEB[Web UI]
NC[NemoClaw]
OC[OpenClaw]
CC[Claude Code]
CU[Cursor]
SDK[Any OpenAI SDK]
end
subgraph "nvHive Core"
API[API Server :8000]
MCP[MCP Server]
PROXY[OpenAI Proxy]
ROUTER[Smart Router]
COUNCIL[Council Engine]
end
subgraph Providers
LOCAL[Ollama / Nemotron]
CLOUD[OpenAI / Anthropic / Google]
FREE[Groq / LLM7 / GitHub Models]
end
CLI --> API
WEB --> API
NC -->|OpenShell Gateway| PROXY
OC -->|MCP| MCP
CC -->|MCP| MCP
CU -->|MCP| MCP
SDK -->|OpenAI API| PROXY
MCP --> API
PROXY --> API
API --> ROUTER
ROUTER --> COUNCIL
ROUTER --> LOCAL
ROUTER --> CLOUD
ROUTER --> FREE
style LOCAL fill:#76B900,color:#000
style NC fill:#76B900,color:#000
style API fill:#1a1a1a,color:#76B900,stroke:#76B900
NemoClaw guide → · SDK & API reference →
Web Dashboard
Launch the NVIDIA-themed dashboard:
nvh webui # auto-configures hostname + best port
Access at http://nvhive:3000 (auto-configured) or http://localhost:3000.
8 pages: Chat, Council, Query Builder, Advisors, Integrations, System, Settings, and Setup Wizard. NVIDIA dark theme with green accents, real-time streaming, command palette (Ctrl+K), and keyboard shortcuts.
Screenshots
| Advisors | System Dashboard |
|---|---|
| Integrations | Setup Wizard |
|---|---|
Privacy and Safe Mode
nvh safe— local models only, nothing leaves your machine- Local default — simple queries stay on Ollama, complex route to cloud
- HIVE.md — drop a context file in any project, all advisors see it automatically
Python SDK
from nvh import ask, convene, safe
response = await ask("What is machine learning?")
result = await convene("Should we use Rust?", cabinet="engineering")
private = await safe("Analyze my salary data")
Sync versions: ask_sync, convene_sync, safe_sync. SDK reference →
Learn More
| Guide | Description |
|---|---|
| Getting Started | First-time setup and usage |
| Commands | Full CLI reference |
| Providers | 22 providers, GPU-adaptive models |
| Council System | Multi-LLM consensus, 12 cabinets |
| NemoClaw | NVIDIA NemoClaw integration |
| SDK & API | Python SDK, proxy, MCP server |
| Web Interface | Dashboard pages and design |
| Orchestration | GPU-powered routing and eval |
| For Students | Homework, tutoring, exam prep |
| Tools | 27 built-in tools |
| Configuration | Config, HIVE.md, budget |
| Architecture | System design and data flow |
Contributing
See CONTRIBUTING.md for development setup and pull request guidelines.
License
MIT License. See LICENSE for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file nvhive-0.3.0.tar.gz.
File metadata
- Download URL: nvhive-0.3.0.tar.gz
- Upload date:
- Size: 279.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9fa5219f52afc92ca726ab4501e463ea4ab841d1889959f59e9fb9525dc0b333
|
|
| MD5 |
ff7a52f39c096d443efbfcf56ec38a97
|
|
| BLAKE2b-256 |
3de2d65d010cba4238a9143f9dcc27718ce3c43c3c70d1b650d13d1dfd0c6e64
|
File details
Details for the file nvhive-0.3.0-py3-none-any.whl.
File metadata
- Download URL: nvhive-0.3.0-py3-none-any.whl
- Upload date:
- Size: 318.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2be817b0fb4f500cf5ebfd6dc7ef76a28dbbc9ef6279cc68d6ec492ad0f67b69
|
|
| MD5 |
4f404ff313745e3a837bd7d6f3124c1e
|
|
| BLAKE2b-256 |
e2f70e9b3d8c75f8c56808f403c2e35c7b537a741eec54eb03306557bfb23d32
|