Skip to main content

NVHive — Multi-LLM orchestration platform with intelligent routing, hive consensus, and auto-agent generation

Project description

nvHive

One command. Every AI model. Your GPU or the cloud.

version python license tests providers models

  • Ask once, get the best model. nvHive routes your question to the right LLM across 22 providers and 63 models — automatically, based on task type, cost, and privacy.
  • Run it free on your GPU. NVIDIA Nemotron models run locally via Ollama with no API keys, no cloud costs, no data leaving your machine.
  • Council mode. When one model isn't enough, multiple LLMs debate your question and synthesize a consensus answer.

nvHive CLI


Quick Start

pip install nvhive
nvh "What is machine learning?"

No API keys needed — works immediately with free providers. Run nvh setup to add more.

Platform-specific installers

Linux (NVIDIA GPU):

curl -fsSL https://raw.githubusercontent.com/thatcooperguy/nvHive/main/install.sh | bash

macOS:

curl -fsSL https://raw.githubusercontent.com/thatcooperguy/nvHive/main/install-mac.sh | bash

Windows (PowerShell):

iwr -useb https://raw.githubusercontent.com/thatcooperguy/nvHive/main/install.ps1 | iex

Auto-detects GPU, downloads the right Nemotron model, configures everything. Supports Linux (NVIDIA CUDA), macOS (Apple Silicon Metal), and Windows.

How It Works

  1. You type: nvh "Should I use Redis or Postgres for sessions?"
  2. The action detector checks if this is a system command. If so, it executes directly.
  3. The smart router classifies the task, scores all advisors on relevance, cost, and speed.
  4. Local-first: simple queries stay on Nemotron (free, private, no network).
  5. Cloud when needed: complex queries route to the best cloud model.

Core Commands

Command What It Does
nvh "question" Smart route to the best available model
nvh convene "question" Council of AI experts debate and synthesize
nvh throwdown "question" Two-pass deep analysis with critique
nvh safe "question" Local only — nothing leaves your machine
nvh code / write / research Task-optimized routing
nvh setup Interactive provider setup wizard
nvh webui Launch the web dashboard at http://nvhive
nvh integrate Auto-detect and connect all platforms
nvh status Providers, GPU, budget at a glance

Full command reference →

Providers

22 providers. 63 models. 25 free — no credit card required.

Ollama (local), OpenAI, Anthropic, Google Gemini, Groq, NVIDIA NIM, DeepSeek, GitHub Models, LLM7, Mistral, Cohere, Cerebras, SambaNova, and more. The smart router picks the best one. Or go direct: nvh groq "question".

Full provider table →


Integrations

nvHive connects to your existing AI platforms. Auto-detect everything with one command:

nvh integrate --auto

Or set up each platform individually:

NVIDIA NemoClaw

nvHive works as an inference provider inside NemoClaw, giving NemoClaw agents access to smart routing, council consensus, and throwdown analysis.

nvh nemoclaw --start    # start proxy
nvh nemoclaw --test     # verify connectivity
graph LR
    subgraph NemoClaw Sandbox
        A[OpenClaw Agent] --> B[inference.local]
    end
    B -->|OpenShell Gateway| C[nvHive Proxy :8000]
    C --> D[Smart Router]
    D --> E[Ollama / Nemotron]
    D --> F[22 Cloud Providers]
    D --> G[Council / Throwdown]

    style C fill:#76B900,color:#000
    style E fill:#76B900,color:#000

Virtual models: auto, safe, council, council:N, throwdown. Set x-nvhive-privacy: local-only for sensitive queries.

Full NemoClaw guide →

Anthropic Claude Code

Register nvHive as an MCP tool server — Claude Code gets access to multi-model routing, council consensus, and provider management:

pip install "nvhive[mcp]"
claude mcp add nvhive nvh mcp
graph LR
    A[Claude Code] -->|MCP stdio| B[nvHive MCP Server]
    B --> C[ask / ask_safe]
    B --> D[council / throwdown]
    B --> E[status / list_advisors]
    C --> F[nvHive Engine]
    D --> F
    F --> G[22 LLM Providers]

MCP tools: ask, ask_safe, council, throwdown, status, list_advisors, list_cabinets.

OpenClaw

Register nvHive tools with any OpenClaw agent:

nvh openclaw --config     # generates openclaw.json
nvh openclaw --start      # start MCP server
graph LR
    A[OpenClaw Agent] -->|MCP stdio| B[nvHive MCP Server]
    B --> C[ask / council / throwdown]
    C --> D[Smart Router]
    D --> E[Local + Cloud Providers]

Cursor

nvh integrate --auto      # auto-detects Cursor and configures

Or manually add to ~/.cursor/mcp.json:

{ "mcpServers": { "nvhive": { "command": "nvhive-mcp" } } }

OpenAI-Compatible Proxy

Any tool that speaks the OpenAI API can use nvHive as a backend:

from openai import OpenAI
client = OpenAI(base_url="http://nvhive:8000/v1/proxy", api_key="nvhive")
response = client.chat.completions.create(
    model="auto",  # nvHive picks the best model
    messages=[{"role": "user", "content": "Hello"}]
)

Integration Architecture

graph TB
    subgraph Clients
        CLI[nvh CLI]
        WEB[Web UI]
        NC[NemoClaw]
        OC[OpenClaw]
        CC[Claude Code]
        CU[Cursor]
        SDK[Any OpenAI SDK]
    end
    subgraph "nvHive Core"
        API[API Server :8000]
        MCP[MCP Server]
        PROXY[OpenAI Proxy]
        ROUTER[Smart Router]
        COUNCIL[Council Engine]
    end
    subgraph Providers
        LOCAL[Ollama / Nemotron]
        CLOUD[OpenAI / Anthropic / Google]
        FREE[Groq / LLM7 / GitHub Models]
    end

    CLI --> API
    WEB --> API
    NC -->|OpenShell Gateway| PROXY
    OC -->|MCP| MCP
    CC -->|MCP| MCP
    CU -->|MCP| MCP
    SDK -->|OpenAI API| PROXY

    MCP --> API
    PROXY --> API
    API --> ROUTER
    ROUTER --> COUNCIL
    ROUTER --> LOCAL
    ROUTER --> CLOUD
    ROUTER --> FREE

    style LOCAL fill:#76B900,color:#000
    style NC fill:#76B900,color:#000
    style API fill:#1a1a1a,color:#76B900,stroke:#76B900

NemoClaw guide → · SDK & API reference →


Web Dashboard

Launch the NVIDIA-themed dashboard:

nvh webui                 # auto-configures hostname + best port

Access at http://nvhive:3000 (auto-configured) or http://localhost:3000.

nvHive Web Dashboard

8 pages: Chat, Council, Query Builder, Advisors, Integrations, System, Settings, and Setup Wizard. NVIDIA dark theme with green accents, real-time streaming, command palette (Ctrl+K), and keyboard shortcuts.

Screenshots
Advisors System Dashboard
Advisors System
Integrations Setup Wizard
Integrations Setup

Full WebUI guide →


Privacy and Safe Mode

  • nvh safe — local models only, nothing leaves your machine
  • Local default — simple queries stay on Ollama, complex route to cloud
  • HIVE.md — drop a context file in any project, all advisors see it automatically

Python SDK

from nvh import ask, convene, safe

response = await ask("What is machine learning?")
result = await convene("Should we use Rust?", cabinet="engineering")
private = await safe("Analyze my salary data")

Sync versions: ask_sync, convene_sync, safe_sync. SDK reference →

Learn More

Guide Description
Getting Started First-time setup and usage
Commands Full CLI reference
Providers 22 providers, GPU-adaptive models
Council System Multi-LLM consensus, 12 cabinets
NemoClaw NVIDIA NemoClaw integration
SDK & API Python SDK, proxy, MCP server
Web Interface Dashboard pages and design
Orchestration GPU-powered routing and eval
For Students Homework, tutoring, exam prep
Tools 27 built-in tools
Configuration Config, HIVE.md, budget
Architecture System design and data flow

Contributing

See CONTRIBUTING.md for development setup and pull request guidelines.

License

MIT License. See LICENSE for details.

Project details


Release history Release notifications | RSS feed

This version

0.3.0

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nvhive-0.3.0.tar.gz (279.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nvhive-0.3.0-py3-none-any.whl (318.6 kB view details)

Uploaded Python 3

File details

Details for the file nvhive-0.3.0.tar.gz.

File metadata

  • Download URL: nvhive-0.3.0.tar.gz
  • Upload date:
  • Size: 279.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for nvhive-0.3.0.tar.gz
Algorithm Hash digest
SHA256 9fa5219f52afc92ca726ab4501e463ea4ab841d1889959f59e9fb9525dc0b333
MD5 ff7a52f39c096d443efbfcf56ec38a97
BLAKE2b-256 3de2d65d010cba4238a9143f9dcc27718ce3c43c3c70d1b650d13d1dfd0c6e64

See more details on using hashes here.

File details

Details for the file nvhive-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: nvhive-0.3.0-py3-none-any.whl
  • Upload date:
  • Size: 318.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for nvhive-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 2be817b0fb4f500cf5ebfd6dc7ef76a28dbbc9ef6279cc68d6ec492ad0f67b69
MD5 4f404ff313745e3a837bd7d6f3124c1e
BLAKE2b-256 e2f70e9b3d8c75f8c56808f403c2e35c7b537a741eec54eb03306557bfb23d32

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page