NVHive — Multi-LLM orchestration platform with intelligent routing, hive consensus, and auto-agent generation

These details have not been verified by PyPI

Project links

Project description

nvHive

Multi-LLM orchestration platform for NVIDIA GPUs and the cloud.

version python license tests providers models

What is nvHive?

nvHive routes your questions to the right AI model automatically. It manages 22 providers and 63 models behind a single nvh command, picking the best advisor based on task type, cost, and privacy requirements. Simple questions stay local on your GPU (free, private). Complex questions route to the best cloud model. You can also convene a council of AI-generated expert personas to debate a decision, or poll every provider at once to compare answers. Twenty-five of the supported models are free with no credit card required.

Platform Support

Platform	GPU Support	Install
Linux (NVIDIA GPU)	Full (CUDA, pynvml)	`install.sh` or pip
macOS (Apple Silicon)	Metal via Ollama	`install-mac.sh` or pip
macOS (Intel)	CPU only	`pip install nvhive`
Windows (NVIDIA GPU)	Full (CUDA, pynvml)	`install.ps1` or pip
Windows (no GPU)	CPU only	`pip install nvhive`
Linux Desktop	Full (auto-detected)	`install.sh`

Quick Start

Linux with GPU:

curl -fsSL https://raw.githubusercontent.com/thatcooperguy/nvHive/main/install.sh | bash

macOS:

curl -fsSL https://raw.githubusercontent.com/thatcooperguy/nvHive/main/install-mac.sh | bash

Windows (PowerShell):

iwr -useb https://raw.githubusercontent.com/thatcooperguy/nvHive/main/install.ps1 | iex

Any platform (pip):

python3 -m pip install nvhive
nvh setup                     # configure your first provider
nvh "What is machine learning?"

From source:

git clone https://github.com/thatcooperguy/nvHive.git
cd nvHive
pip install -e ".[dev]"
nvh doctor                    # verify everything works

What Happens Automatically

When you install nvHive, everything configures itself:

Install runs:
  1. Detects your GPU (NVIDIA via pynvml, Apple Silicon via sysctl)
  2. Reads available VRAM / unified memory
  3. Downloads the right NVIDIA Nemotron model for your hardware:

     GPU Memory        Model Auto-Downloaded         Size    Speed
     ─────────────────────────────────────────────────────────────
     < 4 GB or CPU     nemotron-mini (4B)            ~2 GB   ~30 tok/s
     4–6 GB            nemotron-mini (GPU accel.)     ~2 GB   ~50 tok/s
     6–12 GB           nemotron-small (recommended)   ~5 GB   ~75 tok/s
     12–24 GB          nemotron-small + codellama     ~9 GB   ~110 tok/s
     24–48 GB          nemotron 70B (quantized)       ~40 GB  ~40 tok/s
     48–80 GB          nemotron 70B (full quality)    ~40 GB  ~120 tok/s
     80+ GB            nemotron 120B (flagship)       ~70 GB  ~180 tok/s

  4. Installs Ollama (local model server) — no root needed
  5. Creates config with Ollama + LLM7 (anonymous, free) enabled
  6. Pulls model in background — you can start chatting immediately
  7. Adds 'nvh' to your PATH

First time: ~60 seconds. Reconnect (new VM): ~3 seconds.

You never pick a model. The platform reads your hardware and downloads the best one. On Apple Silicon, it uses Metal via Ollama with unified memory. On NVIDIA, it uses CUDA. On CPU-only systems, it uses free cloud providers.

Your First 60 Seconds

$ nvh "Explain Python decorators in 3 sentences"
╭─ nemotron-small (local, free) ──────────────────────────────────────╮
│ A decorator is a function that takes another function as input and  │
│ returns a modified version of it. You apply one with @decorator     │
│ syntax above a function definition. They're used for cross-cutting  │
│ concerns like logging, caching, and access control without          │
│ modifying the original function's code.                             │
╰─────────────────────────────────────── 0.4s · 52 tokens · $0.00 ───╯

No API keys needed for your first query -- nvHive defaults to free local or anonymous providers. Run nvh setup to add more providers when you are ready.

Core Commands

Essentials

Command	Description
`nvh "question"`	Smart default -- routes to the best available advisor
`nvh ask "question"`	Ask a specific advisor (use `-a provider`)
`nvh convene "question"`	Convene a council of AI-generated expert agents
`nvh poll "question"`	Ask every configured advisor, compare answers
`nvh throwdown "question"`	Two-pass deep analysis across all providers
`nvh quick "question"`	Fastest available model, minimal latency
`nvh safe "question"`	Local models only -- nothing leaves your machine
`nvh do "task"`	Detect action intent and execute (install, open, find)

Focus Modes

Command	Description
`nvh code "question"`	Code-optimized routing and prompts
`nvh write "question"`	Writing-optimized with style guidance
`nvh research "question"`	Multi-source research with citations
`nvh math "question"`	Math and reasoning, step-by-step

Tools

Command	Description
`nvh bench`	GPU benchmark -- measure tokens/second
`nvh scan`	Scan and index project files
`nvh learn "topic"`	Interactive learning sessions
`nvh clip`	Clipboard integration
`nvh voice`	Voice input/output
`nvh imagine "prompt"`	Image generation
`nvh screenshot`	Capture and analyze screenshots
`nvh git`	Git-aware operations

System

Command	Description
`nvh status`	Show configured providers, GPU, active model
`nvh savings`	Track how much you have saved with free/local models
`nvh debug`	Debug mode with verbose output
`nvh doctor`	Diagnose configuration and connectivity
`nvh setup`	Interactive provider setup wizard
`nvh keys`	Show all free API key signup links in one table
`nvh keys --open`	Open all free provider signup pages in browser
`nvh webui`	Install and launch the web UI (optional)
`nvh update`	Check for and install updates
`nvh version`	Print version
`nvh mcp`	Start MCP server (Claude Code, Cursor, OpenClaw)
`nvh openclaw`	Generate OpenClaw/NemoClaw tool config
`nvh nemoclaw`	NemoClaw integration setup guide
`nvh nemoclaw --test`	Test NemoClaw proxy connectivity
`nvh nemoclaw --start`	Start proxy server for NemoClaw

Management

Command	Description
`nvh advisor`	Manage advisor profiles and routing weights
`nvh agent`	Manage auto-generated expert agents and cabinets
`nvh config`	View and edit configuration
`nvh conversation`	List, export, or resume conversations
`nvh budget`	Set and monitor spending limits
`nvh model`	List, pull, or remove models
`nvh template`	Manage prompt templates
`nvh workflow`	Run multi-step YAML pipelines
`nvh knowledge`	Manage knowledge base entries
`nvh schedule`	Schedule recurring queries
`nvh webhook`	Configure webhook integrations
`nvh auth`	Manage API keys and authentication
`nvh plugins`	Install and manage plugins
`nvh serve`	Start the OpenAI-compatible API server
`nvh repl`	Launch interactive REPL
`nvh completions`	Generate shell completions

Direct Advisor Access

Skip the router and talk directly to a provider:

nvh openai "question"       # Route to OpenAI
nvh groq "question"         # Route to Groq
nvh google "question"       # Route to Gemini
nvh ollama "question"       # Route to local Ollama

Works for all 22 providers. Run nvh <provider> with no question to launch that provider's setup.

How It Works

You type a question: nvh "Should I use Redis or Postgres for sessions?"
The action detector checks if this is a system action (install, open, find). If so, it executes directly -- no LLM needed.
If it is a question, the router classifies the task type, scores all configured advisors on relevance, cost, and speed, and picks the best one.
Local-first: simple queries stay on Nemotron via Ollama (free, private, no network).
Cloud when needed: complex or specialized queries route to the best cloud advisor.

Every response shows which advisor answered, how long it took, and what it cost.

Local LLM Orchestration

The local Nemotron model doesn't just answer questions — it acts as an intelligent brain that orchestrates every cloud LLM call. All orchestration runs on your GPU for free.

The Orchestrator's Role

When you ask a question, before any cloud API is called, the local model:

Analyzes your query — detects task type, complexity, privacy needs, and whether web access or code execution is required.
Picks the best advisor — goes beyond keyword matching to understand intent and route to the right cloud model.
Rewrites your prompt — optimizes wording for the target advisor's known strengths, reducing tokens and improving answer quality.
Evaluates the response — checks if the answer is complete and correct, and flags it for retry if not.
Synthesizes locally — when multiple advisors respond, merges their answers on your GPU instead of paying a cloud model to do it.
Compresses conversation history — summarizes long chats before sending context to cloud APIs, cutting token costs.

Tiers

Orchestration scales automatically based on your GPU's available VRAM:

Tier	VRAM Required	Features
`off`	Any	Keyword routing, template agents (fallback mode)
`light`	6 GB+	Smart routing + prompt optimization
`full`	20 GB+	All features: routing, agents, eval, synthesis, compression
`auto`	—	Detects tier from available VRAM (default)

With auto (the default), nvHive reads your GPU VRAM at startup and enables the highest tier your hardware supports. If no local model is available, the engine falls back gracefully to keyword-based routing — no errors, no configuration needed.

Enabling and Disabling

# Show current orchestration mode
nvh config get defaults.orchestration_mode

# Disable orchestration (keyword routing only)
nvh config set defaults.orchestration_mode off

# Enable light mode (smart routing + prompt optimization)
nvh config set defaults.orchestration_mode light

# Enable full mode (all features)
nvh config set defaults.orchestration_mode full

# Auto-detect from VRAM (default)
nvh config set defaults.orchestration_mode auto

Cost Impact

Every orchestration call runs on your local GPU — it costs nothing. The savings come indirectly:

Better routing reduces expensive cloud calls by sending more queries to cheaper or local models.
Prompt optimization sends fewer tokens to cloud APIs, directly reducing per-query cost.
Response evaluation catches bad answers before you need to re-ask, avoiding retry costs.
Local synthesis replaces cloud synthesis calls (the most expensive part of council mode) with free local inference.

Supported AI Providers

Provider	Free Tier	Best For	Models
Ollama (Local)	Unlimited	Privacy, offline	nemotron, codellama, llama3
LLM7	30 RPM, no signup	Anonymous, instant start	Multiple
Groq	30 RPM free	Ultra-fast inference	llama3, mixtral, gemma
GitHub Models	50-150 req/day	Free frontier models	GPT-4o, Llama, Mistral
Google Gemini	15 RPM free	Long context, multimodal	Gemini 1.5 Pro/Flash
NVIDIA NIM	1000 free credits	NVIDIA-optimized	Nemotron, Llama
Cerebras	30 RPM free	Fast inference	Llama3
SambaNova	Free tier	Llama models	Llama3
Fireworks AI	Free tier	Fast open-source	Multiple
SiliconFlow	1000 RPM free	High-throughput	Multiple
Hugging Face	Free API	Open-source models	Thousands
AI21 Labs	Free tier	Jamba models	Jamba
Mistral	2 RPM free	Code	Mistral, Mixtral
Cohere	Trial key	RAG, embeddings	Command R+
OpenAI	Paid	GPT-4o, reasoning	GPT-4o, o1, o3
Anthropic	Paid	Analysis, coding	Claude 3.5/4
DeepSeek	Very cheap	Code, reasoning	DeepSeek V3/R1
Grok (xAI)	Paid	Real-time knowledge	Grok
Perplexity	Paid	Search-augmented	pplx-online
Together AI	Paid	Open-source models	Multiple
OpenRouter	Paid	Meta-router, fallback	All models
Mock	N/A	Unit tests	N/A

25 models are free across 14 providers. Run nvh setup to configure any of them.

GPU-Adaptive Model Selection

nvHive detects your GPU and automatically selects the best local model:

GPU	VRAM	Best Local Model	Performance
No GPU	--	Cloud only	Free tiers: LLM7, Groq, GitHub Models
GTX 1660 / RTX 2060	6 GB	nemotron-mini (4B)	~30 tok/s
RTX 3060	12 GB	nemotron-small	~55 tok/s
RTX 3070 / 3080	8-10 GB	nemotron-small	~75 tok/s
RTX 3090	24 GB	nemotron-small + codellama	~100 tok/s
RTX 4060	8 GB	nemotron-small	~70 tok/s
RTX 4070	12 GB	nemotron-small	~90 tok/s
RTX 4080	16 GB	nemotron-small + models	~130 tok/s
RTX 4090	24 GB	nemotron 70B (Q4)	~40 tok/s (70B)
RTX 5090	32 GB	nemotron 70B (Q4)	~60 tok/s (70B)
A100 / H100	80 GB	nemotron 70B (full)	~120-180 tok/s

Models unload after inactivity to free VRAM for gaming. Run nvh bench to measure your actual throughput.

Auto-Agent Council System

When you run nvh convene, nvHive analyzes your question and generates a panel of expert personas to debate it. Each agent has a defined role, expertise area, and analytical perspective.

12 cabinets with pre-configured expert panels:

Cabinet	Experts
`executive`	CEO, CFO, CTO, Product Manager
`engineering`	Architect, Backend Engineer, DevOps/SRE, Security, QA
`security_review`	Security Engineer, DevOps/SRE, Architect, Legal/Compliance
`code_review`	Architect, Backend Engineer, QA, Performance Engineer
`product`	Product Manager, UX Designer, Engineering Manager, CEO
`data`	Data Engineer, DBA, ML/AI Engineer, Architect
`full_board`	CEO, CFO, CTO, Architect, Backend, DevOps, Security
`homework_help`	Patient Tutor, Devil's Advocate, Study Coach
`code_tutor`	Code Mentor, Bug Hunter, Best Practices Reviewer
`essay_review`	Writing Coach, Logic Checker, Style Editor
`study_group`	Socratic Questioner, ELI5 Explainer, Practice Problem Generator
`exam_prep`	Exam Coach, Flashcard Creator, Weak Spot Finder

nvh convene "Should we migrate to microservices?" --cabinet engineering
nvh convene "Review my essay on climate policy" --cabinet essay_review

Tool System

27 tools across six categories. 18 safe tools run automatically; 9 that modify state require confirmation.

Category	Tools
Files	`read_file`, `write_file`, `list_files`, `search_files`
Code	`run_code`, `shell`
System	`list_processes`, `system_info`, `disk_usage`, `open_app`, `open_url`
Packages	`pip_install`, `pip_list`, `npm_install`
Web	`download`, `web_search`
Clipboard	`get_clipboard`, `set_clipboard`
Notifications	`notify`

Enable tools per query with --tools or globally in the REPL with /tools on.

Privacy and Safe Mode

Three privacy tiers:

Safe mode (nvh safe): Local models only. Nothing leaves your machine. Use for sensitive data, salary info, proprietary code.
Local default: Simple queries use local Ollama. Complex queries route to cloud with your consent.
Cloud: Full access to all configured providers for maximum capability.

nvh safe "Analyze this salary spreadsheet"   # stays 100% local
nvh "Explain quantum computing"              # may route to cloud

HIVE.md Context Injection

Create a HIVE.md file in any project directory. nvHive automatically injects it into the system prompt for every query made from that directory.

# HIVE.md
This is a Python 3.12 FastAPI project using SQLAlchemy and PostgreSQL.
Follow Google Python Style Guide. Prefer async/await patterns.
Test with pytest. Deploy target: Ubuntu 22.04 on GKE.

Every advisor sees your project context automatically.

Python SDK

from nvh import ask, convene, poll, safe, quick

# Simple query
response = await ask("What is machine learning?")

# Specific advisor
response = await ask("Debug this code", advisor="anthropic")

# Council of experts
result = await convene("Should we use Rust?", cabinet="engineering")

# Poll all advisors
results = await poll("Write a sort function")

# Local only
response = await safe("Analyze my salary data")

Synchronous versions available: ask_sync, convene_sync.

OpenAI-Compatible Proxy

Run nvHive as a drop-in backend for any tool that speaks the OpenAI API:

nvh serve --port 8000

Then point any OpenAI SDK client at http://localhost:8000:

from openai import OpenAI
client = OpenAI(base_url="http://localhost:8000/v1", api_key="nvhive")
response = client.chat.completions.create(
    model="auto",  # nvHive picks the best model
    messages=[{"role": "user", "content": "Hello"}]
)

MCP Server (Claude Code, Cursor, OpenClaw)

nvHive exposes its tools via the Model Context Protocol, making them available to Claude Code, Cursor, OpenClaw, and any MCP-compatible client.

# Install MCP support
pip install "nvhive[mcp]"

# Register with Claude Code
claude mcp add nvhive nvh mcp

# Or start as HTTP server for remote clients
nvh mcp -t streamable-http --port 8080

Tools available via MCP: ask, ask_safe, council, throwdown, status, list_advisors, list_cabinets.

For OpenClaw agents, generate the config:

nvh openclaw              # creates openclaw.json with nvHive MCP config
nvh openclaw --agent      # generates NemoClaw agent config

NemoClaw Integration

nvHive works as an inference provider inside NVIDIA NemoClaw, giving NemoClaw agents access to multi-model smart routing, council consensus, and throwdown analysis.

# Setup in three commands:
nvh nemoclaw --start                     # 1. Start nvHive proxy
openshell provider create \              # 2. Register with NemoClaw
    --name nvhive --type openai \
    --credential OPENAI_API_KEY=nvhive \
    --config OPENAI_BASE_URL=http://host.openshell.internal:8000/v1/proxy
openshell inference set \                # 3. Set as default
    --provider nvhive --model auto

NemoClaw agents can request any virtual model:

Model	What It Does
`auto`	Smart routing — best provider for the query
`safe`	Local only — nothing leaves your machine
`council`	3-model consensus with synthesis
`council:N`	N-model council (2-10 members)
`throwdown`	Two-pass deep analysis with critique

Privacy-aware routing: set x-nvhive-privacy: local-only header to force all inference through local Ollama, integrating with NemoClaw's content sensitivity routing.

NemoClaw Sandbox → OpenShell Gateway → nvHive Proxy → 22 providers
                                          ↓
                          Smart Router / Council / Throwdown

Run nvh nemoclaw for the full setup guide, or nvh nemoclaw --test to verify connectivity.

Configuration

Configuration lives at ~/.config/nvhive/config.yaml. Manage it with:

nvh config                    # view current config
nvh config set default_advisor groq
nvh config set safe_mode true
nvh budget set --daily 1.00   # daily spending cap

Workflows

Define multi-step pipelines in YAML:

name: Code Review Pipeline
steps:
  - name: security_scan
    action: ask
    prompt: "Analyze for security vulnerabilities:\n\n{{input}}"
    advisor: anthropic
    save_as: security

  - name: quality_review
    action: ask
    prompt: "Review for quality and best practices:\n\n{{input}}"
    advisor: openai
    save_as: quality

  - name: synthesis
    action: convene
    prompt: "Synthesize findings:\n\nSecurity: {{security}}\nQuality: {{quality}}"
    cabinet: code_review
    save_as: summary

nvh workflow run code_review.yaml --input "$(cat main.py)"

For Students

nvHive was built with students in mind. Five dedicated cabinets teach rather than just answer:

homework_help -- Patient Tutor, Devil's Advocate, and Study Coach guide you to understanding
code_tutor -- Code Mentor, Bug Hunter, and Best Practices Reviewer teach programming
essay_review -- Writing Coach, Logic Checker, and Style Editor improve your writing
study_group -- Socratic Questioner, ELI5 Explainer, and Practice Problem Generator
exam_prep -- Exam Coach, Flashcard Creator, and Weak Spot Finder

All work with free models. Track your savings with nvh savings.

nvh convene "Explain recursion step by step" --cabinet code_tutor
nvh convene "Help me prepare for my calculus final" --cabinet exam_prep

For Linux Desktop

nvHive is designed for deployment on Linux Desktop instances:

Auto-detects cloud sessions and adapts to the available GPU tier
All tools operate at user level -- no root, no sudo
Session-aware: handles ephemeral environments with mounted home directories
Auto-healing: reconnects to Ollama if the instance restarts
GPU VRAM management: models unload after inactivity so games can reclaim VRAM

Web Interface

nvHive includes a full web dashboard for users who prefer a visual experience over the CLI. Launch it with:

nvh webui

The dashboard opens at http://localhost:3000 and connects to the nvHive API automatically.

Pages

Page	What It Does
Chat	Send prompts in single, council, or compare mode with streaming responses
Council	Real-time multi-LLM orchestration with live member progress and synthesis
Query Builder	Advanced query form with provider/model filters and agent presets
Advisors	Provider health status, model listings, and connectivity testing
Integrations	Auto-detect and connect NemoClaw, OpenClaw, Claude Code, Cursor
System	GPU info, cache stats, budget status, and recommendations
Settings	API URL, defaults, budget limits, theme, and council strategy
Setup Wizard	Step-by-step onboarding: GPU detection, local AI, cloud providers

Design

NVIDIA-inspired dark theme with green (#76B900) accents
Angular design language with diamond status indicators
Command palette (Ctrl+K) for quick navigation
Real-time streaming via SSE and WebSocket
Responsive layout for desktop and mobile
Keyboard shortcuts throughout (Ctrl+N, Ctrl+B, Ctrl+/)

Screenshots

Chat Interface	Integrations

Council Mode	System Dashboard

Advisors	Setup Wizard

Architecture Diagram

graph LR
    A[Web UI :3000] -->|REST + WebSocket| B[nvHive API :8000]
    C[nvh CLI] -->|direct| B
    D[OpenAI SDK] -->|proxy| B
    E[MCP Client] -->|stdio/HTTP| F[MCP Server]
    F -->|internal| B
    B --> G[Smart Router]
    G --> H[22 LLM Providers]
    G --> I[Council Engine]
    G --> J[Local Ollama]

Architecture

nvh CLI
  |
  +-- Action Detector -----> Direct execution (install, open, find)
  |
  +-- Router
  |     |-- Task classifier (code, writing, research, math, general)
  |     |-- Advisor scorer (relevance, cost, speed, privacy)
  |     +-- Model selector (GPU VRAM, provider availability)
  |
  +-- Providers (22)
  |     |-- Local: Ollama (Nemotron, CodeLlama, Llama3)
  |     |-- Cloud: OpenAI, Anthropic, Google, Groq, ...
  |     +-- Free: LLM7, GitHub Models, NVIDIA NIM, ...
  |
  +-- Agent System
  |     |-- Auto-generation from query analysis
  |     +-- 12 pre-built cabinets (22 expert personas)
  |
  +-- Tool System (27 tools, 18 safe / 9 confirm)
  |
  +-- SDK + OpenAI-compatible API server

Project Stats

Metric	Value
Python files	81
Lines of code	27,518
Functions	810
Tests	181
Providers	22
Models	63 (25 free)
Tools	27 (18 safe, 9 confirm)
Cabinets	12
Expert personas	22
Wheel size	276 KB
Commits	42

Documentation

Document	Description
Getting Started	First-time setup and usage guide
Hardware Requirements	GPU tiers, VRAM mapping, performance
Testing Guide	Running and writing tests
EULA	End User License Agreement
Privacy Policy	Data handling and privacy
Changelog	Version history

Contributing

See CONTRIBUTING.md for development setup, coding standards, and pull request guidelines.

License

MIT License. See LICENSE for details.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.39.0

May 16, 2026

0.38.0

May 15, 2026

0.37.0

May 15, 2026

0.36.0

May 14, 2026

0.35.1

May 1, 2026

0.35.0

Apr 29, 2026

0.34.0

Apr 28, 2026

0.33.2

Apr 27, 2026

0.33.1

Apr 27, 2026

0.33.0

Apr 27, 2026

0.32.0

Apr 24, 2026

0.31.2

Apr 16, 2026

0.31.1

Apr 16, 2026

0.31.0

Apr 16, 2026

0.30.1

Apr 16, 2026

0.30.0

Apr 16, 2026

0.29.3

Apr 14, 2026

0.29.2

Apr 14, 2026

0.29.1

Apr 14, 2026

0.29.0

Apr 14, 2026

0.28.5

Apr 14, 2026

0.28.4

Apr 14, 2026

0.28.3

Apr 14, 2026

0.28.2

Apr 14, 2026

0.28.1

Apr 14, 2026

0.28.0

Apr 14, 2026

0.27.8

Apr 14, 2026

0.27.7

Apr 14, 2026

0.27.6

Apr 14, 2026

0.27.5

Apr 14, 2026

0.27.4

Apr 14, 2026

0.27.3

Apr 14, 2026

0.27.2

Apr 14, 2026

0.27.1

Apr 13, 2026

0.27.0

Apr 13, 2026

0.26.0

Apr 13, 2026

0.25.0

Apr 13, 2026

0.24.0

Apr 13, 2026

0.23.0

Apr 13, 2026

0.22.3

Apr 13, 2026

0.22.2

Apr 13, 2026

0.22.1

Apr 13, 2026

0.22.0

Apr 13, 2026

0.21.0

Apr 13, 2026

0.20.0

Apr 13, 2026

0.19.0

Apr 13, 2026

0.18.0

Apr 13, 2026

0.17.0

Apr 13, 2026

0.16.0

Apr 13, 2026

0.15.8

Apr 13, 2026

0.15.7

Apr 13, 2026

0.15.6

Apr 13, 2026

0.15.5

Apr 12, 2026

0.15.4

Apr 12, 2026

0.15.3

Apr 12, 2026

0.15.2

Apr 12, 2026

0.15.1

Apr 12, 2026

0.15.0

Apr 12, 2026

0.14.1

Apr 12, 2026

0.14.0

Apr 12, 2026

0.13.1

Apr 12, 2026

0.13.0

Apr 12, 2026

0.12.1

Apr 12, 2026

0.12.0

Apr 11, 2026

0.11.1

Apr 11, 2026

0.11.0

Apr 11, 2026

0.10.0

Apr 11, 2026

0.9.0

Apr 9, 2026

0.8.0

Apr 9, 2026

0.7.0

Apr 9, 2026

0.6.0

Apr 8, 2026

0.5.9

Apr 8, 2026

0.5.6

Apr 6, 2026

0.5.5

Apr 6, 2026

0.5.4

Apr 6, 2026

0.5.3

Apr 6, 2026

0.5.2

Apr 6, 2026

0.5.1

Apr 5, 2026

0.5.0

Apr 5, 2026

0.4.0

Apr 3, 2026

0.3.1

Apr 3, 2026

0.3.0

Apr 2, 2026

This version

0.2.0

Apr 2, 2026

0.1.0

Apr 2, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nvhive-0.2.0.tar.gz (287.7 kB view details)

Uploaded Apr 2, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

nvhive-0.2.0-py3-none-any.whl (320.5 kB view details)

Uploaded Apr 2, 2026 Python 3

File details

Details for the file nvhive-0.2.0.tar.gz.

File metadata

Download URL: nvhive-0.2.0.tar.gz
Upload date: Apr 2, 2026
Size: 287.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for nvhive-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`68cba201ec75f6f82293928c0bee5e9a99a017f28961dece2ce06215e535abcf`
MD5	`86db0dba15837b9e338e35108bd92db4`
BLAKE2b-256	`ec5d8fab3c64162ed63c965be1f9297a451732b055d52394388cf0bacd8b95f9`

See more details on using hashes here.

File details

Details for the file nvhive-0.2.0-py3-none-any.whl.

File metadata

Download URL: nvhive-0.2.0-py3-none-any.whl
Upload date: Apr 2, 2026
Size: 320.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for nvhive-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4e812448292c07de2bf95ec721e33af93577fc555b8066612bd55a51948bb87c`
MD5	`63232c5b89b52b7f88cf83300772620c`
BLAKE2b-256	`3ae6e0dd2d8c87ac6d91eda760e86b396305772b742e9ed05966ab15f949f053`

See more details on using hashes here.

nvhive 0.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

nvHive

What is nvHive?

Platform Support

Quick Start

What Happens Automatically

Your First 60 Seconds

Core Commands

Essentials

Focus Modes

Tools

System

Management

Direct Advisor Access

How It Works

Local LLM Orchestration

The Orchestrator's Role

Tiers

Enabling and Disabling

Cost Impact

Supported AI Providers

GPU-Adaptive Model Selection

Auto-Agent Council System

Tool System

Privacy and Safe Mode

HIVE.md Context Injection

Python SDK

OpenAI-Compatible Proxy

MCP Server (Claude Code, Cursor, OpenClaw)

NemoClaw Integration

Configuration

Workflows

For Students

For Linux Desktop

Web Interface

Pages

Design

Screenshots

Architecture Diagram

Architecture

Project Stats

Documentation

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes