MahanAI: terminal AI agent for NVIDIA NIM (OpenAI-compatible), tools, streaming, /api-key, safety prompts.

Project description

MahanAI Super

Terminal AI agent (Super 2.0) with multi-model support, streaming chat, tools, and a built-in Claude CLI mode. Docs: MahanAI.

Install

pip install mahanai
mahanai

Launch Options

Flag Description
--compact Compact mode: renders a small MAI ASCII logo and a trimmed header
--server Start the gateway server instead of the chat loop
--port PORT Gateway server port (default: 8080)
--type TYPE Gateway API type: openai (default) or anthropic
--api-key KEY API key clients must send to the gateway; also used as the Anthropic backend key
mahanai --compact
mahanai --server --port 8080 --type openai --api-key my-secret

Gateway Server

--server starts a local HTTP gateway that exposes all configured providers behind a single endpoint. Any tool that speaks the OpenAI or Anthropic wire format (Cursor, Continue, LM Studio, Claude Code, etc.) can point at it.

Starting the server

# OpenAI-compatible gateway on port 8080 (default)
mahanai --server

# Custom port
mahanai --server --port 9000

# Anthropic-compatible gateway
mahanai --server --type anthropic

# With an API key (clients must send Authorization: Bearer <key>)
mahanai --server --port 4343 --api-key sk-gaming

# Anthropic gateway with your Anthropic API key
mahanai --server --type anthropic --api-key sk-ant-...

Endpoints

Server type Endpoint Purpose
openai POST /v1/chat/completions Chat completions
anthropic POST /v1/messages Messages API
both GET /v1/models List all available models

Model routing

The gateway automatically routes requests to the right backend based on the model field in the request:

Model ID Provider Credentials needed
mahanai/mahanai NVIDIA NIM (server) /api-key
meta/llama-3.3-70b-instruct NVIDIA NIM (direct) /api-key-nvidia
claude-opus-4-7 Anthropic --api-key sk-ant-...
claude-sonnet-4-6 Anthropic --api-key sk-ant-...
claude-haiku-4-5-20251001 Anthropic --api-key sk-ant-...
gpt-5.4, gpt-5.2, gpt-5.2-codex OpenAI Codex /codex-login
custom model ID Custom endpoint /custom
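As a rough illustration, routing on the model ID could look like the sketch below. The function name and exact rules are hypothetical, not MahanAI's actual implementation:

```python
def route_model(model_id: str) -> str:
    """Pick a backend for a request based on its model ID.

    Illustrative sketch of the routing table above; the real
    gateway's logic may differ.
    """
    if model_id.startswith("claude-"):
        return "anthropic"
    if model_id.startswith("gpt-"):
        return "codex"
    if model_id == "mahanai/mahanai":
        return "nim-server"
    if "/" in model_id:  # vendor-prefixed IDs, e.g. meta/llama-3.3-70b-instruct
        return "nim-direct"
    return "custom"  # anything else falls through to the custom endpoint
```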

Format conversion

Requests are automatically converted between formats:

  • OpenAI gateway + Claude model → converts OpenAI→Anthropic, calls Anthropic API, converts response back
  • Anthropic gateway + NVIDIA/Codex model → converts Anthropic→OpenAI, calls backend, converts response back
  • Same-format routes (OpenAI gateway + NVIDIA/Codex) → transparent proxy, no conversion overhead

Streaming SSE is preserved end-to-end for all routes.
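The core of the OpenAI→Anthropic direction can be sketched as follows, assuming plain string message contents. The main structural difference is that Anthropic carries the system prompt in a top-level `system` field and requires `max_tokens`; this is an illustration, not MahanAI's converter:

```python
def openai_to_anthropic(req: dict) -> dict:
    """Convert an OpenAI chat request body into Anthropic Messages shape.

    Minimal sketch: pull system messages out of the list into the
    top-level `system` field and ensure max_tokens is present.
    """
    system_parts = [m["content"] for m in req["messages"] if m["role"] == "system"]
    out = {
        "model": req["model"],
        "max_tokens": req.get("max_tokens", 1024),  # Anthropic requires this field
        "messages": [m for m in req["messages"] if m["role"] != "system"],
    }
    if system_parts:
        out["system"] = "\n".join(system_parts)
    return out
```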

Authentication

If --api-key is supplied, the server validates every incoming request:

Authorization: Bearer <your-key>

Requests with a missing or wrong key receive HTTP 401. Omit --api-key to run with open access (local use only).

Example curl

curl http://localhost:4343/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-gaming" \
  -d '{
    "model": "gpt-5.2",
    "messages": [
      {"role": "system", "content": "You are a helpful assistant."},
      {"role": "user",   "content": "Hello!"}
    ]
  }'

Themes

MahanAI supports four terminal color themes, including two designed for colorblind accessibility.

Theme Description
midnight Dark terminal — purple→cyan gradient banner (default)
light Light terminal — navy→teal gradient banner
midnight-cb Dark + colorblind-friendly — blue replaces green, yellow replaces red
light-cb Light + colorblind-friendly
/themes                   # list all themes
/themes midnight
/themes light
/themes midnight-cb
/themes light-cb

Themes persist across sessions (saved to config.json). The banner gradient, prompt colors, and status colors all update when you switch.


Command Approvals

Every tool action MahanAI takes on your behalf requires explicit approval before it runs. The prompt style depends on the action type:

Shell commands

  Shell Command
  npm install react

  [A] Allow    [W] Always Allow (npm)    [D] Deny
  >

Git commands

  Git Command
  git push origin main

  [A] Allow    [D] Deny
  >

GitHub CLI commands

  GitHub Command
  gh pr create --title "..."

  [A] Allow    [D] Deny
  >

File operations

  Write / Create File
  C:\Users\Mahan\project\main.py

  [A] Allow    [W] Always Allow (Write / Create File)    [D] Deny
  >

Approval options

Key Effect
A Allow once
W Always Allow — stored in config.json, never asked again for that command prefix or file op
D Deny — optionally type an instruction the AI will receive as the tool result

Always Allow is available for shell commands (stored by command prefix, e.g. npm) and file operations (stored by operation type). It is not available for git or GitHub commands — those always prompt.

Destructive commands (rm -rf, format, shutdown, etc.) are flagged [DESTRUCTIVE] and Always Allow is disabled for them regardless.
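The decision logic for shell commands can be sketched like this. The prefix list and function are illustrative only; MahanAI's real destructive-command list is longer:

```python
DESTRUCTIVE_PREFIXES = ("rm -rf", "format", "shutdown")  # illustrative subset

def needs_prompt(command: str, allowed_prefixes: set[str]) -> bool:
    """Decide whether a shell command still requires an approval prompt.

    Sketch of the behavior above: destructive commands always prompt,
    anything else can be skipped by a stored Always Allow prefix rule.
    """
    if any(command.startswith(p) for p in DESTRUCTIVE_PREFIXES):
        return True  # Always Allow is disabled for destructive commands
    words = command.split()
    first_word = words[0] if words else ""
    return first_word not in allowed_prefixes
```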

Managing stored rules

/approvals          # list all Always Allow rules
/approvals clear    # remove all Always Allow rules

Models

MahanAI Super supports multiple backends selectable at runtime via /models.

NVIDIA NIM

Pretty Name Model ID Backend
Llama 3.3 70B meta/llama-3.3-70b-instruct NVIDIA NIM (direct)

Note: A legacy server mode (mahanai/mahanai) exists in the model selector but is undocumented and not recommended for use.

Claude

Pretty Name Model ID Backend
Claude Opus 4 claude-opus-4-7 Claude CLI
Claude Sonnet 4.6 claude-sonnet-4-6 Claude CLI
Claude Haiku 4.5 claude-haiku-4-5-20251001 Claude CLI

OpenAI Codex

Seven models available, each accessible in Direct and Indirect mode (see OpenAI Codex below):

Pretty Name Model ID
GPT-5.4 gpt-5.4
GPT-5.2-Codex gpt-5.2-codex
GPT-5.1-Codex-Max gpt-5.1-codex-max
GPT-5.4-Mini gpt-5.4-mini
GPT-5.3-Codex gpt-5.3-codex
GPT-5.2 gpt-5.2
GPT-5.1-Codex-Mini gpt-5.1-codex-mini

Switch models interactively with /models (arrow-key selector), or quick-switch with /mode claude or /mode default.

Default model: MahanAI starts on Claude Haiku 4.5 out of the box.

Custom Endpoint

Point MahanAI at any OpenAI-compatible API (Ollama, LM Studio, vLLM, OpenRouter, etc.):

/custom http://localhost:11434/v1 llama3 [optional-api-key]

Once saved, select Custom Endpoint from /models to start using it. The config persists across sessions.


Commands

Command Description
/models Interactive model selector (↑↓ arrows, Enter to confirm, Esc to cancel)
/mode claude Quick-switch to Claude Sonnet 4.6
/mode default Quick-switch back to MahanAI Super (server)
/effort <level> Set reasoning effort: low, medium, high, very-high
/plan on Enable plan mode — MahanAI outlines a plan before every response
/plan off Disable plan mode
/themes List available color themes
/themes <name> Switch theme: midnight, light, midnight-cb, light-cb
/approvals Show stored Always Allow rules
/approvals clear Remove all Always Allow rules
/api-key [key] Save server API key (omit key for hidden prompt)
/api-key clear Remove saved server key
/api-key-nvidia [key] Save NVIDIA direct API key
/api-key-nvidia clear Remove NVIDIA key, switch back to server
/codex-login Sign in to OpenAI via browser (Codex Direct mode)
/codex-logout Remove saved OpenAI Codex credentials
/custom [url [model [key]]] Configure a custom OpenAI-compatible endpoint
/custom clear Remove saved custom endpoint
/help Show help
/exit or /quit Leave

Effort Levels

/effort controls how much reasoning the model applies before responding.

Level Effect
low Concise and fast. Minimal reasoning.
medium Balanced (default).
high Careful, thorough reasoning before responding.
very-high Maximum reasoning depth. ⚠️ Significantly higher token usage and slower responses.
/effort high
/effort very-high

Note: Effort is disabled for Claude Haiku 4.5 — it does not support extended thinking. Switch to Opus or Sonnet to use effort levels.

For OpenAI Codex models, effort maps to the reasoning.effort parameter (low / medium / high). For Claude models (Opus, Sonnet), the effort instruction is prepended to your prompt to guide reasoning depth.
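A sketch of that per-backend mapping, with one assumption flagged: since the OpenAI parameter only accepts low / medium / high, this example clamps very-high to high on the Codex path (the clamping is a guess, not documented behavior):

```python
def apply_effort(backend: str, effort: str, payload: dict) -> dict:
    """Attach a reasoning-effort setting to an outgoing request (sketch)."""
    payload = dict(payload)
    if backend == "codex":
        # Assumption: very-high is clamped, since the API knows only low/medium/high.
        payload["reasoning"] = {"effort": "high" if effort == "very-high" else effort}
    elif backend == "claude":
        # Prepend an effort instruction to the latest user message.
        msgs = list(payload["messages"])
        msgs[-1] = {**msgs[-1],
                    "content": f"[Reasoning effort: {effort}] " + msgs[-1]["content"]}
        payload["messages"] = msgs
    return payload
```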

Plan Mode

Plan mode instructs MahanAI to outline its approach before taking action on every message — useful for complex multi-step tasks where you want visibility into the reasoning before execution.

/plan on    # enable
/plan off   # disable

Plan mode works across all model backends.


API Keys

Server / NVIDIA NIM

  1. Environment: MAHANAI_API_KEY=...
  2. Project .env: MAHANAI_API_KEY=...
  3. In-app: /api-key your-key

Keys are stored under %APPDATA%\MahanAI\config.json on Windows or ~/.config/mahanai/config.json on Linux/macOS.
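The path resolution described above (including the MAHANAI_CONFIG_DIR override documented under Environment Variables) can be sketched as:

```python
import os
from pathlib import Path

def config_path() -> Path:
    """Resolve where config.json lives, per the rules above (sketch only)."""
    override = os.environ.get("MAHANAI_CONFIG_DIR")
    if override:
        return Path(override) / "config.json"
    if os.name == "nt":
        # %APPDATA%\MahanAI\config.json on Windows
        base = Path(os.environ.get("APPDATA", Path.home())) / "MahanAI"
    else:
        # ~/.config/mahanai/config.json on Linux/macOS
        base = Path.home() / ".config" / "mahanai"
    return base / "config.json"
```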

Claude CLI mode

Claude models use your local claude CLI installation. Make sure Claude Code is installed and on your PATH. No extra API key configuration needed inside MahanAI — it uses whatever account Claude CLI is authenticated with.

OpenAI Codex

MahanAI supports two Codex authentication modes:

Direct mode

Signs in to your OpenAI account via a browser-based OAuth PKCE flow — no API key needed.

/codex-login

This opens your browser to auth.openai.com. After you approve, MahanAI receives and stores the access token automatically. Tokens are refreshed silently before they expire (saved to the same config.json as other keys).
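For context, the first step of any PKCE flow is generating a code_verifier / code_challenge pair. This generic sketch shows that step only; it is standard OAuth PKCE (S256), not MahanAI-specific code:

```python
import base64
import hashlib
import secrets

def pkce_pair() -> tuple[str, str]:
    """Generate an OAuth PKCE code_verifier / code_challenge pair (S256 method)."""
    # 32 random bytes -> 43-char URL-safe verifier (padding stripped)
    verifier = base64.urlsafe_b64encode(secrets.token_bytes(32)).rstrip(b"=").decode()
    digest = hashlib.sha256(verifier.encode()).digest()
    challenge = base64.urlsafe_b64encode(digest).rstrip(b"=").decode()
    return verifier, challenge
```

The client sends the challenge when opening the browser and later proves possession of the verifier when exchanging the authorization code for tokens, so no client secret is needed.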

Indirect mode

Reads credentials from a locally installed and signed-in OpenAI Codex CLI. MahanAI looks for auth.json in these locations:

Platform Paths checked
Windows %LOCALAPPDATA%\OpenAI\Codex\auth.json, ~\.codex\auth.json
macOS / Linux ~/.codex/auth.json, ~/.config/codex/auth.json

If no token file is found, MahanAI falls back to running the codex CLI as a subprocess (requires Codex CLI on your PATH).

To use indirect mode, install and sign in to the Codex CLI first:

npm i -g @openai/codex
codex login

Then select any OpenAI Codex (Indirect) model from /models.

Custom Endpoint

Use /custom to connect to any OpenAI-compatible server — Ollama, LM Studio, vLLM, OpenRouter, or your own deployment.

Interactive setup (prompts for each field):

/custom

One-liner:

/custom <base-url> [model] [api-key]

Examples:

/custom http://localhost:11434/v1 llama3
/custom http://localhost:1234/v1 mistral-7b
/custom https://openrouter.ai/api/v1 openai/gpt-4o sk-or-...
  • base-url — the /v1 base URL of the server
  • model — model ID to send in requests (defaults to gpt-3.5-turbo if omitted)
  • api-key — leave blank if the server doesn't require one

After saving, run /models and select Custom Endpoint. To remove:

/custom clear

Environment Variables

Variable Purpose
MAHANAI_API_KEY Override saved server API key
MAHANAI_MODEL Override default model ID
MAHANAI_STREAM Set to 0/false/no/off to disable streaming
MAHANAI_CONFIG_DIR Override config file directory
NO_COLOR Disable terminal colors

Tools

Every tool action is shown to you for approval before it runs (see Command Approvals above).

Tool Description
run_command Run a shell command
read_file Read a file
write_file Create or overwrite a file
append_file Append to a file
list_directory List directory contents

Develop

pip install -e .
python -m mahanai

Publish to PyPI

Bump version in both pyproject.toml and mahanai/__init__.py, then:

pip install build twine
python -m build
python -m twine check dist/*

Windows (PowerShell):

$env:TWINE_USERNAME = "__token__"
$env:TWINE_PASSWORD = "pypi-YOUR_TOKEN_HERE"
python -m twine upload dist/mahanai-4.0.0*

macOS / Linux:

export TWINE_USERNAME=__token__
export TWINE_PASSWORD=pypi-YOUR_TOKEN_HERE
python -m twine upload dist/mahanai-4.0.0*

twine cannot publish without your token; keep it out of git.


Download files

Source Distribution

mahanai-4.6.0.tar.gz (38.4 kB, source)

Built Distribution

mahanai-4.6.0-py3-none-any.whl (36.2 kB, Python 3 wheel)

File details

Details for the file mahanai-4.6.0.tar.gz.

File metadata

  • Download URL: mahanai-4.6.0.tar.gz
  • Size: 38.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for mahanai-4.6.0.tar.gz
Algorithm Hash digest
SHA256 0f9aefb503acf16eea725b9e4b88aca4cd347b581da1f66057ca907435931d11
MD5 a0987ec554f8900fd4637733030fd491
BLAKE2b-256 169c597d020d8733894d693203e068ec1588d325295b8158033c8b1de8db61d2

File details

Details for the file mahanai-4.6.0-py3-none-any.whl.

File metadata

  • Download URL: mahanai-4.6.0-py3-none-any.whl
  • Size: 36.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for mahanai-4.6.0-py3-none-any.whl
Algorithm Hash digest
SHA256 2efa66afe583a0c8168e2aa8e398efc12396643fc960f49871e3cfd670cde303
MD5 1f3511b82a76dd58f53d7694c19772aa
BLAKE2b-256 092802bf2e61bf98f54a6e82efb20ffbde8396f40176281949889f560d624a74
