MentAsk: Universal AI Coding Agent with multi-provider support via models.dev

These details have not been verified by PyPI

Project links

Project description

mentask — Autonomous AI Coding Agent for the Terminal

mentask banner

WORK IN PROGRESS | Formerly known as askgem

mentask is a professional, autonomous coding agent that lives in your terminal. Powered by models.dev and Google Gemini, it reads your files, edits your code, runs shell commands, and navigates your filesystem — all within an interactive session and with hardened safety guardrails that keep you in control.

No GUI. No cloud sync. No bloat. Just a fast, opinionated CLI agent you can trust with your codebase.

How it works
Features
New in v0.18.5: Lisan al-Gaib
Project Isolation (/init)
Installation
Configuration
Usage
Slash commands
Safety model
Architecture
Development & Simulation
Internationalization
Repository Standard
Roadmap
Contributing
License

How it works

mentask runs an advanced asynchronous reasoning loop powered by the AgentOrchestrator and a modular manager-based core. On each turn:

Environmental Awareness: At startup, the ContextManager performs a Project Blueprint scan, discovering the project type, structure, and key files to build a proactive system instruction.
Cognitive Loop: Your message is processed by the AgentOrchestrator, which manages the Thinking -> Action -> Observation cycle.
Tool Reasoning: The model calls specialized tools (read, edit, execute). mentask intercepts these via the ToolDispatcher.
Safety Guard: Every action passes through the Security Layer for real-time risk analysis and path validation.
Stream Processing: The StreamProcessor extracts function calls and text mid-flight, showing you the agent's "thought process" in real-time.
Persistence: The full session, including tool results and metrics, is auto-saved to your Workspace history.

This autonomous loop repeats until the mission is accomplished or you interrupt it.

Features

Agentic tool engine

Tool	Description
`list_directory`	Explore filesystem trees with depth control
`read_file`	Read any file with optional line ranges — 30k char cap prevents token overflow
`edit_file`	Find-and-replace with atomic writing, uniqueness guard, and automatic `.bkp` backup
`execute_bash`	Run shell commands with 60s timeout and full Risk Analysis
`manage_memory`	Save important project facts to `memory.md` for long-term recall
`manage_mission`	Track complex goals and sub-tasks via `heartbeat.md` mission control
`manage_workspace`	Detects and initializes local project knowledge bases

Workspace Isolation & Local Intelligence

mentask now distinguishes between your Global Persona and your Project Context:

Local Isolation: Run /init to create a dedicated .mentask/ folder in your project. This isolates sessions, settings, and identity to the current directory.
Local Priority: If a .mentask/ folder exists, it takes precedence for settings, memory, and history.
Project Memory: Knowledge saved via manage_memory is stored in .mentask/memory.md (or .mentask_knowledge.md as fallback), preventing context leakage between repositories.
Project Identity: Customize mentask's personality for a specific project via .mentask/identity.md.

Human-in-the-loop safety

Every destructive action is categorized by risk level (SAFE, NOTICE, WARNING, DANGEROUS). Switch modes anytime mid-session:

/mode manual (default) — approve each file edit and shell command.
/mode auto — trust the agent fully; all actions execute without prompts.

Streaming terminal UX

mentask now runs through a Rich-based terminal renderer:

Real-time Markdown streaming in the terminal.
Inline confirmations for file edits and shell commands.
Focus on a fast, scriptable CLI flow instead of a separate dashboard app.

Persistent session history

Every conversation auto-saves to ~/.mentask/history/ as JSON. Reload any past session with /history load <id>. A rolling context window and proactive summarization keep reloaded sessions within token budget.

New in v0.18.5: Lisan al-Gaib

The v0.18.5 release ("Lisan al-Gaib") transforms mentask into a persistent, self-correcting agent with advanced cognitive tools.

1. Persistent Gem-Style Renderer

A complete architectural overhaul of the CLI output. All thoughts, tool calls, and results now persist in your terminal scroll buffer, providing a seamless and professional experience similar to Google's own Gem CLI.

2. Intelligence Tools (Working Memory & Planning)

working_memory: A semantic scratchpad for the agent to store hypotheses and partial conclusions across multiple turns.
plan: Interactive checkpointing of .mentask_plan.md to track multi-step missions effectively.

3. Self-Critique & Error Correction

Integrated reflection loops that force the agent to analyze tool failures before attempting alternative strategies, significantly increasing operational success rates.

4. Advanced UX Commands

/undo: Instantly rollback accidental file modifications.
/artifacts: Browse and expand previous tool results with a compact, interactive UI.
/theme: Switch between premium color schemes (indigo, monokai, ocean) in real-time.

5. Robust Security & Memory

Automatic file backups and dynamic context management to prevent token overflows and memory leaks in long sessions.

Installation

Prerequisites

Python 3.10+.
A Google API Key — free at Google AI Studio.

From source (recommended)

git clone https://github.com/julesklord/mentask
cd mentask.py
python -m venv venv
# On Windows: venv\Scripts\activate
source venv/bin/activate
pip install -e ".[dev]"

Configuration

API key (Standardized)

mentask loads your key from these sources, in order:

Environment variable — GEMINI_API_KEY=your_key mentask (Preferred)
System Keyring — Secure storage via Windows Credential Manager or macOS Keychain (Recommended).
Saved file — ~/.mentask/settings.json (Local fallback).

On first launch without a key, mentask prompts interactively and saves it securely in your system's keyring.

Settings file

You can find the global configuration at ~/.mentask/settings.json.

🧠 Core Knowledge Hub

mentask now features a Hierarchical Knowledge Hub, separating core behavioral rules from user-specific customizations. The agent reloads its intelligence every turn from three layers:

📦 Standard Hub (Internal): Built-in modules defining the "Staff Engineer" persona, operational safety rules, and multimodal guidelines (audio/video/vision).
🌍 Global Hub (~/.mentask/*.md): Your cross-project technical preferences, API guidelines, or personal style.
🚀 Project Hub (.mentask/*.md): Project-specific context, build commands, architecture rules, and "Mission" specifics.

[!TIP] Just drop a .md file in any of these locations to instantly update mentask's cognitive behavior without touching the code.

👁️ Multimodal Intelligence

Fully optimized for Gemini 1.5 Pro and 2.0 Flash:

Screenshots: Analyze UI layouts and design systems.
Video: Summarize technical demos and terminal recordings.
Audio: Digest project discussions and voice notes.

Usage

Stored at ~/.mentask/settings.json (POSIX) or %APPDATA%\mentask\settings.json (Windows):

{
    "model_name": "gemini-1.5-flash",
    "edit_mode": "manual"
}

Configuration paths

Path	Purpose
`~/.mentask/settings.json`	Model name, edit mode, and user preferences
`~/.mentask/history/`	Auto-saved session JSON files
`~/.mentask/mentask.log`	Debug log — tool execution events and retry details

Usage

Launch the agent:

mentask

Common Workflows

Context Analysis: "Read my pyproject.toml and explain the dependencies."
Code Generation: "Create a src/utils.py file with a function to calculate SHA256 hashes."
Refactoring: "Refactor authenticate() in src/auth.py to use JWT instead of sessions."
Exploration: "Find all TODO comments in the project and group them by file."

Exiting

Type exit, quit, q, or press Ctrl+C.

Slash commands

Command	Description
`/help`	Show the full command reference and examples
`/model <name>`	Switch Gemini models mid-conversation (history preserved)
`/mode [auto/manual]`	Toggle between approving actions or automatic execution
`/clear`	Reset the context window to free up tokens without ending session
`/usage`	Show detailed token consumption and estimated USD cost
`/stats`	Summary of session accomplishments (messages, tools, files)
`/stop`	Interrupt the current generation immediately
`/reset`	Restart the entire session and reset all counters
`/init`	Initialize local project isolation and configuration
`/history [list/load/delete]`	Manage saved conversation sessions
`/trust [path]`	Add a directory to the permanent whitelist
`/untrust [path]`	Remove a directory from the whitelist

Safety model

Always, regardless of mode (Sandboxed Environment):

Trust Management Layer: mentask now implements a strict whitelist for file operations. By default, it can only touch the current workspace. Use /trust to authorize external paths.
Cross-Drive Protection: On Windows, the agent is blocked from crossing drive letters (e.g., C: to G:) unless the target is explicitly trusted, preventing unintended system-wide access.
Risk Analysis Engine: Powered by core/security.py, every command is categorized:
- SAFE: Informative commands (ls, git status).
- NOTICE: Standard operations.
- WARNING: High-risk patterns (sudo, sensitive file access).
- DANGEROUS: Critical risk (rm -rf, fork bombs, world-writable chmod).
Atomic Writing: edit_file uses a temporary file + rename strategy to prevent corruption.
Automatic Backups: Every file modification creates a .bkp backup at <path>.bkp.
Hard Timeouts: Shell commands have a strict 60-second execution limit.

Architecture

mentask operates across three tightly decoupled layers enforcing strong logical boundaries. As of version 0.16.0, the system has evolved into an Orchestrated Architecture, where a central engine manages cognitive managers, security centinels, and an autonomous LSP verification loop.

High-Level System Diagram

flowchart TD
    CLI(["User execution (mentask)"]) --> Main(cli/main.py)
    Main --> Renderer(cli/renderer.py)
    Renderer <--> Orchestrator(agent/orchestrator.py: AgentOrchestrator)
    
    subgraph Cognitive_Layer [Cognitive Managers]
        Orchestrator --> Session[agent/core/session.py]
        Orchestrator --> Context[agent/core/context.py]
        Orchestrator --> Stream[agent/core/stream.py]
        Orchestrator --> Commands[agent/core/commands.py]
        Orchestrator --> Simulation[agent/core/simulation.py]
    end

    Orchestrator <--> GenAI[Google Gemini API]
    
    subgraph Security_Layer [Security & Trust Centinel]
        GenAI -. function calls .-> Trust[core/trust_manager.py]
        Trust --> SecurityCheck[core/security.py]
        SecurityCheck --> Tools(tools/)
    end

    Tools --> localDisk[(Local Workspace)]
    Context -. Blueprint .-> localDisk

Layer Breakdown

Presentation Layer (cli/): Handles CLI startup, interactive prompts, audit views, and real-time Markdown rendering.
Cognitive Layer (agent/): The "Brain". Powered by the AgentOrchestrator, it manages state, context blueprints, and mission tracking.
Security Layer (core/): The "Guard". Gathers risk analysis and whitelisting logic to ensure the agent never exceeds its authority.

Project Structure (v0.16.4)

mentask.py/
├── src/mentask/
│   ├── __init__.py              # Single source of truth for version (0.18.5)
│   ├── agent/
│   │   ├── orchestrator.py      # The Reasoning Brain — Thinking/Action/Observation
│   │   ├── schema.py            # Unified message and tool schemas
│   │   └── core/                # Cognitive Managers
│   │       ├── session.py       # API lifecycle, Retries and Error handling
│   │       ├── context.py       # Blueprint, Memory and Mission management
│   │       ├── commands.py      # Slash command handler
│   │       └── simulation.py    # Deterministic loop recording
│   ├── cli/
│   │   ├── main.py              # Entry point and session initialization
│   │   ├── renderer.py          # Rich streaming renderer and interactive prompts
│   ├── core/
│   │   ├── security.py          # Hardened safety engine
│   │   ├── trust_manager.py     # Directory trust whitelist control
│   │   ├── paths.py             # OS-agnostic path resolution (Workspace aware)
│   ├── tools/                   # Atomic agentic tools
│   └── locales/                 # i18n JSON data (8 languages supported)
├── tests/                       # Reliable unit and integration tests
├── scripts/                     # Maintenance and diagnostic utilities
├── docs/                        # Rich documentation and assets
└── pyproject.toml

Development & Simulation

Setup

git clone https://github.com/julesklord/mentask
cd mentask.py
pip install -e ".[dev]"

Reliable Testing Protocol

mentask introduces a Simulation Layer. You can record agent turns and play them back deterministically:

Record: Set SIMULATION_MODE=record to capture interactions.
Playback: Run pytest tests/integration/test_full_agent_loop.py to verify the logic against the recorded transcript without hitting the real API.

Tests & Linting

pytest tests/                   # full reliable suite
ruff check src/ tests/ --fix    # auto-fix linting violations

Internationalization

mentask is English-First at the SDK/System level for maximum model reliability, but the entire user interface supports 8 languages:

Code	Language	File
`en`	English (Standard)	`en.json`
`es`	Español	`es.json`
`fr`	Français	`fr.json`
`pt`	Português	`pt.json`
`de`	Deutsch	`de.json`
`it`	Italiano	`it.json`
`ja`	日本語	`ja.json`
`zh`	中文 (简体)	`zh.json`

Repository Standard

mentask.py is now the reference repo for structure, hygiene, and architecture conventions in this workspace.

See STANDARD.md for the operating standard to apply across the other repositories.

Roadmap

Version	Theme	Status
`v0.14.0`	Stability, Renderer Polish	✅ Done
`v0.15.0`	Kwisatz Haderach - LSP Integration	✅ Done
`v0.16.0`	The Golden Path - Professional Recovery	✅ Done
`v0.18.0`	Lisan al-Gaib - Cognitive Architecture	✅ Done
`v0.18.5`	Full Branding & Stability	✅ Done
`v0.19.0`	Water of Life - Self-Healing Loop	📋 Planned
`v0.20.0`	God Emperor - Absolute Orchestration	📋 Planned

License

GNU General Public License v3.0 — see LICENSE for full terms.

Built by julesklord.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.20.1

Apr 26, 2026

0.20.0

Apr 26, 2026

0.19.5

Apr 26, 2026

This version

0.18.6

Apr 26, 2026

0.18.5

Apr 26, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mentask-0.18.6.tar.gz (121.3 kB view details)

Uploaded Apr 26, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

mentask-0.18.6-py3-none-any.whl (140.0 kB view details)

Uploaded Apr 26, 2026 Python 3

File details

Details for the file mentask-0.18.6.tar.gz.

File metadata

Download URL: mentask-0.18.6.tar.gz
Upload date: Apr 26, 2026
Size: 121.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.4

File hashes

Hashes for mentask-0.18.6.tar.gz
Algorithm	Hash digest
SHA256	`bba2758a6e9e5121b21297d49b832692a3c034957d59c1d070862a5c17d21f86`
MD5	`3807582962860bac186be6849ad4426a`
BLAKE2b-256	`9c416479956c2ff97c6984afdb317ac65759091f50892d50129edb688e6e298d`

See more details on using hashes here.

File details

Details for the file mentask-0.18.6-py3-none-any.whl.

File metadata

Download URL: mentask-0.18.6-py3-none-any.whl
Upload date: Apr 26, 2026
Size: 140.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.4

File hashes

Hashes for mentask-0.18.6-py3-none-any.whl
Algorithm	Hash digest
SHA256	`eaeb7049904092ec6f312be2fc4dfc1cae75d3e4d72ee3d60081eedbecdf90f2`
MD5	`302fbb52f31a306b1ad79b5e7342d642`
BLAKE2b-256	`b77ecf0e905288c0b32cede01d28a1fb5e845e51de2a12586a80f89f0685f4f1`

See more details on using hashes here.

mentask 0.18.6

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

mentask — Autonomous AI Coding Agent for the Terminal

Contents

How it works

Features

Agentic tool engine

Workspace Isolation & Local Intelligence

Human-in-the-loop safety

Streaming terminal UX

Persistent session history

New in v0.18.5: Lisan al-Gaib

1. Persistent Gem-Style Renderer

2. Intelligence Tools (Working Memory & Planning)

3. Self-Critique & Error Correction

4. Advanced UX Commands

5. Robust Security & Memory

Installation

Prerequisites

From source (recommended)

Configuration

API key (Standardized)

Settings file

🧠 Core Knowledge Hub

👁️ Multimodal Intelligence

Usage

Configuration paths

Usage

Common Workflows

Exiting

Slash commands

Safety model

Architecture

High-Level System Diagram

Layer Breakdown

Project Structure (v0.16.4)

Development & Simulation

Setup

Reliable Testing Protocol

Tests & Linting

Internationalization

Repository Standard

Roadmap

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes