Skip to main content

A resilient, zero-dependency Python wrapper for the official Google Gemini Node.js CLI.

Project description

Gemini CLI Headless

Note: Project finalized until upstream gemini-cli path-bug resolution or new requests.

Gemini CLI Headless Hero

gemini-cli-headless is a Python-based wrapper for the Gemini CLI. It provides a secure, programmatically controllable execution environment designed for autonomous agents, automated workflows, and complex data extraction. This wrapper is fully tested and supported on both Windows and Linux, automatically adapting its security boundaries to the host OS. For technical details on how OS differences are handled, see Cross-Platform Architecture.

Quick Start

Prerequisite: Both usage and testing of this library require a valid Google Gemini API key. Ensure it is available in your environment before running any code:

# Windows
$env:GEMINI_API_KEY="your-api-key"

# Linux / macOS
export GEMINI_API_KEY="your-api-key"

Alternatively, you can pass it directly to the function using the api_key argument. The wrapper will fail with a clear ValueError if the key is completely missing.

Example 1: The Secure File Inspector Create an agent that can read files to answer questions, but physically prevent it from modifying your project or running any shell commands.

import os
from gemini_cli_headless import run_gemini_cli_headless

project_root = os.path.abspath("./my_project")

session = run_gemini_cli_headless(
    prompt="Read the auth.ts file and summarize its exported functions.",
    cwd=project_root,
    # Strictly limit the physical sandbox to read-only tools
    allowed_tools=["read_file", "list_directory", "grep_search"]
)

print(session.text)

Example 2: The Strict Data Bot (No Tools, Custom Persona) Wipe the default Software Engineer identity entirely. Prevent the model from using any tools, ensuring it only processes the text provided.

from gemini_cli_headless import run_gemini_cli_headless

strict_persona = """
You are a DATA_BOT. You do not write code. 
You extract names from text and return them as a JSON array.
No preamble, no explanation, no tools.
"""

session = run_gemini_cli_headless(
    prompt="Hello, my name is Alice and I work with Bob.",
    system_instruction_override=strict_persona,
    allowed_tools=[], # Physically disable all tool access
)

print(session.text)
# Output: ["Alice", "Bob"]

Why this library?

If you try to orchestrate the official Gemini CLI headlessly out-of-the-box, you quickly realize it is a disaster waiting to happen. The raw CLI is optimized for interactive developer usage, not programmatic control.

When building workflows, developers face enormous pain points that gemini-cli-headless solves:

1. The Persona Problem & Model Identity The CLI has a hardcoded "Software Engineer" identity. Try asking it to simply extract JSON from a document, and it will often refuse or start explaining its engineering credentials.

  • Our Solution: Use the system_instruction_override parameter to completely wipe the agent's mind and replace it with your instructions. Read about how we handle model paranoia and identity in Controlling the Agent's Mind.

2. Inconsistent Sandboxing & Dangerous Defaults Headless mode requires using --raw-output and the --yolo flag. By default, the agent has free rein over your filesystem and shell. Trying to restrict the agent to a specific folder or a specific set of tools via CLI flags is extremely difficult and non-transparent.

3. Hierarchical Context Pollution If you run the raw CLI inside your project, it stealthily searches parent directories for GEMINI.md files. Your headless bot's behavior will mysteriously change depending on which folder it runs in because it's secretly inheriting external project rules.

  • Our Solution: We built a surgical environment trick (isolate_from_hierarchical_pollution=True) that forces the CLI into a clean room using GEMINI_CLI_HOME and GEMINI_SYSTEM_MD. This guarantees your persona remains pure and prevents parent folder pollution, while our custom Workspace Root Resolution and Robust Session Discovery ensure that chat histories are still reliably found and saved in the correct project directory. Understand our overarching philosophy in How We Tamed the Engine (Architecture Overview).

We have done our best not only to provide clear controls for these challenges, but also to create a suite of smart edge-case tests to verify this safety. You can learn about our trace auditing in How We Test. For detailed API references and advanced configuration options, also take a look at the Usage & Examples page.

Best Practices

1. Workspace Isolation

If you are using system_instruction_override to create a pure data bot, the wrapper defaults to isolate_from_hierarchical_pollution=True. This prevents the CLI from walking up the directory tree and discovering GEMINI.md files from your parent projects. Do not disable this flag unless you explicitly want your headless agent to adopt the "Software Engineer" identity of the surrounding workspace.

2. Recommended Models

For the best balance of speed, cost, and obedience to the strict sandboxing rules, we strongly recommend using the following specific models:

  • gemini-3-flash-preview: Excellent middle ground for agents that need to use basic tools (read/write files) rapidly.
  • gemini-3.1-pro-preview: Use this when the task requires deep reasoning or complex, multi-step orchestrations.
  • gemini-3.1-flash-lite-preview: Best for high-volume, tool-restricted tasks and data extraction. Fast and cheap.

3. Testing

To verify physical security and cognitive obedience, use our custom Integration Test Battery. Use python tests/run_integration_tests.py gemini-3-flash-preview to run tests. More details in Trace Auditing & Testing.

Tests yield either a non-fatal [MODEL FAIL] (a cognitive refusal) or a critical [ENGINE FAIL] (a physical sandbox leak). Testing is handled via a Local Opt-Out Git Hook, which you can skip by committing with git push --no-verify.


⚠️ Critical Warnings

When operating gemini-cli-headless in production, you must understand the following critical constraints:

1. Broken Path Security (Upstream Bug)

🚨 CRITICAL WARNING: PATH SECURITY IS CURRENTLY BROKEN 🚨 Do NOT use the allowed_paths parameter in the current version. Due to a static compiler bug in the upstream Gemini CLI policy engine, attempting to restrict paths will permanently delete all tools from the agent's schema, causing severe hallucinations. Rely on allowed_tools and allowed_commands for security instead. (See the canary_tool_presence_baseline vs canary_upstream_compiler_bug tests in our integration suite for reproducible proof of this defect).

Note: The library will actively emit a logger.warning() to your console at runtime if it detects you attempting to use allowed_paths to prevent accidental deployments of broken agents.

2. Version Lock & System Brittleness

This orchestrator relies on deeply undocumented internal mechanics of the Gemini CLI's policy engine. It is version-locked and certified ONLY for Gemini CLI v0.38.2. Using newer versions may cause the sandbox to silently fail.

  • Action: Never auto-update the underlying CLI in your production environments. We maintain an Automated Nightly Monitor via GitHub Actions to detect breaking upstream changes immediately. See Version Lock & Stability for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gemini_cli_headless-5.0.3.tar.gz (12.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

gemini_cli_headless-5.0.3-py3-none-any.whl (12.4 kB view details)

Uploaded Python 3

File details

Details for the file gemini_cli_headless-5.0.3.tar.gz.

File metadata

  • Download URL: gemini_cli_headless-5.0.3.tar.gz
  • Upload date:
  • Size: 12.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for gemini_cli_headless-5.0.3.tar.gz
Algorithm Hash digest
SHA256 cfc18dccaab6047ab3c80677e1c2281f06c7054bd6d3007498768c69b6d372a1
MD5 82dcc30750fbf932f2d84bda5c424290
BLAKE2b-256 cd0efdb65a7486580017c3969d571e182f296c29dd1963d495b179d9a4a8e1cd

See more details on using hashes here.

Provenance

The following attestation bundles were made for gemini_cli_headless-5.0.3.tar.gz:

Publisher: publish.yml on jarek108/gemini-cli-headless

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file gemini_cli_headless-5.0.3-py3-none-any.whl.

File metadata

File hashes

Hashes for gemini_cli_headless-5.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 7bc4c25c3e33a06d71389a330b6d691efd3e2591b05fef8784e5f9df911b86e6
MD5 2a619a6abee2fbcfe6cb059e8d10f51c
BLAKE2b-256 f4989f855d277efa6ff593e2b5ad06c98e4a88269a8de8af7a1005067e557939

See more details on using hashes here.

Provenance

The following attestation bundles were made for gemini_cli_headless-5.0.3-py3-none-any.whl:

Publisher: publish.yml on jarek108/gemini-cli-headless

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page