Skip to main content

Automated OAuth wrapper for Google Gemini CLI on macOS

Project description

geminpy

Automated OAuth wrapper for Google's Gemini CLI on macOS

geminpy eliminates the manual authentication steps required by Google's gemini CLI tool. It automatically handles the entire OAuth flow - from browser management to account selection - letting you use Gemini programmatically without interruption.

Why It Exists

Google's official gemini CLI requires manual OAuth authentication through a web browser each time you use it. This makes automation impossible and interrupts workflows. geminpy solves this by:

  1. Automating the OAuth dance - No manual clicking through Google's authentication screens
  2. Managing browser contexts - Uses isolated Chrome for Testing to avoid conflicts
  3. Handling rate limits gracefully - Automatically retries with flash model when rate limited
  4. Providing clean outputs - Filters authentication noise from responses

What It Does

geminpy acts as a transparent wrapper around the official gemini CLI, adding automation capabilities:

  • Drop-in CLI replacement: Use it exactly like the original gemini command
  • Programmatic API: Call Gemini from Python code with ask() function
  • Multi-language support: Handles OAuth pages in 9+ languages
  • Smart user detection: Remembers your preferred Google account
  • Rate limit resilience: Automatic fallback to gemini-2.5-flash when quota exceeded
  • Browser isolation: Uses Chrome for Testing to avoid disrupting your main browser

How It Works

Technical Architecture

The package orchestrates several components to achieve seamless automation:

User Request → GeminiClient → Browser Setup → OAuth Automation → Gemini CLI → Response
                     ↓              ↓               ↓                ↓
              BrowserManager  ChromeManager  OAuthAutomator  GeminiExecutor
                     ↓              ↓               ↓                ↓
              macdefaultbrowsy  Chrome CDP    Playwright      subprocess

Automation Flow

  1. Browser Preparation

    • Installs Chrome for Testing if needed (via @puppeteer/browsers)
    • Saves current default browser
    • Temporarily switches to Chrome for Testing
  2. OAuth Automation

    • Launches Chrome with remote debugging (--remote-debugging-port=9222)
    • Starts gemini CLI which opens OAuth URL
    • Playwright connects via Chrome DevTools Protocol
    • Automatically clicks your Google account
    • Detects and clicks sign-in button (multi-language, multi-strategy)
    • Waits for authentication success
  3. Execution & Monitoring

    • Monitors gemini process output in real-time
    • Detects rate limits (429, quota exceeded, etc.)
    • Automatically retries with fallback model if needed
    • Extracts clean response from CLI output
  4. Cleanup

    • Restores original default browser
    • Optionally quits Chrome
    • Returns pure response text

Key Components

  • browser.manager: Controls macOS default browser via macdefaultbrowsy
  • browser.chrome: Manages Chrome for Testing installation and lifecycle
  • browser.automation: Playwright-based OAuth flow automation
  • gemini.client: Main orchestrator coordinating all components
  • gemini.executor: Subprocess management with real-time monitoring
  • gemini.parser: Response extraction and cleaning

Installation

Prerequisites

macOS only - Browser automation requires macOS-specific tools.

Install geminpy

uv pip install geminpy

One-time setup: Install Playwright browsers

playwright install chromium

Usage

CLI Usage

Use exactly like the original gemini CLI:

# Ask a question
geminpy -p "Explain Python decorators"

# Use specific model with new shortcuts
geminpy -P -p "Write a Python function"  # Uses gemini-2.5-pro
geminpy -F -p "Quick question"           # Uses gemini-2.5-flash

# Traditional model selection still works
geminpy -m "gemini-pro" -p "Complex analysis"

# Enable verbose logging
geminpy --verbose -p "Debug this"

# Quit Chrome after completion
geminpy --quit-chrome -p "One-off query"

Programmatic Usage

from geminpy import ask

# Simple question-answer
response = ask("Explain quantum computing")
print(response)

# Async usage with full control
import asyncio
from geminpy import call_gemini_cli

async def main():
    response = await call_gemini_cli(
        gemini_args=["-m", "gemini-pro", "-p", "Your prompt"],
        user="your.email@gmail.com",
        verbose=True,
        quit_browser=True
    )
    print(response)

asyncio.run(main())

Configuration

User Account Resolution

geminpy determines which Google account to use in this priority order:

  1. CLI argument: --user="you@gmail.com"
  2. Environment variable: GEMINI_CLI_USER="you@gmail.com"
  3. Stored settings: From previous successful authentication
  4. First available: Uses first Google account found

Settings Storage

Settings are automatically persisted to:

~/Library/Application Support/com.twardoch.chrometesting/settings.json

Contains:

  • Chrome for Testing executable path
  • Last used Google account email

Advanced Features

Multi-Language OAuth Support

Detects sign-in buttons in multiple languages:

  • English, Polish, French, German, Spanish
  • Italian, Russian, Japanese, Chinese
  • Falls back to attribute and style-based detection

Rate Limit Handling

Automatic detection and retry logic:

Original request → Rate limit detected → Retry with gemini-2.5-flash → Final response

Browser Isolation

  • Uses dedicated Chrome for Testing instance
  • Preserves your regular browser state
  • No profile contamination
  • Clean OAuth every time

Troubleshooting

Common Issues

"Chrome CDP did not become available"

  • Check if port 9222 is available: lsof -i :9222
  • Look at Chrome logs: /tmp/gemini_chrome_stderr.log

"Could not find sign-in button"

  • Enable verbose mode: --verbose
  • Check screenshots: oauth_error.png, oauth_error_no_signin.png
  • Ensure your Google account has Gemini access

Debug Mode

Enable comprehensive logging:

geminpy --verbose -p "Your question"

Shows:

  • Chrome installation progress
  • Browser switching operations
  • OAuth automation steps
  • Gemini CLI interactions
  • Response parsing details

Development

Project Structure

src/geminpy/
├── browser/          # Browser automation components
│   ├── automation.py # OAuth flow automation
│   ├── chrome.py     # Chrome for Testing management
│   └── manager.py    # Default browser control
├── gemini/           # Gemini CLI integration
│   ├── client.py     # Main orchestrator
│   ├── executor.py   # Process management
│   └── parser.py     # Response extraction
├── core/             # Core utilities
│   ├── config.py     # Configuration
│   ├── constants.py  # Constants
│   └── exceptions.py # Custom exceptions
├── utils/            # Utilities
│   ├── platform.py   # Platform checks
│   └── storage.py    # Settings persistence
├── api.py            # Public API
└── cli.py            # CLI interface

Running Tests

# Run all tests
uvx hatch run test

# Run with coverage
uvx hatch run test-cov

# Type checking
uvx hatch run type-check

# Linting
uvx hatch run lint

Building

# Build package
uvx hatch build

# Install locally
uv pip install --system -e .

Security

  • No credential storage - Only stores email preference
  • Local automation only - All OAuth happens on your machine
  • Temporary browser access - Restored after each use
  • Process isolation - Chrome runs in separate process

Requirements

  • Platform: macOS (Darwin) only
  • Python: 3.10, 3.11, or 3.12
  • Browser: Chrome for Testing (auto-installed)

License

MIT License - see LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

geminpy-1.1.2.tar.gz (24.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

geminpy-1.1.2-py3-none-any.whl (27.1 kB view details)

Uploaded Python 3

File details

Details for the file geminpy-1.1.2.tar.gz.

File metadata

  • Download URL: geminpy-1.1.2.tar.gz
  • Upload date:
  • Size: 24.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-httpx/0.28.1

File hashes

Hashes for geminpy-1.1.2.tar.gz
Algorithm Hash digest
SHA256 61d25709e530826cd0dd9f2fdf7ee8d5159e324bd3c573c972cce612b43e90b8
MD5 e41bafb33bc411740b7e5001cdf9ee8a
BLAKE2b-256 b57a2a184622b3fd8509702d38da22d5a1937de21102c6f246b28ccfd070637f

See more details on using hashes here.

File details

Details for the file geminpy-1.1.2-py3-none-any.whl.

File metadata

  • Download URL: geminpy-1.1.2-py3-none-any.whl
  • Upload date:
  • Size: 27.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-httpx/0.28.1

File hashes

Hashes for geminpy-1.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 ee78de43b1c96be181ce7a2b5183ac15486eb1bcabcf8cad4d3747d67d449f56
MD5 c10ee5acc2dccbc1745bf94c2e8b5189
BLAKE2b-256 cf584fc0e3c73dc23de2610bab0845d81d4364d1c4f08cf5666560e16df7f211

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page