Skip to main content

Automated OAuth wrapper for Google Gemini CLI on macOS

Project description

geminpy

Automated OAuth wrapper for Google's Gemini CLI on macOS

geminpy eliminates the manual authentication steps required by Google's gemini CLI tool. It automatically handles the entire OAuth flow - from browser management to account selection - letting you use Gemini programmatically without interruption.

Why It Exists

Google's official gemini CLI requires manual OAuth authentication through a web browser each time you use it. This makes automation impossible and interrupts workflows. geminpy solves this by:

  1. Automating the OAuth dance - No manual clicking through Google's authentication screens
  2. Managing browser contexts - Uses isolated Chrome for Testing to avoid conflicts
  3. Handling rate limits gracefully - Automatically retries with flash model when rate limited
  4. Providing clean outputs - Filters authentication noise from responses

What It Does

geminpy acts as a transparent wrapper around the official gemini CLI, adding automation capabilities:

  • Drop-in CLI replacement: Use it exactly like the original gemini command
  • Programmatic API: Call Gemini from Python code with ask() function
  • Multi-language support: Handles OAuth pages in 9+ languages
  • Smart user detection: Remembers your preferred Google account
  • Rate limit resilience: Automatic fallback to gemini-2.5-flash when quota exceeded
  • Browser isolation: Uses Chrome for Testing to avoid disrupting your main browser

How It Works

Technical Architecture

The package orchestrates several components to achieve seamless automation:

User Request → GeminiClient → Browser Setup → OAuth Automation → Gemini CLI → Response
                     ↓              ↓               ↓                ↓
              BrowserManager  ChromeManager  OAuthAutomator  GeminiExecutor
                     ↓              ↓               ↓                ↓
              macdefaultbrowsy  Chrome CDP    Playwright      subprocess

Automation Flow

  1. Browser Preparation

    • Installs Chrome for Testing if needed (via @puppeteer/browsers)
    • Saves current default browser
    • Temporarily switches to Chrome for Testing
  2. OAuth Automation

    • Launches Chrome with remote debugging (--remote-debugging-port=9222)
    • Starts gemini CLI which opens OAuth URL
    • Playwright connects via Chrome DevTools Protocol
    • Automatically clicks your Google account
    • Detects and clicks sign-in button (multi-language, multi-strategy)
    • Waits for authentication success
  3. Execution & Monitoring

    • Monitors gemini process output in real-time
    • Detects rate limits (429, quota exceeded, etc.)
    • Automatically retries with fallback model if needed
    • Extracts clean response from CLI output
  4. Cleanup

    • Restores original default browser
    • Optionally quits Chrome
    • Returns pure response text

Key Components

  • browser.manager: Controls macOS default browser via macdefaultbrowsy
  • browser.chrome: Manages Chrome for Testing installation and lifecycle
  • browser.automation: Playwright-based OAuth flow automation
  • gemini.client: Main orchestrator coordinating all components
  • gemini.executor: Subprocess management with real-time monitoring
  • gemini.parser: Response extraction and cleaning

Installation

Prerequisites

macOS only - Browser automation requires macOS-specific tools.

Install geminpy

uv pip install geminpy

One-time setup: Install Playwright browsers

playwright install chromium

Usage

CLI Usage

Use exactly like the original gemini CLI:

# Ask a question
geminpy -p "Explain Python decorators"

# Use specific model with new shortcuts
geminpy -P -p "Write a Python function"  # Uses gemini-2.5-pro
geminpy -F -p "Quick question"           # Uses gemini-2.5-flash

# Traditional model selection still works
geminpy -m "gemini-pro" -p "Complex analysis"

# Enable verbose logging
geminpy --verbose -p "Debug this"

# Quit Chrome after completion
geminpy --quit-chrome -p "One-off query"

Programmatic Usage

from geminpy import ask

# Simple question-answer
response = ask("Explain quantum computing")
print(response)

# Async usage with full control
import asyncio
from geminpy import call_gemini_cli

async def main():
    response = await call_gemini_cli(
        gemini_args=["-m", "gemini-pro", "-p", "Your prompt"],
        user="your.email@gmail.com",
        verbose=True,
        quit_browser=True
    )
    print(response)

asyncio.run(main())

Configuration

User Account Resolution

geminpy determines which Google account to use in this priority order:

  1. CLI argument: --user="you@gmail.com"
  2. Environment variable: GEMINI_CLI_USER="you@gmail.com"
  3. Stored settings: From previous successful authentication
  4. First available: Uses first Google account found

Settings Storage

Settings are automatically persisted to:

~/Library/Application Support/com.twardoch.chrometesting/settings.json

Contains:

  • Chrome for Testing executable path
  • Last used Google account email

Advanced Features

Multi-Language OAuth Support

Detects sign-in buttons in multiple languages:

  • English, Polish, French, German, Spanish
  • Italian, Russian, Japanese, Chinese
  • Falls back to attribute and style-based detection

Rate Limit Handling

Automatic detection and retry logic:

Original request → Rate limit detected → Retry with gemini-2.5-flash → Final response

Browser Isolation

  • Uses dedicated Chrome for Testing instance
  • Preserves your regular browser state
  • No profile contamination
  • Clean OAuth every time

Troubleshooting

Common Issues

"Chrome CDP did not become available"

  • Check if port 9222 is available: lsof -i :9222
  • Look at Chrome logs: /tmp/gemini_chrome_stderr.log

"Could not find sign-in button"

  • Enable verbose mode: --verbose
  • Check screenshots: oauth_error.png, oauth_error_no_signin.png
  • Ensure your Google account has Gemini access

Debug Mode

Enable comprehensive logging:

geminpy --verbose -p "Your question"

Shows:

  • Chrome installation progress
  • Browser switching operations
  • OAuth automation steps
  • Gemini CLI interactions
  • Response parsing details

Development

Project Structure

src/geminpy/
├── browser/          # Browser automation components
│   ├── automation.py # OAuth flow automation
│   ├── chrome.py     # Chrome for Testing management
│   └── manager.py    # Default browser control
├── gemini/           # Gemini CLI integration
│   ├── client.py     # Main orchestrator
│   ├── executor.py   # Process management
│   └── parser.py     # Response extraction
├── core/             # Core utilities
│   ├── config.py     # Configuration
│   ├── constants.py  # Constants
│   └── exceptions.py # Custom exceptions
├── utils/            # Utilities
│   ├── platform.py   # Platform checks
│   └── storage.py    # Settings persistence
├── api.py            # Public API
└── cli.py            # CLI interface

Running Tests

# Run all tests
uvx hatch run test

# Run with coverage
uvx hatch run test-cov

# Type checking
uvx hatch run type-check

# Linting
uvx hatch run lint

Building

# Build package
uvx hatch build

# Install locally
uv pip install --system -e .

Security

  • No credential storage - Only stores email preference
  • Local automation only - All OAuth happens on your machine
  • Temporary browser access - Restored after each use
  • Process isolation - Chrome runs in separate process

Requirements

  • Platform: macOS (Darwin) only
  • Python: 3.10, 3.11, or 3.12
  • Browser: Chrome for Testing (auto-installed)

License

MIT License - see LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

geminpy-1.4.0.tar.gz (25.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

geminpy-1.4.0-py3-none-any.whl (27.7 kB view details)

Uploaded Python 3

File details

Details for the file geminpy-1.4.0.tar.gz.

File metadata

  • Download URL: geminpy-1.4.0.tar.gz
  • Upload date:
  • Size: 25.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-httpx/0.28.1

File hashes

Hashes for geminpy-1.4.0.tar.gz
Algorithm Hash digest
SHA256 10103489249b26657ab96960fc0d150222fded40508005bb2d9affdaab23baf3
MD5 8a01f9236b4710d9774cc13dc9051533
BLAKE2b-256 3d7f838bbe9c399f84283b5a3b2987c9a155485e4fb5c6a49b65756b1d44cfe3

See more details on using hashes here.

File details

Details for the file geminpy-1.4.0-py3-none-any.whl.

File metadata

  • Download URL: geminpy-1.4.0-py3-none-any.whl
  • Upload date:
  • Size: 27.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-httpx/0.28.1

File hashes

Hashes for geminpy-1.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 1d1c304a131145c5cd6aa2b5d131f3e50decb99fcb00520d3fa3ca8dbaabdf20
MD5 e2315d23d1f2bb5eb22c0105d276e12b
BLAKE2b-256 0adbdef97578c693a62b15f6f96777364ec0999b42f895919e93e91d2d51ba48

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page