A lightweight Python library for Google Gemini API key rotation and model fallback.

These details have been verified by PyPI

Project links

Homepage

GitHub Statistics

Maintainers

jayeeed

Project description

Gemini Rotate

Async Python

A lightweight Python library for Google Gemini API key rotation, valid model selection, and automatic fallback to "Lite" models on server errors. Supports both Async and Sync execution.

🚀 Features

✅ Automatic Key Rotation: Seamlessly rotates through a list of API keys when quota is exhausted (429), permission denied (403), or any other API error occurs.
🔄 Smart Model Fallback: Automatically downgrades specific models if server errors (5xx) persist.
⚡ Async & Sync Support: Built on top of the google-genai client, offering standard generation (generate_content, generate_content_sync) and streaming generation (generate_content_stream, generate_content_stream_sync) methods for high-performance and standard applications.
🛡️ Robust Error Handling: Implements exponential backoff before rotating keys or switching models.
📝 Concise Logging: Logs only essential success/failure information (e.g., 400 INVALID_ARGUMENT) to keep your console clean.
📊 Integrated LangSmith Tracing: Zero-setup wrapper integration with LangSmith. Automatically traces requests, attributes success to specific API clients and models, and logs accurate pricing and token metadata.

📦 Installation

pip install gemini-rotate

⚡ Quick Start

Configure Environment: Copy .env.example to .env and configure your API keys:

GEMINI_API_KEY_1="AIzaSy..."
GEMINI_API_KEY_2="AI3yhj..."
GEMINI_API_KEY_3="AIdf56..."

Run Code:

import asyncio
from gemini_rotate import GeminiRotationClient

async def main():
    client = GeminiRotationClient()
    response = await client.generate_content("Hello, Gemini!")
    print(response.text)

asyncio.run(main())

📖 Usage Guide

Initialization

The client automatically loads API keys from your environment variables (GEMINI_API_KEY_1, GEMINI_API_KEY_2, etc.).

client = GeminiRotationClient()

Generating Content

The library provides both asynchronous (generate_content) and synchronous (generate_content_sync) methods. Both methods wrap the standard google-genai calls but add rotation and fallback logic.

1. Async Text Generation

import asyncio
from gemini_rotate import GeminiRotationClient
from dotenv import load_dotenv

load_dotenv()

async def generate_text():
    client = GeminiRotationClient()
    try:
        response = await client.generate_content(
            contents="Explain quantum computing in 50 words."
        )
        print(f"Generated text: {response.text}")
    except Exception as e:
        print(f"Error: {e}")

if __name__ == "__main__":
    asyncio.run(generate_text())

2. Sync Text Generation

from gemini_rotate import GeminiRotationClient
from dotenv import load_dotenv

load_dotenv()

def generate_text_sync():
    client = GeminiRotationClient()
    try:
        response = client.generate_content_sync(
            contents="Explain quantum computing in 50 words."
        )
        print(f"Generated text: {response.text}")
    except Exception as e:
        print(f"Error: {e}")

if __name__ == "__main__":
    generate_text_sync()

3. Streaming Responses (Async & Sync)

You can stream responses chunk-by-chunk. If the connection fails before yielding the first chunk, the client automatically rotates to the next API key or model fallback. If a failure occurs mid-stream (after yielding some chunks), the client raises the error immediately to avoid duplicate outputs.

Async Streaming:

import asyncio
from gemini_rotate import GeminiRotationClient
from dotenv import load_dotenv

load_dotenv()

async def main():
    client = GeminiRotationClient()
    try:
        async for chunk in client.generate_content_stream("Write a short story about space."):
            print(chunk.text or "", end="", flush=True)
        print()
    except Exception as e:
        print(f"\nError: {e}")

if __name__ == "__main__":
    asyncio.run(main())

Sync Streaming:

from gemini_rotate import GeminiRotationClient
from dotenv import load_dotenv

load_dotenv()

def main():
    client = GeminiRotationClient()
    try:
        for chunk in client.generate_content_stream_sync("Explain gravity in one sentence."):
            print(chunk.text or "", end="", flush=True)
        print()
    except Exception as e:
        print(f"\nError: {e}")

if __name__ == "__main__":
    main()

4. Advanced: Tool Calling & Structured Output (Async Example)

You can pass tools and response_schema (or response_mime_type) via the config parameter.

import asyncio
from google import genai
from gemini_rotate import GeminiRotationClient
from pydantic import BaseModel
from dotenv import load_dotenv

load_dotenv()

# Define a schema for structured output
class Recipe(BaseModel):
    title: str
    ingredients: list[str]
    instructions: list[str]

async def generate_recipe():
    client = GeminiRotationClient()

    try:
        response = await client.generate_content(
            contents="Give me a recipe for chocolate cake.",
            config={
                "response_mime_type": "application/json",
                "response_schema": Recipe,
            }
        )
        
        # Parse result directly into Pydantic model
        recipe = response.parsed
        print(f"Title: {recipe.title}")
        print(f"Ingredients: {recipe.ingredients}")
        
    except Exception as e:
        print(f"Error: {e}")

5. LangSmith Tracing Integration

gemini-rotate automatically wraps the internal Google GenAI clients using LangSmith's standard Gemini wrapper. This enables automatic tracing of your generated content requests.

To activate tracing, configure your environment with the standard LangSmith variables:

export LANGCHAIN_TRACING_V2="true"
export LANGCHAIN_API_KEY="your-api-key"
# Optional:
export LANGCHAIN_PROJECT="my-gemini-project"

By default, the library traces requests with the tag "gemini-rotate" and integration metadata. You can customize tags, metadata, or extra parameters by passing tracing_extra to the GeminiRotationClient constructor:

import asyncio
from gemini_rotate import GeminiRotationClient

async def main():
    # Initialize client with custom tracing tags and metadata
    client = GeminiRotationClient(
        tracing_extra={
            "tags": ["production", "chatbot"],
            "metadata": {
                "user_id": "user_123"
            }
        }
    )
    
    response = await client.generate_content("Describe neural networks in one sentence.")
    print(response.text)

if __name__ == "__main__":
    asyncio.run(main())

Parameters

Parameter	Type	Description
`tracing_extra`	`dict`	(Optional) Extra tracing configuration (tags, metadata) passed to LangSmith's wrapping utility.
`contents`	`str` \| `list`	The prompt or content to send.
`config`	`dict`	(Optional) Generation config (temperature, tools, schema) passed to `google.genai`.

⚙️ Configuration

1. The `.env` Format (Expected)

To configure the library, copy the provided .env.example file to .env in the root of your project.

The environment configuration supports the following parameters:

# Required: Define your Gemini API keys using the pattern GEMINI_API_KEY_<number>
GEMINI_API_KEY_1="AIzaSy..."
GEMINI_API_KEY_2="AI3yhj..."
GEMINI_API_KEY_3="AIdf56..."

# Optional: Define your models in a valid JSON array format.
# The models will be processed in pairs (Primary -> Secondary fallback).
GEMINI_MODELS='["gemini-3.5-flash", "gemini-3.1-pro-preview"]'

# Optional: LangSmith Tracing integration config.
# Note: Leaving these variables empty (e.g. LANGCHAIN_TRACING_V2=) or unconfigured
# will automatically disable tracing wrapper logic without throwing missing key errors.
LANGCHAIN_TRACING_V2="true"
LANGCHAIN_API_KEY="your-langsmith-api-key"
LANGCHAIN_PROJECT="gemini-rotate"

(Note: A single GEMINI_API_KEY environment variable is also supported as a fallback. If numbered keys like GEMINI_API_KEY_1 are present, they are guaranteed to rotate in sequential order.)

2. Model Priority Breakdown

You can customize the order in which models are attempted by setting GEMINI_MODELS in .env as shown above. The string MUST be a valid JSON array. The library processes models in Primary -> Secondary pairs.

Default Behavior (if GEMINI_MODELS is not set):

gemini-3.5-flash -> gemini-3.1-flash-lite
gemini-3-flash-preview -> gemini-2.5-flash
gemini-2.5-flash-lite -> gemma-4-26b-a4b-it
gemma-4-31b-it (no secondary fallback)

Custom Configuration:

GEMINI_MODELS='["gemini-3.5-flash", "gemini-3.1-flash-lite", "gemini-3-flash-preview"]'

🔍 How it Works

1. Execution Flow & Retries

The library attempts model pairs sequentially using rotated API clients:

graph TD
    Start[Start Request] --> IsTraced{LangSmith Enabled?}
    IsTraced -->|Yes| StartTrace[Start Trace Parent Run]
    IsTraced -->|No| LoopPairs
    StartTrace --> LoopPairs{Loop Model Pairs}
    
    LoopPairs -->|Primary, Secondary| LoopClients{Loop API Clients}
    
    LoopClients -->|Next Client| AttemptPrimary[Attempt Primary Model]
    
    AttemptPrimary -->|Success| LogTraceSuccess[Log Client/Model Success & Cost to Trace]
    AttemptPrimary -->|Failure| CheckSecondary{Has Secondary Model?}
    
    CheckSecondary -->|Yes| AttemptSecondary[Attempt Secondary Model]
    CheckSecondary -->|No| NextClient[Next Client]
    
    AttemptSecondary -->|Success| LogTraceSuccess
    AttemptSecondary -->|Failure| NextClient
    
    LogTraceSuccess --> ReturnResponse[Return Response]
    
    NextClient -->|Clients Exhausted| NextPair[Next Pair]
    NextPair -->|Pairs Exhausted| LogTraceFailure[Log Failure to Trace]
    LogTraceFailure --> RaiseError[Raise AllClientsFailed]
    
    style Start fill:#f9f,stroke:#333,stroke-width:2px
    style ReturnResponse fill:#9f9,stroke:#333,stroke-width:2px
    style RaiseError fill:#f99,stroke:#333,stroke-width:2px

2. Tracing Pipeline Integration

If LangSmith tracing is enabled:

Parent Trace Context: The entire rotation execution is wrapped in a high-level parent run, allowing you to see the aggregate latency and outcome.
Rotated Client Wrapping: Each raw client inside the pool gets its own metadata identifiers (Client-1, Client-2, etc.) matching their corresponding API keys.
Outcome Attribution: The successfully resolved key and model are dynamically logged under the trace's metadata attributes (succeeded_client, succeeded_model, ls_model_name).
Precise Cost & Token Tracking: The library dynamically calculates the precise model usage pricing based on the official guidelines (including dynamic token tiers) and posts token counts and the calculated total USD cost directly to the trace outputs.

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Project details

These details have been verified by PyPI

Project links

Homepage

GitHub Statistics

Maintainers

jayeeed

Release history Release notifications | RSS feed

This version

0.2.2

Jun 20, 2026

0.2.1

Jun 20, 2026

0.2.0

Jun 12, 2026

0.1.9

Jun 12, 2026

0.1.8

Jun 11, 2026

0.1.7

Jun 11, 2026

0.1.6

Jun 11, 2026

0.1.5

May 13, 2026

0.1.4

Apr 29, 2026

0.1.3

Feb 24, 2026

0.1.2

Feb 7, 2026

0.1.1

Feb 7, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gemini_rotate-0.2.2.tar.gz (15.0 kB view details)

Uploaded Jun 20, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

gemini_rotate-0.2.2-py3-none-any.whl (11.7 kB view details)

Uploaded Jun 20, 2026 Python 3

File details

Details for the file gemini_rotate-0.2.2.tar.gz.

File metadata

Download URL: gemini_rotate-0.2.2.tar.gz
Upload date: Jun 20, 2026
Size: 15.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for gemini_rotate-0.2.2.tar.gz
Algorithm	Hash digest
SHA256	`8401a212f1f4587ea588fe664e48465af1794f68de866ff630108b82eec9cad7`
MD5	`228e6f6619ee25a7cbce7c32072f5579`
BLAKE2b-256	`4f1635c26dae119f19ca569232c51d57f3e36b72514385c1069562e56687eed2`

See more details on using hashes here.

Provenance

The following attestation bundles were made for gemini_rotate-0.2.2.tar.gz:

Publisher: pypi-publish.yml on jayeeed/gemini-rotate

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: gemini_rotate-0.2.2.tar.gz
- Subject digest: 8401a212f1f4587ea588fe664e48465af1794f68de866ff630108b82eec9cad7
- Sigstore transparency entry: 1886326200
- Sigstore integration time: Jun 20, 2026
Source repository:
- Permalink: jayeeed/gemini-rotate@c2923f3a6baee61a89c59751574d5e9a4b2d6ac9
- Branch / Tag: refs/heads/main
- Owner: https://github.com/jayeeed
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pypi-publish.yml@c2923f3a6baee61a89c59751574d5e9a4b2d6ac9
- Trigger Event: push

File details

Details for the file gemini_rotate-0.2.2-py3-none-any.whl.

File metadata

Download URL: gemini_rotate-0.2.2-py3-none-any.whl
Upload date: Jun 20, 2026
Size: 11.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for gemini_rotate-0.2.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`79beb0102aa293aa3b9ce521dd8c19d2d9ac637f988f8da82d4249c2d7c98ed1`
MD5	`4a1705480a21c28f2afabe5e8f5ccfbd`
BLAKE2b-256	`f06ebf1ec9f672578ed5a4f2bab6b464fe65e02b07980c37d5a6d550e92b5aaa`

See more details on using hashes here.

Provenance

The following attestation bundles were made for gemini_rotate-0.2.2-py3-none-any.whl:

Publisher: pypi-publish.yml on jayeeed/gemini-rotate

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: gemini_rotate-0.2.2-py3-none-any.whl
- Subject digest: 79beb0102aa293aa3b9ce521dd8c19d2d9ac637f988f8da82d4249c2d7c98ed1
- Sigstore transparency entry: 1886326361
- Sigstore integration time: Jun 20, 2026
Source repository:
- Permalink: jayeeed/gemini-rotate@c2923f3a6baee61a89c59751574d5e9a4b2d6ac9
- Branch / Tag: refs/heads/main
- Owner: https://github.com/jayeeed
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pypi-publish.yml@c2923f3a6baee61a89c59751574d5e9a4b2d6ac9
- Trigger Event: push

gemini-rotate 0.2.2

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Meta

Unverified details

Meta

Project description

Gemini Rotate

🚀 Features

📦 Installation

⚡ Quick Start

📖 Usage Guide

Initialization

Generating Content

1. Async Text Generation

2. Sync Text Generation

3. Streaming Responses (Async & Sync)

4. Advanced: Tool Calling & Structured Output (Async Example)

5. LangSmith Tracing Integration

Parameters

⚙️ Configuration

1. The .env Format (Expected)

2. Model Priority Breakdown

🔍 How it Works

1. Execution Flow & Retries

2. Tracing Pipeline Integration

🤝 Contributing

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Meta

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

1. The `.env` Format (Expected)