Minimal Gemini Live AI WebSocket proxy for FastAPI — plug and play voice + video AI.

These details have not been verified by PyPI

Project description

gemilive 🎙️🚀

Plug-and-play Gemini Multimodal Live AI (voice + video) for your custom stack.

While Google provides excellent core SDKs for the Gemini Multimodal Live API, integrating it securely into a production app usually kills a weekend. You can't put your API keys directly into a browser frontend, so you are forced to build a custom backend proxy. Suddenly, you're hand-wiring WebSockets to bridge raw 16kHz microphone streams from a JS frontend into a Python backend just to forward them to Gemini.

gemilive permanently solves this "Proxy Problem."

It provides a seamless, secure bridge connecting your frontend directly to Google's AI through your own custom backend. It abstracts away all the tedious boilerplate of WebSockets, bidirectional audio streams (16kHz up / 24kHz down), gapless browser PCM playback, and live video framing.

Instead of spending hours reading Web Audio specs, you can now add secure, multimodal conversational AI to your project in just six lines of code.

This repository contains the full ecosystem spanning two packages:

🐍 gemilive: The secure Python backend extension for FastAPI.
🌐 gemilive-js: The companion JavaScript client that handles all browser multimedia.

✨ Why gemilive?

Real-Time Voice: Native PCM audio streaming for natural, interruption-friendly conversations. No laggy turn-by-turn.
Multimodal Vision: The AI can securely see what your camera sees via optimized JPEG snapshots (1fps).
Zero-Boilerplate Backend: Just wrap your existing FastAPI app with mount_gemilive(). It abstracts all the WebSocket proxying securely.
Lightweight JS SDK: A clean browser GemiliveClient handling media permissions, capturing, scaling, and gapless audio resampling so you never have to touch the Web Audio API.

🛠️ Installation & Quickstart

Integration requires two pieces: the Python server endpoint and the JavaScript browser client. They are designed to work together flawlessly.

🐍 Backend (Python / FastAPI)

Install the pip package. You can use standard pip or modern package managers like uv:

uv add gemilive

Setup requires a Google Gemini API key. You can provide it directly in code or grab it from your .env:

GOOGLE_API_KEY=your_gemini_api_key_here
MODEL_NAME=gemini-3.1-flash-live-preview

Mount it into any FastAPI application:

from fastapi import FastAPI
from gemilive import mount_gemilive

app = FastAPI()

# Mounts the secure WebSocket proxy route automatically at /ws/live
mount_gemilive(app, system_prompt="You are a helpful assistant. Keep your answers brief and conversational.")

Frontend (JavaScript)

Install the npm package:

npm install gemilive-js

Or use via CDN in plain HTML:

<script src="https://cdn.jsdelivr.net/npm/gemilive-js/dist/gemilive.min.js"></script>

Initialize the client, connect, and start talking:

import { GemiliveClient } from 'gemilive-js';

// Point it to your FastAPI server's mount path
const client = new GemiliveClient("ws://localhost:8000/ws/live");

client.onMessage = (text) => console.log("Gemini:", text);
client.onError = (err) => console.error("Error:", err);

// Start the connection (prompts user for Mic & Camera)
await client.start();

// Disable video mid-session (audio continues)
// client.toggleVideo(false);

// Stop and disconnect
// client.stop();

⚙️ Advanced Configuration

Python `mount_gemilive()` Overrides

You can override environment variables dynamically when mounting the API:

mount_gemilive(
    app,
    google_api_key="...",                 # Overrides GOOGLE_API_KEY env 
    model="gemini-3.1-flash-live-preview",# Overrides MODEL_NAME env
    voice="Aoede",                        # Optional Gemini Voice ("Aoede", "Charon", etc.)
    allow_origins=["https://myapp.com"],  # Essential if your frontend is on a different domain
    debug_mode=True                       # Console logging of message flow
)

The System Prompt

You can set system prompts on the server-side (via mount_gemilive) or the client-side (via new GemiliveClient(url, { systemPrompt: "..." })). If both are provided, the server-side prompt takes precedence, and the client-side prompt is appended securely as "Additional context".

📂 Project Structure (For Contributors)

gemilive is developed as a monorepo containing two packages:

├── gemilive/             # PyPI package source
│   ├── mount.py        # Public FastAPI installer
│   ├── config.py       # Pydantic env validation
│   └── router.py       # Internal WebSocket / GenAI flow
├── gemilive-js/          # npm package source
│   ├── src/index.js    # Browser SDK (Web Audio API logic)
│   └── package.json
└── main.py             # Sandbox FastAPI app for testing and local dev

For guidelines on local development and how to publish to PyPI and npm, read PUBLISHING.md.

⚠️ Important Considerations

Browser Security: Browsers restrict microphone/camera access to secure contexts. getUserMedia requires HTTPS in production. localhost works for development.
Audio Resampling: Browsers typically record audio at 44.1kHz or 48kHz. The gemilive-js SDK seamlessly resamples microphone inputs to 16kHz PCM to meet Gemini's strict API requirements. Responses from Gemini are returned as 24kHz PCM and gaplessly played back using Javascript time-scheduling.

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.1

Apr 13, 2026

0.1.0

Apr 9, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gemilive-0.1.1.tar.gz (8.0 kB view details)

Uploaded Apr 13, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

gemilive-0.1.1-py3-none-any.whl (8.9 kB view details)

Uploaded Apr 13, 2026 Python 3

File details

Details for the file gemilive-0.1.1.tar.gz.

File metadata

Download URL: gemilive-0.1.1.tar.gz
Upload date: Apr 13, 2026
Size: 8.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for gemilive-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`d33ed7e755be0efee64f8234ef5e8df504b770a641fda9c34c295972368d097c`
MD5	`5006e3065e23fe3445e6db15d4858910`
BLAKE2b-256	`73fe270107584ffec9bdeff2002af746d4e1c2f599d016d78b7c886f55d50068`

See more details on using hashes here.

Provenance

The following attestation bundles were made for gemilive-0.1.1.tar.gz:

Publisher: publish-python.yml on saidurpulok/gemilive

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: gemilive-0.1.1.tar.gz
- Subject digest: d33ed7e755be0efee64f8234ef5e8df504b770a641fda9c34c295972368d097c
- Sigstore transparency entry: 1283130313
- Sigstore integration time: Apr 13, 2026
Source repository:
- Permalink: saidurpulok/gemilive@8a199dbff42fb9c00cc7073d09e354ef7dabd85f
- Branch / Tag: refs/tags/python-v0.1.1
- Owner: https://github.com/saidurpulok
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-python.yml@8a199dbff42fb9c00cc7073d09e354ef7dabd85f
- Trigger Event: push

File details

Details for the file gemilive-0.1.1-py3-none-any.whl.

File metadata

Download URL: gemilive-0.1.1-py3-none-any.whl
Upload date: Apr 13, 2026
Size: 8.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for gemilive-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`01fbda822f5e8d6e93e0d944f227bf6e17d5d831ce5a771f511a426f4841b4a8`
MD5	`c9eab5695c9dd986d8229d303a0ae4da`
BLAKE2b-256	`b518004a626bcd29b1383d9a07eb04f6a0798bfe8edec4d8c38eeb61c4a70988`

See more details on using hashes here.

Provenance

The following attestation bundles were made for gemilive-0.1.1-py3-none-any.whl:

Publisher: publish-python.yml on saidurpulok/gemilive

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: gemilive-0.1.1-py3-none-any.whl
- Subject digest: 01fbda822f5e8d6e93e0d944f227bf6e17d5d831ce5a771f511a426f4841b4a8
- Sigstore transparency entry: 1283130343
- Sigstore integration time: Apr 13, 2026
Source repository:
- Permalink: saidurpulok/gemilive@8a199dbff42fb9c00cc7073d09e354ef7dabd85f
- Branch / Tag: refs/tags/python-v0.1.1
- Owner: https://github.com/saidurpulok
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-python.yml@8a199dbff42fb9c00cc7073d09e354ef7dabd85f
- Trigger Event: push

gemilive 0.1.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

gemilive 🎙️🚀

✨ Why gemilive?

🛠️ Installation & Quickstart

🐍 Backend (Python / FastAPI)

Frontend (JavaScript)

⚙️ Advanced Configuration

Python `mount_gemilive()` Overrides

The System Prompt

📂 Project Structure (For Contributors)

⚠️ Important Considerations

📄 License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

gemilive 0.1.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

gemilive 🎙️🚀

✨ Why gemilive?

🛠️ Installation & Quickstart

🐍 Backend (Python / FastAPI)

Frontend (JavaScript)

⚙️ Advanced Configuration

Python mount_gemilive() Overrides

The System Prompt

📂 Project Structure (For Contributors)

⚠️ Important Considerations

📄 License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

Python `mount_gemilive()` Overrides