Skip to main content

Voice-driven AI assistant for real-time transcription and Gemini integration.

Project description

VoxAI

VoxAI Logo

Voice-driven AI assistant capturing desktop audio and streaming live, textual answers—ideal for interviews, meetings, and knowledge work.


⚙️ Prerequisites

  • Python ≥3.7
  • Node.js ≥14
  • Electron (global)
    npm install -g electron
    

🚀 Quickstart

  1. Clone the repo

    git clone https://github.com/rtiwariops/voxai.git
    cd voxai
    
  2. Create & activate a Python virtual environment

    python3 -m venv .venv
    source .venv/bin/activate      # macOS/Linux
    # .venv\Scripts\activate       # Windows PowerShell
    
  3. Install Python dependencies

    pip install --upgrade pip
    pip install .
    
  4. Run VoxAI

voxai
  1. Configure environment variables

    cp .env.example .env
    
  2. Edit .env and provide:

    GENAI_API_KEY=sk-...
    GENAI_MODEL=gemini-1.5-flash
    
  3. Run VoxAI

    VoxAI
    

🎧 How It Works

  • Start Recording
    Click Start to capture audio from your input device (e.g., BlackHole).

  • Stop & Transcribe
    Click Stop. Whisper transcribes and shows the full audio clip under Transcript.

  • Ask AI
    Click Ask AI to send the transcript to Gemini. Answers stream live in the Answer panel.

  • Copy & Share
    Output is plain text—reuse it in RAG/finetune workflows or anywhere else.


⚙️ Features

  • Universal Audio Capture (BlackHole, etc.)
  • Manual Control for long-form Q&A
  • One-Shot, Full-Clip Transcription
  • Live Token-by-Token AI Streaming
  • Configurable Gemini Model (.env)
  • Electron-Based Desktop UI

📦 Installation Options

From source:

git clone https://github.com/YourUsername/VoxAI.git
cd VoxAI
pip install -e .
VoxAI

From PyPI:

pip install VoxAI
VoxAI

🛠 Configuration

Set in .env:

Variable Description Example
GENAI_API_KEY Google Generative AI (Gemini) API key sk-…
GENAI_MODEL Gemini model to use gemini-1.5-flash

See .env.example for reference.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

voxai-0.1.5.tar.gz (8.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

voxai-0.1.5-py3-none-any.whl (7.8 kB view details)

Uploaded Python 3

File details

Details for the file voxai-0.1.5.tar.gz.

File metadata

  • Download URL: voxai-0.1.5.tar.gz
  • Upload date:
  • Size: 8.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.0

File hashes

Hashes for voxai-0.1.5.tar.gz
Algorithm Hash digest
SHA256 8d692cc56d7d578d4afd92b296b74aca48d5d7233134444e2bf2d368f28b7180
MD5 5c807c7fd87c61eec892486493315a43
BLAKE2b-256 0ffe370fd73705d36125920a20a95d2311465d6fd2553c8ea5abcd7e87ed030e

See more details on using hashes here.

File details

Details for the file voxai-0.1.5-py3-none-any.whl.

File metadata

  • Download URL: voxai-0.1.5-py3-none-any.whl
  • Upload date:
  • Size: 7.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.0

File hashes

Hashes for voxai-0.1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 42f6b4fcf08cc6f855ddd021a5e45b947a1683b56332fb8e5d8c32499c6425fa
MD5 fed2a34f5af0dec9284097ebabf485eb
BLAKE2b-256 110f132990c0f8d840e78d43543c2673a700d19725683e4bca4bf22913984498

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page