RAG over audio files with a provider-agnostic pipeline

AudioRAG

Provider-agnostic RAG pipeline for audio content. Download, transcribe, chunk, embed, and search audio from YouTube and other sources.

Features

Multi-provider support: OpenAI, Deepgram, AssemblyAI, Groq (STT); OpenAI, Voyage, Cohere (embeddings); OpenAI, Anthropic, Gemini (generation); ChromaDB, Pinecone, Weaviate, Supabase (vector stores)
Resumable processing: SQLite state tracking with hash-based IDs
Automatic chunking: Time-based segmentation with configurable duration
Audio splitting: Handles large files by splitting before transcription
Structured logging: Context-aware logging with operation timing
Type-safe: Python 3.12+ with full type annotations
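
As a rough illustration of the resumable-processing idea above, the sketch below derives a stable ID by hashing the source URL and records the last completed stage in SQLite. It is a minimal sketch, not AudioRAG's implementation: the `StateStore` class, the `sources` table, the stage names, and the choice of a truncated SHA-256 are all assumptions made for this example.

```python
import hashlib
import sqlite3
from typing import Optional


def source_id(url: str) -> str:
    """Derive a stable ID from the source URL (hypothetical scheme)."""
    return hashlib.sha256(url.encode("utf-8")).hexdigest()[:16]


class StateStore:
    """Tracks the last completed stage per source (hypothetical schema)."""

    def __init__(self, path: str = ":memory:") -> None:
        self.conn = sqlite3.connect(path)
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS sources ("
            "id TEXT PRIMARY KEY, url TEXT NOT NULL, stage TEXT NOT NULL)"
        )

    def stage_of(self, url: str) -> Optional[str]:
        """Return the recorded stage for url, or None if it was never seen."""
        row = self.conn.execute(
            "SELECT stage FROM sources WHERE id = ?", (source_id(url),)
        ).fetchone()
        return row[0] if row else None

    def mark(self, url: str, stage: str) -> None:
        """Record that url has completed the given stage."""
        self.conn.execute(
            "INSERT OR REPLACE INTO sources (id, url, stage) VALUES (?, ?, ?)",
            (source_id(url), url, stage),
        )
        self.conn.commit()
```

Because the ID depends only on the URL, a re-run can look up the same row and skip stages that already finished, instead of downloading and transcribing again.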

Quick Start

import asyncio
from audiorag import AudioRAGPipeline, AudioRAGConfig

async def main():
    # Configure with your chosen providers
    config = AudioRAGConfig(
        stt_provider="openai",
        stt_model="whisper-1",
        embedding_provider="openai",
        embedding_model="text-embedding-3-small",
        vector_store_provider="chromadb",
        generation_provider="openai",
        generation_model="gpt-4o-mini",
        # API keys can also be set via environment variables
        openai_api_key="sk-...",
    )
    
    # Initialize pipeline
    pipeline = AudioRAGPipeline(config)
    
    # Index audio from YouTube
    await pipeline.index("https://youtube.com/watch?v=...")
    
    # Query the indexed content
    result = await pipeline.query("What are the main points discussed?")
    print(result.answer)
    
    # Access sources with timestamps
    for source in result.sources:
        print(f"{source.video_title} at {source.start_time}s")
        print(f"URL: {source.youtube_timestamp_url}")

asyncio.run(main())
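
The Quick Start prints `source.youtube_timestamp_url` without showing how such a link is formed. One plausible construction, using YouTube's `t=<seconds>s` query parameter, is sketched below. The helper name `timestamp_url` and the video ID `abc123` are hypothetical; how AudioRAG builds the URL internally is not shown in this README.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def timestamp_url(video_url: str, start_time: float) -> str:
    """Return video_url with a t=<seconds>s parameter set to start_time."""
    parts = urlsplit(video_url)
    # Drop any existing t= parameter and keep the rest (e.g. v=<video id>).
    query = [(k, v) for k, v in parse_qsl(parts.query) if k != "t"]
    # The parameter takes whole seconds, so truncate the float.
    query.append(("t", f"{int(start_time)}s"))
    return urlunsplit(parts._replace(query=urlencode(query)))


print(timestamp_url("https://youtube.com/watch?v=abc123", 92.7))
# prints https://youtube.com/watch?v=abc123&t=92s
```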

Installation

# Install with uv (recommended)
uv pip install audiorag

# Or with pip
pip install audiorag

Optional Dependencies

# Quote the extras so shells such as zsh do not treat the brackets as a glob pattern

# Audio scraping utilities (yt-dlp, pydub)
uv pip install "audiorag[defaults]"  # or: pip install "audiorag[defaults]"

# All providers and utilities
uv pip install "audiorag[all]"  # or: pip install "audiorag[all]"

# Specific providers only
uv pip install "audiorag[openai,chromadb,scraping,cohere]"

Configuration

AudioRAG uses pydantic-settings with environment variable support. All settings use the AUDIORAG_ prefix.

# Example: Using OpenAI for STT, embeddings, and generation
export AUDIORAG_OPENAI_API_KEY="sk-..."
export AUDIORAG_STT_PROVIDER="openai"
export AUDIORAG_EMBEDDING_PROVIDER="openai"
export AUDIORAG_VECTOR_STORE_PROVIDER="chromadb"
export AUDIORAG_GENERATION_PROVIDER="openai"

# Example: Using different providers
export AUDIORAG_DEEPGRAM_API_KEY="..."
export AUDIORAG_STT_PROVIDER="deepgram"
export AUDIORAG_VOYAGE_API_KEY="..."
export AUDIORAG_EMBEDDING_PROVIDER="voyage"

# Processing settings
export AUDIORAG_CHUNK_DURATION_SECONDS="30"
export AUDIORAG_RETRIEVAL_TOP_K="10"
export AUDIORAG_RERANK_TOP_N="3"

See the Configuration Guide for all options.
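
To make the prefix convention concrete, the sketch below maps `AUDIORAG_`-prefixed environment variables onto typed settings fields using only the standard library. It is a stand-in for illustration, not the pydantic-settings code AudioRAG actually uses. The `ProcessingSettings` class and `load_settings` function are hypothetical, and the defaults simply mirror the example exports above rather than the library's confirmed defaults.

```python
import os
from dataclasses import dataclass, fields

PREFIX = "AUDIORAG_"


@dataclass(frozen=True)
class ProcessingSettings:
    # Placeholder defaults for this sketch, not AudioRAG's documented defaults.
    stt_provider: str = "openai"
    chunk_duration_seconds: int = 30
    retrieval_top_k: int = 10
    rerank_top_n: int = 3


def load_settings(env=None) -> ProcessingSettings:
    """Build settings from AUDIORAG_-prefixed variables in env."""
    env = os.environ if env is None else env
    overrides = {}
    for f in fields(ProcessingSettings):
        raw = env.get(PREFIX + f.name.upper())
        if raw is not None:
            # Coerce the string to the type of the field's default (str or int).
            overrides[f.name] = type(f.default)(raw)
    return ProcessingSettings(**overrides)


settings = load_settings({"AUDIORAG_CHUNK_DURATION_SECONDS": "45"})
print(settings.chunk_duration_seconds)  # prints 45
```

Each field name is upper-cased and prefixed to find its variable, so `chunk_duration_seconds` is read from `AUDIORAG_CHUNK_DURATION_SECONDS`. Unset variables fall back to the field default.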

Documentation

Quick Start Guide - Get up and running
Configuration - All configuration options
Providers - Available providers and setup
Architecture - Pipeline stages and data flow
API Reference - Complete API documentation

Development

# Clone and setup
git clone <repository-url>
cd audiorag
uv sync

# Run tests
uv run pytest

# Run checks
uv run ruff check . --fix
uv run ty check

# Install pre-commit hooks
uv run prek install

Pipeline Stages

1. Download: Fetch audio from a URL (YouTube supported)
2. Split: Divide large files into processable chunks
3. Transcribe: Convert audio to text using the STT provider
4. Chunk: Group the transcription into time-based segments
5. Embed: Generate vector embeddings for each chunk
6. Store: Persist embeddings in the vector database
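
The Chunk stage listed above can be sketched as a simple merge of timestamped transcript segments into windows of roughly fixed duration. This is a minimal sketch under stated assumptions: the `Segment` type, the `chunk_by_duration` function, and the rule of closing a chunk when the next segment would push it past the limit are illustrative choices, not AudioRAG's actual algorithm.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    start: float  # seconds from the beginning of the audio
    end: float
    text: str


def chunk_by_duration(segments: list[Segment], chunk_seconds: float = 30.0) -> list[Segment]:
    """Merge consecutive segments into chunks spanning at most chunk_seconds."""
    chunks: list[Segment] = []
    current: list[Segment] = []
    for seg in segments:
        # Close the current chunk if adding this segment would exceed the limit.
        if current and seg.end - current[0].start > chunk_seconds:
            chunks.append(_merge(current))
            current = []
        current.append(seg)
    if current:
        chunks.append(_merge(current))
    return chunks


def _merge(parts: list[Segment]) -> Segment:
    """Combine segments into one, keeping the outer timestamps."""
    return Segment(parts[0].start, parts[-1].end, " ".join(p.text for p in parts))
```

Each chunk keeps the start time of its first segment and the end time of its last, which is what lets a query result point back to a position in the audio. A single segment longer than `chunk_seconds` is kept whole as its own chunk in this sketch.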

License

MIT License
