Audio-First Knowledge Engine Platform
Project description
Epist.ai Audio RAG Platform
[!TIP] "Read" your audio. Transform podcasts, meetings, and voice notes into a queryable Second Brain with RAG.
🔗 Quick Links
🚀 New: Model Context Protocol (MCP)
Connect Claude Desktop directly to your knowledge base using our official MCP Server:
pip install epist-mcp-server
Introduction
Epist.ai is a next-generation system that treats audio as a first-class citizen, enabling seamless ingestion, understanding, and retrieval of information from meetings, voice notes, and ambient conversations.
It provides a production-ready API for audio ingestion, transcription, and semantic search, built on modern RAG (Retrieval-Augmented Generation) principles.
🧠 Core Concepts
The Audio RAG Pipeline
- Ingestion: Audio/Video files are streamed to GCS.
- Transcription: Fireworks AI (Whisper V3 Turbo) generates high-fidelity transcripts with speaker diarization.
- Semantic Chunking: Transcripts are intelligently split to preserve context.
- Vectorization: Text chunks are converted to embeddings using OpenAI's
text-embedding-3-small. - Hybrid Retrieval: Search combines BM25 keyword matching with Vector similarity for maximum precision.
Platform vs. Tenant
- Platform (This Repo): The core engine providing the API, storage, and processing logic.
- Tenant (External): Applications like podcast-rag-app that consume the API to provide specific user experiences.
Features
- Audio Upload: Stream large audio files directly to Google Cloud Storage.
- RSS Ingestion: Ingest entire podcast feeds automatically with metadata extraction.
- Automatic Transcription: Background transcription with speaker diarization.
- Hybrid Search: Combine semantic meaning (Vector) and exact keywords (Full-Text) using Reciprocal Rank Fusion (RRF).
- Multi-Modal Support: Support for
.mp4,.mov,.mp3,.wav. - 📊 Observability: "Glass Box" tracing system to visualize every pipeline step.
- 💬 Interactive Chat: Chat with your audio content using citations that link directly to timestamps.
- MCP Integration: Connect your audio knowledge base directly to Claude.
Tech Stack
- Backend: FastAPI (Python 3.11+)
- Database: PostgreSQL 15 +
pgvector - ASR: Fireworks AI (Whisper V3 Turbo)
- Embeddings: OpenAI
text-embedding-3-small - RAG Framework: LangChain
- Infrastructure: GCP (Cloud Run, Cloud SQL, Cloud Storage, Cloud Tasks)
- Frontend: Next.js 15 (React)
Quick Start
This project uses uv for dependency management.
# Install dependencies
uv sync
# Run development server
uv run fastapi dev src/main.py
Frontend Development
cd apps/web
npm install
npm run dev
Project Structure
| Directory | Description |
|---|---|
src/ |
Core Backend API: FastAPI application, services, and models. |
apps/web/ |
Main Dashboard: Next.js 15 application for managing audio and search. |
packages/ |
Internal Packages: Includes epist_cli and epist_mcp_server. |
sdks/ |
Client Libraries: Official SDKs for JS/TS and Python. |
infra/ |
Infrastructure: Terraform configurations for GCP deployment. |
docs/ |
Documentation: Deep-dive guides and API references. |
labs/ |
Research: Experimental notebooks and chunking evaluations. |
Changelog
v1.2.0 (Jan 05, 2026)
- RSS Ingestion: New generic platform feature to ingest entire podcast feeds.
- Enhanced Metadata: Automated extraction of author, description, and images.
v1.1.0 (Dec 27, 2025)
- Stability & Observability: Integrated Sentry and migrated to Cloud Scheduler.
- Security & Performance: Added TrustedHost, GZip, and production security headers.
- Tiered Sync: Automated feed synchronization with tier-based refresh intervals.
v0.1.0 (Dec 08, 2025)
- Core Platform: Initial release with Audio RAG and Vector Search.
Contributing
We welcome contributions! Please see our Contributing Guide for more information.
License
This project is open source under the MIT License.
Support
If you have questions or need assistance, please contact us at admin@epist.ai.
Epist.ai - Transforming audio into actionable knowledge.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file epist-1.2.2.tar.gz.
File metadata
- Download URL: epist-1.2.2.tar.gz
- Upload date:
- Size: 3.1 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3c47b0fb62fe5a459cb948ff84e1525c3fef90615a2243cc89461ccfaa1b9647
|
|
| MD5 |
81454adfe7619b795e9beaa76d1591fe
|
|
| BLAKE2b-256 |
175e5bc9ed78778c2e35a826ded6f0b3481aea3b53e8fc9f30163bd303349b9f
|
Provenance
The following attestation bundles were made for epist-1.2.2.tar.gz:
Publisher:
publish.yml on Seifollahi/epist
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
epist-1.2.2.tar.gz -
Subject digest:
3c47b0fb62fe5a459cb948ff84e1525c3fef90615a2243cc89461ccfaa1b9647 - Sigstore transparency entry: 813560164
- Sigstore integration time:
-
Permalink:
Seifollahi/epist@cd928eebf8f56b397589d4c133c24863fd3d8fe9 -
Branch / Tag:
refs/tags/v1.2.2 - Owner: https://github.com/Seifollahi
-
Access:
private
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@cd928eebf8f56b397589d4c133c24863fd3d8fe9 -
Trigger Event:
push
-
Statement type:
File details
Details for the file epist-1.2.2-py3-none-any.whl.
File metadata
- Download URL: epist-1.2.2-py3-none-any.whl
- Upload date:
- Size: 85.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
331e1f17a44316e7d5e251362b699489a300c3de6158a5417980430cf24dbb06
|
|
| MD5 |
9d318e2b6d76acb55258210b041cdd9e
|
|
| BLAKE2b-256 |
0572797599ded21da067ed11c03eb393172fd1c653ddcc6c54c2525f5be62698
|
Provenance
The following attestation bundles were made for epist-1.2.2-py3-none-any.whl:
Publisher:
publish.yml on Seifollahi/epist
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
epist-1.2.2-py3-none-any.whl -
Subject digest:
331e1f17a44316e7d5e251362b699489a300c3de6158a5417980430cf24dbb06 - Sigstore transparency entry: 813560165
- Sigstore integration time:
-
Permalink:
Seifollahi/epist@cd928eebf8f56b397589d4c133c24863fd3d8fe9 -
Branch / Tag:
refs/tags/v1.2.2 - Owner: https://github.com/Seifollahi
-
Access:
private
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@cd928eebf8f56b397589d4c133c24863fd3d8fe9 -
Trigger Event:
push
-
Statement type: