Skip to main content

Embed transcript files into ChromaDB and query them with RAG

Project description

Transcript RAG

A small toolkit that embeds transcript files into ChromaDB and allows simple retrieval augmented generation queries. It exposes a command line interface via transcriptrag.

Installation

pip install transcript-rag

Usage

Embed transcripts

transcriptrag embed /path/to/transcripts --collection osce --model all-MiniLM-L6-v2

Ask a question

transcriptrag ask --collection osce "How does the student greet the patient?" --gen-model gemini-pro

The command retrieves relevant chunks from ChromaDB and sends them to the specified generation model. Ensure the appropriate API key is available in your environment for the generation model (e.g., Google Generative AI).

Publishing to PyPI

  1. Install build tools: pip install build twine
  2. Run python -m build to generate distribution files in dist/
  3. Upload with twine upload dist/*

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

transcript_rag-0.1.0.tar.gz (3.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

transcript_rag-0.1.0-py3-none-any.whl (5.3 kB view details)

Uploaded Python 3

File details

Details for the file transcript_rag-0.1.0.tar.gz.

File metadata

  • Download URL: transcript_rag-0.1.0.tar.gz
  • Upload date:
  • Size: 3.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.10

File hashes

Hashes for transcript_rag-0.1.0.tar.gz
Algorithm Hash digest
SHA256 dc9dbf43a6d372c89084c5641a18c3d90a036bdaa6fe1117a795195b603742ca
MD5 2d6b5dbefda916efce2875c8d4405c04
BLAKE2b-256 65fcc1ac31b7c8eeb24e70f3a02ca85b77d637bf4e6a10e8430104847f37bef9

See more details on using hashes here.

File details

Details for the file transcript_rag-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: transcript_rag-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 5.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.10

File hashes

Hashes for transcript_rag-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 51c0d746d9215178e848d0f2ac2dd696ca73503bf8f3810a06c6c5af997bbee2
MD5 3bde505d4b4bbfa033aed942008608f2
BLAKE2b-256 9450f90747d56c191f75b6882c260e487927aad306db428d957d9513f1e9b1ca

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page