An Intelligent Open-Source Video Understanding System A different path from traditional Large Video Language Models (LVLMs). Built for modularity, openness, and real-world usability.
Project description
gurrt is an intelligent video understanding system, an open-source alternative to monolithic Large Video Language Models built out of frustration.
One cannot work with Large Video Language Models :
- Expensive to set up
- GPU intensive
- Slow to experiment with
- Difficult to run on consumer hardware
- Often closed or partially restricted
Most state-of-the-art video models require massive compute clusters and large-scale infrastructure.
They are impressive โ but they are not accessible.
If meaningful video intelligence requires:
- Multiple high-end GPUs
- Hours of inference time
- Proprietary model access
Then it stops feeling truly open.
A Different Philosophy
gurrt is not an attempt to compete with systems like YouTubeโs internal models or other large-scale industrial LVLMs trained on massive GPU clusters. It is an attempt to rethink the approach. Instead of asking how to build a larger end-to-end video transformer, it explores a different path:
- Smarter frame sampling techniques
- Stronger and more modular vision models
- Better structured embedding strategies
- More efficient and grounded RAG pipelines
- Persistent memory-driven reasoning
The idea is how can i just get the job done with minimal efforts yielding high end results
It represents a belief that meaningful video understanding can emerge from:
- Thoughtful engineering
- Smart sampling
- Strong modular components
- Memory-augmented retrieval
Not just from massive GPU clusters and billion-parameter models.
๐ฟ Quick Start Guide for pypi package
1. Installation
Set up gurrt using uv. Note: This project requires Python 3.12.
# 1. Install uv and set Python version
pip install uv
uv venv
uv python pin 3.12
# 2. Activate environment
source .venv/bin/activate # Windows: .venv\Scripts\activate
# 3. Install gurrt (Standard/CPU)
uv pip install gurrt
# 4. OR Install with GPU Support
uv pip install gurrt[cuda] --extra-index-url https://download.pytorch.org/whl/cu121
2. Commands
| Command | Description |
|---|---|
gurrt init |
Configure API keys (Groq, Supermemory, Ollama). |
gurrt models-download |
Download and cache AI models locally. |
gurrt index <path> |
Extract frames and audio for search. |
gurrt index-ollama <path> <model> |
Index using a specific Ollama model. |
gurrt ask "<query>" |
Query your indexed video content. |
The tool automatically optimizes performance by disabling unnecessary logging and tokenizer parallelism to ensure a clean CLI experience yet some logs do appear of Moviepy will resolve it in future iterations.
Architecture Overview
Video
โ
โโโ Smart Frame Extraction
โ โโโ Captioning + Embeddings
โ
โโโ Audio Extraction
โ โโโ Speech-to-Text + Embeddings
โ
โโโ Vector Memory Store
โ
โโโ Supermemory (Persistent Conversation Layer)
โ
โโโ LLM Reasoning Engine
Project Setup (using uv)
# Install uv if you haven't already
pip install uv
# Sync dependencies
uv sync
# Activate environment
.venv\Scripts\activate
File Structure
gurrt/
โโโ src/
โ |
โ โ
โ โโโ videorag/ # Core Video-RAG application package
โ โ
โ โโโ api/
โ โ โโโ server.py # API server (exposes endpoints for querying, ingestion, etc.)
โ โ
โ โโโ cli/
โ โ โโโ main.py # CLI entry point (init, ingest, query commands)
โ โ
โ โโโ config/
โ โ โโโ config.py # Configuration management (API keys, paths, environment setup)
โ โ
โ โโโ core/ # Core intelligence pipeline
โ โ โโโ __init__.py
โ โ โโโ asr.py # Audio extraction + speech-to-text processing
โ โ โโโ embedding.py # Embedding generation for captions & transcripts
โ โ โโโ llm.py # LLM interaction and reasoning logic
โ โ โโโ models.py # Model loading and management utilities
โ โ โโโ pipeline.py # End-to-end ingestion + query pipeline orchestration
โ โ โโโ prompts.py # Prompt templates and structured context injection
โ โ โโโ search.py # Retrieval logic (semantic search over stored embeddings)
โ โ โโโ vectordb.py # Vector database interface and storage abstraction
โ โ
โ โโโ utils/
โ โโโ utils.py # Shared utility functions and helpers
โ
โโโ README.md # Project documentation
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file gurrt-1.0.3.tar.gz.
File metadata
- Download URL: gurrt-1.0.3.tar.gz
- Upload date:
- Size: 18.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.9.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fc0ffbdb471647894ae6614bfdbbe4374a53b7f4f94c94cf4bee4cc99074ef00
|
|
| MD5 |
02a43525659a70275b51d565668edf36
|
|
| BLAKE2b-256 |
d25d36b1d2cb790080ab8836e1ff8fd3b10c160335c6391c6cb488164279bdc1
|
File details
Details for the file gurrt-1.0.3-py3-none-any.whl.
File metadata
- Download URL: gurrt-1.0.3-py3-none-any.whl
- Upload date:
- Size: 18.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.9.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
03c7e0edd69a852a3ead7702c12c029fe72b1eb246e16e8cf347c4364717b211
|
|
| MD5 |
fe7bbf02f253fd2ca04b1d3533c70086
|
|
| BLAKE2b-256 |
e832118ec2e343f05e66e7d18ffffb3d6d68160bc49b72142eae2281e886e9b3
|