Skip to main content

VidChain: High-fidelity multimodal RAG framework featuring the IRIS Intelligence Agent

Project description

VidChain: The "LangChain for Videos"

v0.9.0-Final — Featuring the IRIS Intelligence Assistant. A high-fidelity, local-first multimodal RAG framework for surgical video intelligence.

Python CUDA License Status PyPI version

VidChain v0.9 Dashboard


High-Integrity Neural Architecture

VidChain is powered by the B.A.B.U.R.A.O. Engine 2.0 (Behavioral Analysis & Broadcasting Unit for Real-time Artificial Observation). This engine fuses visual, auditory, and temporal data into a queryable intelligence layer, served through the IRIS (Intelligent Retrieval & Insight System) agent.

graph TD
    %% --- Ingestion Stage ---
    subgraph "1. Ingestion & Optimization Layer"
        VS[Video Source] --> AK[Adaptive Gaussian Filter]
        AK -- "Delta > Threshold" --> PK[Promote to Keyframe]
        AK -- "Redundant" --> DROP{{Neural Compute Firewall}}
    end

    %% --- Inference Stage ---
    subgraph "2. Sensory Node Matrix (Late Fusion)"
        PK --> VLM[LlavaNode: Scene Semantics]
        PK --> ASR[WhisperNode: Audio Trace]
        PK --> OCR[OcrNode: Digital Trace]
        PK --> TRK[TrackerNode: Motion Flow]
    end

    %% --- Intelligence Logic ---
    subgraph "3. B.A.B.U.R.A.O. 2.0 Cognitive Engine"
        VLM & ASR & OCR & TRK --> FUSE[Spatio-Temporal Fusion]
        FUSE --> RDN[Recursive Map-Reduce Summarizer]
    end

    %% --- Persistence ---
    subgraph "4. VidChain Memory Vault"
        FUSE --> KV[(ChromaDB Vector Store)]
        FUSE --> KG[[Isolated Knowledge Graph]]
    end

    %% --- Interaction Stage ---
    subgraph "5. IRIS Intelligence Agent"
        USER[User Query] --> IR{Intent Router}
        IR -- "Insight Search" --> RAG[GraphRAG Retrieval Loop]
        RAG <--> KV
        RAG <--> KG
        RAG --> DISCOVERY([VidChain Insight Canvas])
    end

    style VS fill:#1e1e2e,stroke:#74c7ec,stroke-width:2px;
    style DISCOVERY fill:#11111b,stroke:#e8192c,stroke-width:3px;

Developer SDK: Building a Custom IRIS Pipeline

VidChain is built on a modular "Node & Chain" architecture. You can assemble surgical intelligence pipelines by combining specific sensory nodes.

Example: High-Sensitivity Surveillance Audit

This example demonstrates how to build a custom pipeline that prioritizes motion tracking and OCR (digital trace) extraction.

from vidchain import VidChain
from vidchain.pipeline import VideoChain
from vidchain.nodes import (
    AdaptiveKeyframeNode, 
    LlavaNode, 
    OcrNode, 
    TrackerNode
)

# 1. Initialize the IRIS Engine
vc = VidChain(db_path="./surveillance_vault")

# 2. Assemble a Custom Sensory Chain
surveillance_chain = VideoChain(nodes=[
    AdaptiveKeyframeNode(change_threshold=1.5), # High sensitivity
    LlavaNode(model="moondream"),              # Scene semantics
    OcrNode(),                                 # Digital trace extraction
    TrackerNode()                              # Spatio-temporal motion flow
])

# 3. Execute the Pipeline
metadata = vc.ingest(
    video_path="gate_camera_04.mp4", 
    chain=surveillance_chain
)

# 4. Perform Surgical Reasoning
query = "Was there any vehicle with a visible license plate after 14:00?"
response = vc.query(query, session_id="gate_audit_01")

print(f"\nIRIS Intelligence Report:\n{response['text']}")

Core Sensory Nodes

Node Modality Best For
LlavaNode Visual Scene semantics, object descriptions, behavioral analysis.
WhisperNode Audio Speech-to-text, acoustic anomaly detection.
OcrNode Text Reading license plates, screens, and documents.
TrackerNode Motion Persistent object tracking and co-occurrence mapping.
AdaptiveKeyframeNode Logic Gaussian-differential sampling to reduce GPU load.

Key Features (v0.9 Evolution)

IRIS: The Intelligent Assistant

The v0.9 milestone introduces IRIS, a friendly and surgical AI assistant that mediates between the user and the raw B.A.B.U.R.A.O. data. IRIS handles natural language queries, complex multi-hop reasoning, and executive summaries.

Isolated GraphRAG Intelligence

Every VidChain "Insight Session" now generates a dedicated, persistent knowledge graph.

  • Neural Isolation: Zero leakage between sessions.
  • Entity Tracking: Deep co-occurrence tracking across the video timeline.
  • Secure Purge: Physically wipes all associated neural artifacts on deletion.

VidChain Media Gateway

No more broken paths. VidChain now features a dedicated streaming gateway that resolves absolute local paths, enabling high-fidelity playback of MKV, MP4, and AVI files.


Setup & Installation

git clone https://github.com/rahulsiiitm/videochain-python
cd videochain-python
pip install -e .

# Pull Neural Weights (Ollama)
ollama pull moondream   # Scene Semantics
ollama pull llama3      # Reasoning Hub

# Start the Suite
vidchain-serve

Detailed Evolution (v0.8 to v0.9)

v0.9.0 (The Insight Release)

  • Architecture: Implemented Neural Isolation for per-session knowledge graphs.
  • Media: Introduced the VidChain Media Gateway for absolute Windows path streaming.
  • Persona: Fully integrated IRIS as the primary interaction agent.
  • UI: High-fidelity custom modals for memory purging.

v0.8.8 (The Speed Milestone)

  • Optimization: Snappy Ingest protocol. Decoupled auto-summarization from ingestion.
  • Logic: Implemented recursive map-reduce for long-video summarization.

Author

Rahul Sharma — IIIT Manipur
SEM Project Phase 0.9.0-Final

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vidchain-0.9.0.tar.gz (2.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vidchain-0.9.0-py3-none-any.whl (2.1 MB view details)

Uploaded Python 3

File details

Details for the file vidchain-0.9.0.tar.gz.

File metadata

  • Download URL: vidchain-0.9.0.tar.gz
  • Upload date:
  • Size: 2.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.8

File hashes

Hashes for vidchain-0.9.0.tar.gz
Algorithm Hash digest
SHA256 d5c2daff428eb5d65d7f4266121552c8a37930e5ce77d360a27181d5fa2816bb
MD5 15c86b02bc50849ee7e24314216f83f4
BLAKE2b-256 0de0da35e47a14d1b835837731611dfeb4e9c87ecd988d35616df639aad930a6

See more details on using hashes here.

File details

Details for the file vidchain-0.9.0-py3-none-any.whl.

File metadata

  • Download URL: vidchain-0.9.0-py3-none-any.whl
  • Upload date:
  • Size: 2.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.8

File hashes

Hashes for vidchain-0.9.0-py3-none-any.whl
Algorithm Hash digest
SHA256 7e69c02bc4bf2f6ba606cbc7cb77f6c2f92f475501fb661fc014191d0a81555b
MD5 d08bba6423e1c332b268952cf4d212a4
BLAKE2b-256 b9884ed18f757dff9a3e1430c83f3b8a48486da0d651a318df04102298f40016

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page