Event-driven, real-time AI video stream processing framework

These details have not been verified by PyPI

Project links

Project description

VisionFlow

Event-driven real-time AI video stream processing framework

Features • Installation • Quick Start • Documentation • Contributing

Overview

VisionFlow is a production-ready Python framework for building scalable, event-driven real-time video AI applications. It provides a clean abstraction layer for video ingestion, AI model inference, and multi-channel event distribution with full async/await support.

Use Cases

Video surveillance with object detection and tracking
Live stream analytics and monitoring
Real-time computer vision applications
IoT video processing at the edge
Enterprise video analysis platforms

Key Features

Multi-Source Ingestion – RTSP streams, local files, and custom sources
Parallel AI Processing – Concurrent YOLO detection, OCR, and custom models
Event-Driven Architecture – Async pub/sub system with declarative handlers
Multi-Channel Output – REST API, WebSocket, Kafka, file, and custom outputs
Fully Typed – 100% type hints for IDE autocomplete and static analysis
Async-First – Full asyncio support throughout the entire framework
Production-Ready – Comprehensive error handling, structured logging, and extensive testing
Extensible – Custom sources, workers, and outputs via simple inheritance
YAML Configuration – Declarative pipeline configuration with Pydantic validation

Installation

From PyPI

# Core installation
pip install visionflow-ai

# With YOLO object detection
pip install visionflow-ai[yolo]

# With OCR text recognition
pip install visionflow-ai[ocr]

# With Apache Kafka integration
pip install visionflow-ai[kafka]

# All optional dependencies
pip install visionflow-ai[yolo,ocr,kafka]

# Development tools
pip install visionflow-ai[dev]

From Source

git clone https://github.com/FaisalAhmedBijoy/visionflow.git
cd visionflow
pip install -e ".[dev,yolo,ocr,kafka]"

Quick Start

Basic Example

import asyncio
from visionflow import StreamPipeline
from visionflow.ingestion import FileSource
from visionflow.processing.yolo import YOLOWorker
from visionflow.processing.pool import WorkerPool
from visionflow.outputs.log import LogOutput

async def main():
    pipeline = StreamPipeline()
    
    # Add video source
    pipeline.add_source(FileSource("video.mp4", source_id="main_camera"))
    
    # Configure AI workers
    pipeline.worker_pool = WorkerPool([
        YOLOWorker("detector", model="yolov8n.pt")
    ])
    
    # Add output handler
    pipeline.add_output(LogOutput())
    
    # Handle events
    @pipeline.on_event("detection")
    async def on_detection(event):
        print(f"Detected: {event.data}")
    
    await pipeline.run()

Configuration-Driven Usage

Create pipeline.yaml:

name: "Real-Time Video Analysis"

sources:
  - id: "main_camera"
    type: "rtsp"
    url: "rtsp://camera.local/stream"
    fps: 30
  
  - id: "backup_file"
    type: "file"
    url: "video.mp4"
    fps: 30

workers:
  - id: "detector"
    type: "yolo"
    model: "yolov8n.pt"
    enabled: true
  
  - id: "ocr"
    type: "ocr"
    enabled: true

outputs:
  - id: "logger"
    type: "log"
    enabled: true
  
  - id: "api"
    type: "rest_api"
    host: "0.0.0.0"
    port: 8000
    enabled: true
  
  - id: "events_kafka"
    type: "kafka"
    broker: "localhost:9092"
    topic: "video_events"
    enabled: false

log_level: "INFO"
debug: false

Run with configuration:

visionflow run pipeline.yaml

Architecture

VisionFlow follows a layered, event-driven architecture designed for scalability, extensibility, and testability.

System Design

┌─────────────────────────────────────────────┐
│        User Application Layer               │
└─────────────────┬───────────────────────────┘
                  │
┌─────────────────▼───────────────────────────┐
│      StreamPipeline (Orchestrator)          │
├──────────┬───────────┬──────────┬───────────┤
│Ingestion │Processing │  Events  │  Outputs  │
│ • RTSP   │  • YOLO   │  • Bus   │ • REST    │
│ • File   │  • OCR    │  • Emit  │ • WS      │
│ • Custom │  • Custom │  • Sub   │ • Kafka   │
└──────────┴───────────┴──────────┴───────────┘
                  │
         ┌────────▼────────┐
         │  External       │
         │  Systems        │
         └─────────────────┘

Core Modules

Module	Purpose	Implementations
Ingestion	Video source abstraction	RTSP, File, Webcam, Custom
Processing	AI model execution	YOLO, OCR, Custom
Events	Async pub/sub messaging	Event, EventEngine, EventGenerator
Outputs	Event distribution	REST API, WebSocket, Kafka, File, Logging
Config	Settings management	YAML + Pydantic validation
CLI	Command-line interface	Configuration, execution, debugging

API Reference

Stream Pipeline

from visionflow import StreamPipeline
from visionflow.ingestion import RTSPSource, FileSource
from visionflow.processing.yolo import YOLOWorker
from visionflow.processing.pool import WorkerPool
from visionflow.outputs import RestAPIOutput, WebSocketOutput

# Initialize pipeline
pipeline = StreamPipeline(name="MyPipeline", debug=True)

# Add video sources (multiple supported)
pipeline.add_source(RTSPSource("rtsp://camera/stream", source_id="cam1"))
pipeline.add_source(FileSource("video.mp4", source_id="file1"))

# Configure worker pool for parallel inference
pipeline.worker_pool = WorkerPool([
    YOLOWorker("detector", model="yolov8m.pt")
])

# Add output handlers
pipeline.add_output(RestAPIOutput(host="0.0.0.0", port=8000))
pipeline.add_output(WebSocketOutput(port=8001))

# Register event handlers
@pipeline.on_event("detection")
async def handle_detection(event):
    """Process detection events."""
    print(f"Objects detected: {event.data}")

# Run the pipeline
await pipeline.run()

Event System

from visionflow import Event, StreamPipeline

# Events are immutable data containers
event = Event(
    event_type="person_detected",
    source_id="camera_1",
    data={"class": "person", "confidence": 0.95},
    metadata={"frame_id": 123, "timestamp": 1234567890}
)

# Register event handlers with decorators
@pipeline.on_event("person_detected")
async def on_person(event: Event) -> None:
    print(f"Event: {event.event_type}")
    print(f"Data: {event.data}")
    print(f"Source: {event.source_id}")

Custom Components

Extend VisionFlow with custom implementations:

Custom Source

from visionflow.ingestion.base import BaseSource

class CustomVideoSource(BaseSource):
    """Custom video source implementation."""
    
    async def connect(self) -> None:
        """Initialize connection."""
        # Setup code here
        pass
    
    async def disconnect(self) -> None:
        """Cleanup connection."""
        pass
    
    async def read_frame(self) -> Optional[Any]:
        """Read and return next frame."""
        # Return frame or None
        pass

Custom Worker

from visionflow.processing.base import BaseWorker

class CustomAIWorker(BaseWorker):
    """Custom AI model worker."""
    
    async def initialize(self) -> None:
        """Load model on startup."""
        self.model = load_model("model.pt")
    
    async def cleanup(self) -> None:
        """Cleanup on shutdown."""
        if hasattr(self, 'model'):
            del self.model
    
    async def process_frame(self, frame: Any) -> Dict[str, Any]:
        """Run inference on frame."""
        results = self.model.predict(frame)
        return {"predictions": results, "worker_id": self.worker_id}

Custom Output

from visionflow.outputs.base import BaseOutput
from visionflow.events import Event

class CustomOutput(BaseOutput):
    """Custom event output handler."""
    
    async def start(self) -> None:
        """Initialize output."""
        self.is_running = True
    
    async def stop(self) -> None:
        """Cleanup output."""
        self.is_running = False
    
    async def send_event(self, event: Event) -> None:
        """Process and send event."""
        if self.is_running:
            # Handle event
            pass

Examples

Complete example implementations are available in the tests/examples/ directory:

basic_detection.py – YOLO object detection with event handling
multi_source_api.py – Multiple video sources with REST API
custom_handlers.py – Event filtering and custom handlers

Run an example:

python tests/examples/basic_detection.py

Testing

VisionFlow includes comprehensive test coverage across all core components.

Run Tests

# Run all tests
pytest tests/ -v

# Run with coverage report
pytest tests/ --cov=visionflow --cov-report=html

# Run specific test file
pytest tests/test_events.py -v

# Run with detailed output and stop on first failure
pytest tests/ -vvs -x

Test Structure

tests/
├── test_config.py      # Configuration validation
├── test_events.py      # Event system and pub/sub
├── test_pipeline.py    # Pipeline integration tests
├── test_sources.py     # Video source tests
├── test_workers.py     # AI worker tests
├── test_outputs.py     # Output handler tests
└── examples/           # Working example implementations

Current test coverage: 105+ tests with high reliability.

Development

Setting Up Development Environment

# Clone repository
git clone https://github.com/FaisalAhmedBijoy/visionflow.git
cd visionflow

# Create virtual environment (Python 3.10+)
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install in development mode with all dependencies
pip install -e ".[dev,yolo,ocr,kafka]"

Code Quality Standards

VisionFlow maintains strict code quality standards:

# Format code with Black
black visionflow/ tests/

# Sort imports with isort
isort visionflow/ tests/

# Lint with flake8
flake8 visionflow/ tests/ --max-line-length=100

# Type check with mypy
mypy visionflow/ --strict

# Run all quality checks
make check

# Run development setup
python setup_dev.py

Make Targets

make test       # Run test suite
make check      # Run all linting and type checks
make format     # Format code with black and isort
make clean      # Remove build artifacts
make help       # Show all available commands

Documentation

Complete documentation is available in the docs/ directory:

Architecture Guide – Detailed design and component architecture
Architecture Diagrams – Visual system diagrams
API Reference – Complete API documentation
Quick Reference – Quick start examples
Contributing Guide – Contribution guidelines
Publishing Guide – PyPI publishing instructions

Project Structure

visionflow/                      # Main package
├── core/                        # Pipeline orchestrator and core logic
├── events/                      # Event system (Event, EventEngine, EventGenerator)
├── ingestion/                   # Video sources (BaseSource, RTSP, File, Webcam)
├── processing/                  # AI workers (BaseWorker, YOLO, OCR, WorkerPool)
├── outputs/                     # Output handlers (REST API, WebSocket, Kafka, File, Log)
├── config/                      # Configuration management (YAML + Pydantic)
├── cli/                         # Command-line interface
├── utils/                       # Utilities and helpers
└── py.typed                     # PEP 561 type marker for mypy

tests/                           # Comprehensive test suite
├── test_config.py               # Configuration tests
├── test_events.py               # Event system tests
├── test_pipeline.py             # Pipeline integration tests
├── test_sources.py              # Source/ingestion tests
├── test_workers.py              # Worker/processing tests
├── test_outputs.py              # Output handler tests
└── examples/                    # Example implementations

docs/                            # Documentation
├── ARCHITECTURE.md              # Architecture and design
├── ARCHITECTURE_DIAGRAM.md      # System diagrams
├── INDEX.md                     # API reference
├── PROJECT_SUMMARY.md           # Project overview
├── PUBLISHING.md                # Publishing guide
└── CODE_CORRECTIONS.md          # Quality metrics

Requirements

Python: 3.10 or higher
OS: Linux, macOS, or Windows

Core Dependencies

Package	Purpose	Version
opencv-python	Video processing	≥4.8.0
fastapi	REST API framework	≥0.104.0
pydantic	Configuration validation	≥2.4.0
numpy	Numerical operations	≥1.24.0
aiofiles	Async file I/O	≥23.2.0

Optional Dependencies

Package	Purpose	Install
ultralytics	YOLO models	`pip install visionflow-ai[yolo]`
pytesseract	OCR support	`pip install visionflow-ai[ocr]`
kafka-python	Kafka integration	`pip install visionflow-ai[kafka]`
paho-mqtt	MQTT integration	`pip install visionflow-ai[mqtt]`

See pyproject.toml for complete dependency list.

Performance

VisionFlow is optimized for production use:

Async I/O – Non-blocking operations throughout
Parallel Processing – Concurrent worker execution
Memory Efficient – Smart frame and event buffering
Low Latency – Optimized for real-time applications
Scalable – Handles multiple streams simultaneously

Contributing

We welcome contributions from the community! Please see CONTRIBUTING.md for:

Code of conduct
Development setup guide
Pull request process
Code style and standards
Testing requirements
Documentation guidelines

Quick Contribution Steps

Fork the repository
Create a feature branch: git checkout -b feature/my-feature
Make your changes and add tests
Run quality checks: make check
Commit with clear messages: git commit -m "Add my feature"
Push to your fork: git push origin feature/my-feature
Open a pull request with description

License

Licensed under the Apache License 2.0. See LICENSE file for full details.

Copyright 2024-2026 VisionFlow Contributors

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

Support & Community

Documentation: docs/ directory
Bug Reports: GitHub Issues
Discussions: GitHub Discussions
Contributing: See CONTRIBUTING.md

Acknowledgments

VisionFlow builds on excellent open-source projects:

Ultralytics YOLO – Object detection models
FastAPI – Web framework
Pydantic – Data validation
OpenCV – Computer vision library
Tesseract OCR – Text recognition

Citation

If you use VisionFlow in your research or project, please cite:

@software{visionflow2024,
  author = {Faisal Ahmed},
  title = {VisionFlow: Event-driven Real-time AI Video Processing},
  year = {2026},
  url = {https://github.com/FaisalAhmedBijoy/visionflow}
}

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.0.2

May 15, 2026

0.0.1

May 15, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

visionflow_ai-0.0.2.tar.gz (48.7 kB view details)

Uploaded May 15, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

visionflow_ai-0.0.2-py3-none-any.whl (48.9 kB view details)

Uploaded May 15, 2026 Python 3

File details

Details for the file visionflow_ai-0.0.2.tar.gz.

File metadata

Download URL: visionflow_ai-0.0.2.tar.gz
Upload date: May 15, 2026
Size: 48.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for visionflow_ai-0.0.2.tar.gz
Algorithm	Hash digest
SHA256	`32d83e870aa54aacd69fb3f5560086b9b064de62952e830313f5534c31409657`
MD5	`3e9695d414f8311bb2520361863766b2`
BLAKE2b-256	`3c94b6d9a66810208ea6d15474169503e6d4d2adf8b4ccfed1209a2c5a1afd8d`

See more details on using hashes here.

File details

Details for the file visionflow_ai-0.0.2-py3-none-any.whl.

File metadata

Download URL: visionflow_ai-0.0.2-py3-none-any.whl
Upload date: May 15, 2026
Size: 48.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for visionflow_ai-0.0.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`5ff7d148e87f8b18801a46b4117a20d243c9f10fef844fe36daad7f3a38ffdf5`
MD5	`17ebffdb453e62d302e2864f26caa3d0`
BLAKE2b-256	`71c7b271860a99dcb9565435b1072aeaa59fceb66748f620899520883f0a3f84`

See more details on using hashes here.

visionflow-ai 0.0.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

VisionFlow

Overview

Use Cases

Key Features

Installation

From PyPI

From Source

Quick Start

Basic Example

Configuration-Driven Usage

Architecture

System Design

Core Modules

API Reference

Stream Pipeline

Event System

Custom Components

Custom Source

Custom Worker

Custom Output

Examples

Testing

Run Tests

Test Structure

Development

Setting Up Development Environment

Code Quality Standards

Make Targets

Documentation

Project Structure

Requirements

Core Dependencies

Optional Dependencies

Performance

Contributing

Quick Contribution Steps

License

Support & Community

Acknowledgments

Citation

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes