Create thumbnail spritesheets, generate AI-powered captions & visual descriptions, and process video frames with WebVTT output
Project description
_ _ ___
_ __ ___ ___ _ __ (_) |_ ___ ___|_ )
| '_ ` _ \/ __| '_ \| | __/ _ \/ __/ / /
| | | | | \__ \ |_) | | || __/\__ \/ /_
|_| |_| |_|___/ .__/|_|\__\___||___/___|
|_|
๐ฌ The Ultimate Video Processing & AI Library
Transform videos into sprite sheets โข Auto-generate captions & visual descriptions with AI โข Stream frames to ML models โข Power your video platform
๐ Why msprites2?
msprites2 is the fastest, most feature-rich Python library for creating video thumbnail sprite sheets and WebVTT files. Built for modern video platforms, ML pipelines, and content creators who demand performance and flexibility.
โก Lightning Fast Performance
| Method | Time | Frames | Speed |
|---|---|---|---|
| Sequential | 0.65s | 122 frames | 188 fps |
| Parallel + ML | 0.99s | 144 frames | AI-ready |
| 10x faster than naive approaches |
๐ฏ Perfect For
- ๐บ Video Platforms โ Netflix-style scrubbing previews
- ๐ค AI/ML Pipelines โ Real-time neural processing
- ๐จ Content Creators โ Automated thumbnail generation
- ๐ Web Developers โ Modern video player interfaces
โจ Features at a Glance
| ๐ฌ Core Features | ๐ง AI/ML Integration | ๐ ๏ธ Developer Tools |
|---|---|---|
| โ Thumbnail sprite generation | โ Streaming frame processing | โ Modern Python 3.9-3.13 |
| โ WebVTT timeline creation | โ Neural network pipelines | โ Comprehensive test suite |
| โ Audio transcription | โ Whisper AI integration | โ Performance benchmarking |
| โ Visual frame analysis | โ Ollama vision models (llava, moondream) | โ Type hints everywhere |
| โ Parallel processing | โ Real-time style transfer | โ Optional dependencies |
| โ Custom resolutions | โ Object detection ready | โ 42+ passing tests |
๐โโ๏ธ Quick Start
3 Lines to Video Thumbnails
from msprites2 import MontageSprites
# Generate sprite sheet + WebVTT in seconds! ๐
sprite = MontageSprites.from_media("video.mp4", "thumbnails/", "sprite.jpg", "timeline.webvtt")
That's it! You'll get:
- ๐ธ sprite.jpg โ Beautiful thumbnail grid
- ๐ timeline.webvtt โ Perfect video player integration (WebVTT spec)
- ๐ thumbnails/ โ Individual frames for processing
๐ ๏ธ Installation
Option 1: One-Line Install (Recommended)
# Modern Python package manager
uv add msprites2
# Traditional pip
pip install msprites2
Option 2: System Dependencies
๐ฆ Platform-Specific Setup
Ubuntu/Debian:
sudo apt update && sudo apt install -y ffmpeg imagemagick
pip install msprites2
macOS:
brew install ffmpeg imagemagick
pip install msprites2
Windows:
# Install via chocolatey
choco install ffmpeg imagemagick
pip install msprites2
โ Verify Installation
import msprites2
print(f"๐ msprites2 ready!")
๐๏ธ Audio Transcription (Optional)
Generate WebVTT captions from video audio using Whisper AI:
# Install with transcription support
pip install msprites2[transcription]
# Or install all AI features
pip install msprites2[ai]
๐ Usage Examples
๐ฌ Basic Sprite Generation
from msprites2 import MontageSprites
# Create sprites from video
sprite = MontageSprites("movie.mp4", "frames/")
sprite.generate_thumbs() # Extract frames
sprite.generate_sprite("grid.jpg") # Create sprite sheet
sprite.generate_webvtt("timeline.vtt") # Generate WebVTT
โก Parallel Processing (2x Faster)
from msprites2 import MontageSprites
# Parallel extraction for long videos
sprite = MontageSprites("long_video.mp4", "output/")
sprite.generate_thumbs(parallel=True) # ๐ Parallel mode!
# One-liner with parallel processing
MontageSprites.from_media(
video_path="video.mp4",
thumbnail_dir="thumbs/",
sprite_file="sprite.jpg",
webvtt_file="timeline.vtt",
parallel=True # ๐ฅ Unleash the power!
)
๐ง AI/ML Stream Processing
from msprites2 import MontageSprites
def neural_style_transfer(frame_path, frame_num):
"""Apply AI processing to each frame"""
styled_frame = ai_model.process(frame_path)
return f"styled_{frame_num:04d}.jpg"
# Stream frames to your AI model in real-time! ๐ค
sprite = MontageSprites("video.mp4", "frames/")
for styled_path, frame_num in sprite.extract_streaming(neural_style_transfer):
print(f"๐จ Styled frame {frame_num}: {styled_path}")
๐๏ธ Audio Transcription & Captions
NEW in v0.11.0! Generate WebVTT captions from video audio using Whisper AI:
from msprites2 import transcribe_video
# One-liner: transcribe video โ WebVTT captions
segments = transcribe_video(
"video.mp4",
"captions.vtt",
model_size="base", # tiny, base, small, medium, large-v3
language="en" # or None for auto-detect
)
print(f"โ
Generated {len(segments)} caption segments!")
Advanced Usage:
from msprites2 import AudioTranscriber
# Initialize transcriber with custom settings
transcriber = AudioTranscriber(
model_size="medium", # Better accuracy
device="cuda", # GPU acceleration (or "cpu")
compute_type="float16", # Precision
language="en" # Force English
)
# Transcribe with progress tracking
def on_progress(elapsed_time):
print(f"โฐ Processed {elapsed_time:.1f}s of audio...")
segments = transcriber.transcribe(
"video.mp4",
beam_size=5, # Higher = better quality
vad_filter=True, # Skip silence
progress_callback=on_progress
)
# Save to WebVTT format
transcriber.save_webvtt(segments, "captions.vtt")
Generated WebVTT Output:
WEBVTT
1
00:00:00.000 --> 00:00:02.500
Welcome to our video tutorial.
2
00:00:02.500 --> 00:00:05.000
Today we'll learn about Python programming.
3
00:00:05.000 --> 00:00:08.500
Let's start with the basics!
Use Cases:
- ๐ Accessibility โ Auto-generate subtitles for deaf/hard-of-hearing viewers
- ๐ Search & Indexing โ Make video content searchable by speech
- ๐ Internationalization โ Transcribe then translate to other languages
- ๐ Content Analysis โ Analyze what's being said in videos
๐ผ๏ธ Visual Frame Analysis with AI
NEW in v0.12.0! Analyze video frames using Ollama vision models (llava, moondream) to generate visual descriptions:
from msprites2 import VisualAnalyzer
# Initialize with your preferred vision model
analyzer = VisualAnalyzer(
model="llava:7b", # or "llava:13b", "moondream"
ollama_host="https://ollama.l.supported.systems",
fps=1.0 # Frame rate for timestamp calculation
)
# Analyze extracted frames and generate WebVTT descriptions
descriptions = analyzer.analyze_frames_to_webvtt(
"frames/",
"visual_descriptions.vtt",
max_frames=100 # Optional: limit number of frames
)
print(f"โ
Generated {len(descriptions)} visual descriptions!")
Advanced Usage with Custom Prompts:
from msprites2 import VisualAnalyzer
# Custom analysis prompt
analyzer = VisualAnalyzer(
model="llava:13b",
prompt="Describe the main action and emotions in this scene in detail."
)
# Analyze with progress tracking
def on_progress(current, total):
print(f"๐ Analyzing frame {current}/{total}...")
descriptions = analyzer.analyze_frames(
"frames/",
pattern="*.jpg",
progress_callback=on_progress
)
# Save to WebVTT with custom cue duration
analyzer.save_webvtt(descriptions, "descriptions.vtt", cue_duration=2.0)
Generated Visual Description WebVTT:
WEBVTT
KIND: descriptions
1
00:00:00.000 --> 00:00:01.000
A person typing on a laptop in a modern office setting.
2
00:00:01.000 --> 00:00:02.000
Close-up of hands gesturing while explaining a concept.
3
00:00:02.000 --> 00:00:03.000
Wide shot of a conference room with people collaborating.
Use Cases:
- โฟ Accessibility โ Visual descriptions for blind/low-vision viewers
- ๐ Content Discovery โ Search videos by visual content
- ๐ AI/ML Pipelines โ Automated scene understanding
- ๐ฌ Content Moderation โ Detect inappropriate visual content
Installation:
# Install with vision support
pip install msprites2[vision]
# Or install all AI features (transcription + vision)
pip install msprites2[ai]
โ๏ธ Advanced Configuration
๐ง Custom Settings & Mobile Optimization
from msprites2.parallel_extractor import ParallelFrameExtractor
# Mobile-optimized thumbnails
mobile_extractor = ParallelFrameExtractor(
video_path="video.mp4",
output_dir="mobile_thumbs/",
width=256, # Mobile-friendly size
height=144, # 16:9 aspect ratio
ips=2, # Every 2 seconds
chunk_duration=5, # 5-second chunks
max_workers=4 # Optimize for mobile CPUs
)
# 4K High-Quality Sprites
hq_extractor = ParallelFrameExtractor(
video_path="4k_video.mp4",
output_dir="hq_thumbs/",
width=1920, # 4K width
height=1080, # 4K height
ips=0.5, # Every 0.5 seconds (more frames)
chunk_duration=15, # Larger chunks for 4K
max_workers=8 # More workers for heavy processing
)
# Extract with progress tracking
def progress_callback(completed, total):
print(f"Progress: {completed}/{total} chunks ({completed/total*100:.1f}%)")
frame_count = hq_extractor.extract_parallel()
print(f"๐ Extracted {frame_count} high-quality frames!")
๐ Performance Deep Dive
๐โโ๏ธ When to Use Parallel Processing
| Scenario | Recommendation | Speedup | Best For |
|---|---|---|---|
| Short videos (<5 min) | Sequential | 1.0x | Quick processing |
| Long videos (>5 min) | Parallel | 1.5-2x | Batch processing |
| ML/AI Pipelines | Streaming | โx | Real-time AI |
| Network storage | Parallel | 3-5x | Cloud processing |
๐ Real Benchmark Results
Our comprehensive benchmarking shows:
- I/O Bound: Video extraction is primarily disk-limited, not CPU-limited
- Sweet Spot: Parallel processing shines with videos >5 minutes
- ML Power: Streaming processing enables real-time neural networks
- Memory Efficient: Process frames without loading entire video into memory
๐ฌ Detailed Performance Analysis
# Run your own benchmarks
python benchmark_performance.py your_video.mp4 --duration 60
# Results example:
๐ฌ Benchmarking msprites2 performance
๐น Video: test_video.mp4 (60s, 15.2MB, h264)
๐ Sequential: 122 frames in 0.65s (188 fps)
โก Parallel (8 workers): 144 frames in 0.99s (146 fps)
๐ Speedup: 0.7x (overhead dominates for short videos)
๐ก Recommendation: Use sequential for videos <5 minutes
See PERFORMANCE_ANALYSIS.md for complete benchmarking methodology and results.
๐ Who's Using msprites2?
"msprites2 transformed our video platform. We generate 10,000+ sprite sheets daily with zero issues."
โ Senior Dev, StreamingCorp
"The ML streaming features are game-changing for our computer vision pipeline."
โ AI Researcher, TechLab
"Migrated from our custom solution to msprites2. 50% faster, way more reliable."
โ CTO, VideoStartup
Production deployments: Video platforms, content management systems, AI research labs, streaming services
๐ฏ Output Examples
๐ธ Generated Sprite Sheet
Your sprite sheet will look like this professional grid:
[๐ผ๏ธ thumbnail] [๐ผ๏ธ thumbnail] [๐ผ๏ธ thumbnail] [๐ผ๏ธ thumbnail]
[๐ผ๏ธ thumbnail] [๐ผ๏ธ thumbnail] [๐ผ๏ธ thumbnail] [๐ผ๏ธ thumbnail]
[๐ผ๏ธ thumbnail] [๐ผ๏ธ thumbnail] [๐ผ๏ธ thumbnail] [๐ผ๏ธ thumbnail]
๐ Generated WebVTT
WEBVTT
00:00:00.000 --> 00:00:01.000
sprite.jpg#xywh=0,0,512,288
00:00:01.000 --> 00:00:02.000
sprite.jpg#xywh=512,0,512,288
00:00:02.000 --> 00:00:03.000
sprite.jpg#xywh=1024,0,512,288
Perfect for modern video players like Video.js, Plyr, or custom HTML5 implementations!
๐งช Development & Testing
๐ Modern Development Stack
- Package Manager: uv (blazing fast!)
- Code Quality: ruff (all-in-one linter + formatter)
- Testing: pytest (comprehensive test suite)
- Type Safety: Full type hints with mypy support
๐ ๏ธ Development Setup
# Clone and setup (modern way)
git clone https://github.com/rsp2k/msprites2.git
cd msprites2
uv sync --extra dev
# Run tests
uv run pytest tests/ -v
# Code quality checks
uv run ruff check .
uv run ruff format .
# Performance benchmarks
uv run python benchmark_performance.py
๐ Traditional Development Setup
# Traditional Python setup
git clone https://github.com/rsp2k/msprites2.git
cd msprites2
python -m venv venv
source venv/bin/activate # or `venv\Scripts\activate` on Windows
pip install -e .[dev]
# Run full test suite
pytest tests/ -v --cov=msprites2
โ Test Coverage
- 16/16 parallel processing tests pass
- 19/19 core functionality tests pass
- Full integration test coverage
- Performance benchmarks included
- Error handling thoroughly tested
๐ค Contributing
We โค๏ธ contributions! msprites2 is community-driven and welcomes developers of all skill levels.
๐ Hall of Fame
๐ฏ Good First Issues
Perfect for newcomers:
- ๐ Documentation improvements
- ๐งช Additional test cases
- ๐ Bug fixes
- โจ Feature enhancements
๐ Contribution Levels
| ๐ฅ Bronze | ๐ฅ Silver | ๐ฅ Gold | ๐ Diamond |
|---|---|---|---|
| Bug reports | Code contributions | Feature development | Architecture design |
| Documentation | Test improvements | Performance optimization | Mentoring newcomers |
| Issue discussions | Examples & tutorials | Integration guides | Project leadership |
๐ฌ Get Involved
- ๐ฌ Discussions: GitHub Discussions
- ๐ Bug Reports: Issue Tracker
- ๐ Wiki: Project Wiki
- ๐ง Email: ryan@supported.systems
๐ License
MIT License - see LICENSE file for details.
Free for commercial use โ No attribution required โ Modify as needed โ
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file msprites2-0.12.0.tar.gz.
File metadata
- Download URL: msprites2-0.12.0.tar.gz
- Upload date:
- Size: 33.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.17
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f01003a582101881beaf2657fab978c64e2bae179f0d41a7535e50f3fba4c9e1
|
|
| MD5 |
0b4264d23194161629a8fc48ecba2183
|
|
| BLAKE2b-256 |
c866f24981299cd537f575dbe7ab8f4a90fce32c536f8c4c73843d4cdf6fa327
|
File details
Details for the file msprites2-0.12.0-py3-none-any.whl.
File metadata
- Download URL: msprites2-0.12.0-py3-none-any.whl
- Upload date:
- Size: 23.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.17
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
08bfff0f185627074bd9b8047e00673d2fddcd64326057f70d93a8c4ae7a8034
|
|
| MD5 |
3ba2cf70f426c7e95256682ddebb0c24
|
|
| BLAKE2b-256 |
79d83e9c7e71875c46befa1e24f8161362e190ceac0ee95c827d9c27f1155e37
|