Skip to main content

Minimal video generation and processing library.

Project description

videopython

PyPI Python License

Minimal, LLM-friendly Python library for programmatic video editing, processing, and AI video workflows.

Full documentation: videopython.com

Installation

1. Install FFmpeg

# macOS
brew install ffmpeg

# Ubuntu / Debian
sudo apt-get install ffmpeg

# Windows (Chocolatey)
choco install ffmpeg

2. Install videopython

pip install videopython          # core video/audio editing
pip install "videopython[ai]"    # + local AI features (GPU recommended)

Python >=3.10, <3.14. AI features run locally - no cloud API keys required, but model weights are downloaded on first use.

Quick Start

Video editing

from videopython import Video
from videopython.base import FadeTransition

intro = Video.from_path("intro.mp4").resize(1080, 1920)
clip = Video.from_path("raw.mp4").cut(10, 25).resize(1080, 1920).resample_fps(30)
final = intro.transition_to(clip, FadeTransition(effect_time_seconds=0.5))
final = final.add_audio_from_file("music.mp3")
final.save("output.mp4")

JSON editing plans

Define multi-segment edits as JSON - useful for LLM-driven workflows. VideoEdit.json_schema() returns a schema for plan generation/validation.

from videopython.editing import VideoEdit

plan = {
    "segments": [{
        "source": "raw.mp4",
        "start": 10.0,
        "end": 20.0,
        "transforms": [
            {"op": "resize", "args": {"height": 1280}},
            {"op": "speed_change", "args": {"speed": 1.25}},
        ],
    }],
    "post_effects": [
        {"op": "fade", "args": {"mode": "in", "duration": 0.5}, "apply": {"start": 0.0, "stop": 0.5}},
    ],
}

edit = VideoEdit.from_dict(plan)
edit.validate()   # dry-run via metadata (no frame loading)
final = edit.run()
final.save("output.mp4")

AI generation

from videopython.ai import TextToImage, ImageToVideo, TextToSpeech

image = TextToImage().generate_image("A cinematic mountain sunrise")
video = ImageToVideo().generate_video(image=image).resize(1080, 1920)
audio = TextToSpeech().generate_audio("Welcome to videopython.")
video.add_audio(audio).save("ai_video.mp4")

LLM & AI Agent Integration

videopython is designed to be controlled by LLMs. Every video operation exposes a machine-readable spec with descriptions, parameter types, and value constraints - all available as JSON Schema at runtime.

Schema generation - VideoEdit.json_schema() returns a complete JSON Schema describing valid edit plans. Pass it directly as a tool schema or structured-output format to any LLM API:

from videopython.editing import VideoEdit

schema = VideoEdit.json_schema()
# Pass `schema` to your LLM as a function/tool definition or response format.
# The LLM generates a plan dict, then:

edit = VideoEdit.from_dict(plan)
edit.validate()   # dry-run: checks sources, time ranges, params - no frames loaded
final = edit.run()
final.save("output.mp4")

Operation discovery - the registry lets an LLM (or your code) inspect all available operations, their parameters, and constraints:

from videopython.base import get_operation_specs, get_specs_by_category, OperationCategory

all_ops = get_operation_specs()                                    # all registered operations
transforms = get_specs_by_category(OperationCategory.TRANSFORMATION)  # just transforms

spec = all_ops["color_adjust"]
print(spec.description)       # LLM-friendly docstring
print(spec.to_json_schema())  # {"brightness": {"type": "number", "minimum": -1, "maximum": 1}, ...}

Every operation has LLM-optimized descriptions and rich constraints (minimum, maximum, enum, exclusive_minimum, etc.) so models generate valid parameters on the first try.

Docs: Editing Plans | Operation Registry

Features

videopython.base - core editing (no AI dependencies)

Area Highlights
Video I/O Video, VideoMetadata, FrameIterator - load, save, inspect
Editing plans VideoEdit, SegmentConfig - JSON/LLM-friendly multi-segment plans with full JSON Schema generation, dry-run validation, and operation registry
Multicam editing MultiCamEdit, CutPoint - switch between synchronized camera angles with transitions, replace audio with external track
Transforms Cut (time/frame), resize, crop, FPS resampling, speed change, picture-in-picture, reverse, freeze frame, silence removal
Transitions FadeTransition, BlurTransition, InstantTransition
Effects Blur, zoom, color grading, vignette, Ken Burns, image overlay, fade, text overlay, volume adjust
Audio Load/save, overlay, concat, normalize, time-stretch, silence detection, segment classification
Text Transcription data classes, TranscriptionOverlay for subtitle rendering
Scene detection Histogram-based scene boundaries (detect, detect_streaming, detect_parallel)

API docs: Core | Video | Audio | Editing Plans | Transforms | Transitions | Effects | Text

videopython.ai - local AI features (install with [ai])

Area Highlights
Generation TextToVideo, ImageToVideo, TextToImage, TextToSpeech, TextToMusic
Understanding AudioToText (transcription), AudioClassifier, SceneVLM (visual scene description), ActionRecognizer
Scene detection SemanticSceneDetector (neural scene boundaries)
Video analysis VideoAnalyzer - full-pipeline analysis combining multiple AI capabilities
Transforms FaceTracker, FaceTrackingCrop, SplitScreenComposite
Dubbing VideoDubber - voice cloning and revoicing with timing sync
Object swapping ObjectSwapper - detect, segment, and inpaint objects in video

API docs: Generation | Understanding | Transforms | Dubbing | Object Swapping

Examples

Development

See DEVELOPMENT.md for local setup, testing, and contribution workflow.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

videopython-0.26.10.tar.gz (153.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

videopython-0.26.10-py3-none-any.whl (178.0 kB view details)

Uploaded Python 3

File details

Details for the file videopython-0.26.10.tar.gz.

File metadata

  • Download URL: videopython-0.26.10.tar.gz
  • Upload date:
  • Size: 153.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for videopython-0.26.10.tar.gz
Algorithm Hash digest
SHA256 25a714dc7728aac977ee19b4c227c0b8a9e0b5b7be1c132ef00745900ae2134e
MD5 267ef3e817f5abf0a21ddb6ddf955360
BLAKE2b-256 f3d841d96cf3f55a4874b9d6dc9b39a865994e2a42a61a98e879f143f58a521b

See more details on using hashes here.

Provenance

The following attestation bundles were made for videopython-0.26.10.tar.gz:

Publisher: publish.yml on BartWojtowicz/videopython

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file videopython-0.26.10-py3-none-any.whl.

File metadata

  • Download URL: videopython-0.26.10-py3-none-any.whl
  • Upload date:
  • Size: 178.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for videopython-0.26.10-py3-none-any.whl
Algorithm Hash digest
SHA256 f3c208566ca57294848ff60066d287c9b29d140a4af94f7e988cebc258a5f817
MD5 d740b35463a17ae9e77d8f99ee8df4bf
BLAKE2b-256 6589bbd1909d48c0b49628bd089d15b505935c935c263f4d6f959b9595e0d476

See more details on using hashes here.

Provenance

The following attestation bundles were made for videopython-0.26.10-py3-none-any.whl:

Publisher: publish.yml on BartWojtowicz/videopython

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page