GPU-accelerated short video generator using NVIDIA VPF and PyTorch.

Project description

🎬 Shorts Maker (GPU Optimized)

Shorts Maker generates vertical video clips from longer gameplay footage. This Python library and CLI tool detects scenes, computes audio and video action profiles (sound intensity + visual motion), and combines them to rank scenes by overall intensity. It then crops to the desired aspect ratio and renders ready‑to‑upload shorts.

This version has been heavily optimized for NVIDIA GPUs using CUDA.

For the original CPU-only version, please visit Shorts Maker.

Python PyTorch CUDA Docker

Read the Full Documentation 📚

✨ Features

GPU-Accelerated Processing:
- Hardware Decoding & Resizing: Native NVIDIA Video Processing Framework (VPF) integration via PyNvCodec. Decodes, resizes, and converts color spaces directly on NVDEC.
- Scene Detection: Custom implementation using VPF and OpenCV.
- Audio Analysis: Uses torchaudio on GPU for fast RMS and spectral flux calculation.
- Video Analysis: Zero-copy GPU memory streaming for stable motion estimation (replaces heavy frame indices).
- Image Processing: Native PyTorch operators used for heavy operations like blurring backgrounds (separable convolutions).
- Rendering: Custom PyTorch+NVENC engine for high-performance rendering (MoviePy removed from render path).
- Robust Batch Processing: Video processing runs in fully isolated subprocesses, completely clearing CUDA contexts between files to prevent VRAM fragmentation and OOM crashes (especially in Docker/WSL).
Audio + video action scoring:
- Combined ranking with tunable weights (defaults: audio 0.6, video 0.4).
Scenes ranked by combined action score rather than duration.
Smart Scene Cutting:
- Preferentially selects complete scenes if they fit within the time limit.
- Scene Padding: Adds a 1.5-second buffer to the end of scenes to capture exit animations and fades.
- Smart Trimming: For long scenes, searches for "quiet" moments (low audio/motion) to cut, avoiding abrupt endings.
Smart cropping with optional blurred background for non‑vertical footage.
Retry logic during rendering to avoid spurious failures.
Configuration via .env environment variables.

📋 Requirements

NVIDIA GPU with CUDA support.
NVIDIA Drivers (compatible with CUDA 13.0+ recommended).
Python 3.12+
FFmpeg (used for audio extraction and NVENC encoding).
System libraries: libgl1, libglib2.0-0 (often needed for vision libraries).

Python dependencies (see pyproject.toml):

torch, torchaudio (with CUDA support)
PyNvCodec, PytorchNvCodec (Video Processing Framework)

🚀 Installation

Via PyPI (Recommended)

Ensure you have the NVIDIA drivers and CUDA toolkit installed. Then install the package directly:

pip install shorts-maker-gpu

Manual Setup from Source (Linux with CUDA)

Ensure you have the NVIDIA drivers and CUDA toolkit installed.

git clone https://github.com/artryazanov/shorts-maker-gpu.git
cd shorts-maker-gpu
python3 -m venv venv
source venv/bin/activate

# Install the library and its dependencies
pip install -e .

If you encounter issues with PyTorch not finding the GPU, refer to its installation guide for your specific CUDA version.

💡 Usage

Place source videos inside the gameplay/ directory.
Run the CLI tool:

shorts-maker process

You can optionally customize the input and output directories and scene limits:

shorts-maker process --input-dir my_videos/ --output-dir my_shorts/ --scene-limit 3

Generated clips are written to the generated/ directory.

During processing, the log shows an action score for each combined scene and the final list sorted by that score. The top scenes (by action intensity) are rendered first using NVENC.

🐳 Docker (Recommended)

The easiest way to run this application is using Docker with the NVIDIA Container Toolkit.

Prerequisite: NVIDIA Container Toolkit must be installed on the host.

Build and run:

docker build -t shorts-maker .

# Run with GPU access
docker run --rm \
    --gpus all \
    -v $(pwd)/gameplay:/app/gameplay \
    -v $(pwd)/generated:/app/generated \
    --env-file .env \
    shorts-maker

Note the --gpus all flag, which is essential for the application to access hardware acceleration.

⚙️ Configuration

Copy .env.example to .env and adjust values as needed.

Supported variables (defaults shown):

TARGET_RATIO_W=9 — Width part of the target aspect ratio (e.g., 9 for 9:16).
TARGET_RATIO_H=16 — Height part of the target aspect ratio (e.g., 16 for 9:16).
SCENE_LIMIT=4 — Maximum number of top scenes rendered per source video.
X_CENTER=0.5 — Horizontal crop center in range [0.0, 1.0].
Y_CENTER=0.5 — Vertical crop center in range [0.0, 1.0].
MAX_ERROR_DEPTH=3 — Maximum retry depth if rendering fails.
MIN_SHORT_LENGTH=15 — Minimum short length in seconds.
MAX_SHORT_LENGTH=179 — Maximum short length in seconds.
MAX_COMBINED_SCENE_LENGTH=300 — Maximum combined length (in seconds).
SAVE_FFMPEG_LOGS=False — Whether to save FFmpeg logs during rendering.

🛠️ Development

Linting

This project uses ruff for fast linting.

pip install ruff
ruff check .

🧪 Running Tests

Unit tests live in the tests/ folder. Run them with:

pytest -q

Note: The tests are designed to mock GPU availability if it is missing, so they can run in standard CI environments.

🚑 Troubleshooting

"Torch not installed" / "CUDA not available": Ensure you are running inside the Docker container with --gpus all or have the correct CUDA toolkit installed locally.
NVENC Error: If h264_nvenc fails, the script attempts to fall back to software encoding (libx264). Check if your GPU supports NVENC and if the drivers are up to date.

📄 License

This project is released under the MIT License.

Project details

Release history Release notifications | RSS feed

0.1.9

Mar 25, 2026

0.1.8

Mar 25, 2026

0.1.7

Mar 24, 2026

0.1.6

Mar 24, 2026

0.1.5

Mar 24, 2026

This version

0.1.4

Mar 23, 2026

0.1.3

Mar 23, 2026

0.1.2

Mar 21, 2026

0.1.1

Mar 21, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

shorts_maker_gpu-0.1.4.tar.gz (43.5 kB view details)

Uploaded Mar 23, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

shorts_maker_gpu-0.1.4-py3-none-any.whl (32.1 kB view details)

Uploaded Mar 23, 2026 Python 3

File details

Details for the file shorts_maker_gpu-0.1.4.tar.gz.

File metadata

Download URL: shorts_maker_gpu-0.1.4.tar.gz
Upload date: Mar 23, 2026
Size: 43.5 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for shorts_maker_gpu-0.1.4.tar.gz
Algorithm	Hash digest
SHA256	`a2dacac8288846c824532fd45f2f2716d4960b4ce4cc1da1624f8a52229fad60`
MD5	`69604e132e1571ff858182f42b89f797`
BLAKE2b-256	`1e296ac243e887ee904e164234e1c5423b9cbd91e1b75c189f1062a305b5ee3d`

See more details on using hashes here.

Provenance

The following attestation bundles were made for shorts_maker_gpu-0.1.4.tar.gz:

Publisher: publish.yml on artryazanov/shorts-maker-gpu

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: shorts_maker_gpu-0.1.4.tar.gz
- Subject digest: a2dacac8288846c824532fd45f2f2716d4960b4ce4cc1da1624f8a52229fad60
- Sigstore transparency entry: 1164652049
- Sigstore integration time: Mar 23, 2026
Source repository:
- Permalink: artryazanov/shorts-maker-gpu@6cf074da75cd82ec3046d754201f7d944eae7c38
- Branch / Tag: refs/tags/v0.1.4
- Owner: https://github.com/artryazanov
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@6cf074da75cd82ec3046d754201f7d944eae7c38
- Trigger Event: release

File details

Details for the file shorts_maker_gpu-0.1.4-py3-none-any.whl.

File metadata

Download URL: shorts_maker_gpu-0.1.4-py3-none-any.whl
Upload date: Mar 23, 2026
Size: 32.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for shorts_maker_gpu-0.1.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`10543319cf280eccdeb3f77b69f401a857c9a21d8a16b14a85462f3cc7c3a00c`
MD5	`2431e7a80b46820fe92a0298bbf5df7d`
BLAKE2b-256	`b2aa9eb2bfd89a078670730dc5ab51226e175cbcec5cf32d864b9f40425793a0`

See more details on using hashes here.

Provenance

The following attestation bundles were made for shorts_maker_gpu-0.1.4-py3-none-any.whl:

Publisher: publish.yml on artryazanov/shorts-maker-gpu

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: shorts_maker_gpu-0.1.4-py3-none-any.whl
- Subject digest: 10543319cf280eccdeb3f77b69f401a857c9a21d8a16b14a85462f3cc7c3a00c
- Sigstore transparency entry: 1164652128
- Sigstore integration time: Mar 23, 2026
Source repository:
- Permalink: artryazanov/shorts-maker-gpu@6cf074da75cd82ec3046d754201f7d944eae7c38
- Branch / Tag: refs/tags/v0.1.4
- Owner: https://github.com/artryazanov
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@6cf074da75cd82ec3046d754201f7d944eae7c38
- Trigger Event: release

shorts-maker-gpu 0.1.4

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

🎬 Shorts Maker (GPU Optimized)

Read the Full Documentation 📚

✨ Features

📋 Requirements

🚀 Installation

Via PyPI (Recommended)

Manual Setup from Source (Linux with CUDA)

💡 Usage

🐳 Docker (Recommended)

⚙️ Configuration

🛠️ Development

Linting

🧪 Running Tests

🚑 Troubleshooting

📄 License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance