A powerful Docker-based API for intelligent video generation with professional effects and subtitles

These details have not been verified by PyPI

Project links

Project description

Video Generation API v1.0

🎬 A powerful Docker-based API for intelligent video generation with professional effects and subtitles.

🚀 Quick Start

Pull and Run

# Pull the Docker image
docker pull betashow/video-generation-api:latest

# Run the container
docker run -d \
  --name video-api \
  -p 5000:5000 \
  betashow/video-generation-api:latest

The API will be available at http://localhost:5000

🚀 Want to Deploy This on AWS?

Check out my second open source project: CloudBurst

CloudBurst helps you deploy this Video Generation API on AWS with:

⚡ On-demand instances - Pay only when you need it
💰 96% cost savings - Compared to 24/7 GPU instances
🔄 Fully automated - Create → Deploy → Process → Terminate
📊 Real-time cost tracking - Know exactly what you're paying

Perfect for production use cases where you need to generate videos occasionally but don't want to maintain expensive infrastructure.

📖 API Documentation

Core Endpoint: `/create_video_onestep`

A single intelligent endpoint that automatically handles all video creation scenarios based on your input parameters.

Request Format

URL: POST http://your-server:5000/create_video_onestep

Headers:

{
  "Content-Type": "application/json",
  "X-Authentication-Key": "your-key-if-required"
}

Body Parameters:

Parameter	Type	Required	Description
`input_image`	string	Yes	Base64 encoded image (JPG/PNG)
`input_audio`	string	Yes	Base64 encoded audio (MP3/WAV)
`subtitle`	string	No	Base64 encoded SRT subtitle file
`effects`	array	No	Effects to apply. Available: `"zoom_in"`, `"zoom_out"`, `"pan_left"`, `"pan_right"`, `"random"`
`language`	string	No	Subtitle language: `"chinese"` or `"english"` (default: chinese)
`background_box`	boolean	No	Show subtitle background (default: true)
`background_opacity`	float	No	Subtitle background transparency 0-1 (default: 0.2) See important note below
`font_size`	integer	No	Subtitle font size in pixels (default: auto-calculated based on video size)
`outline_color`	string	No	Subtitle outline color in ASS format (default: "&H00000000" - black)
`is_portrait`	boolean	No	Force portrait orientation (default: auto-detect)
`watermark`	string	No	Base64 encoded watermark image
`output_filename`	string	No	Preferred output filename

Processing Scenarios

The API automatically detects and optimizes for 4 scenarios:

Scenario	Effects	Subtitles	Description
Baseline	❌	❌	Simple image + audio merge (fastest)
Subtitles Only	❌	✅	Basic video with professional subtitles
Effects Only	✅	❌	Cinematic zoom/pan effects
Full Featured	✅	✅	Effects + professional subtitles

Response Format

{
  "success": true,
  "file_id": "f47ac10b-58cc-4372-a567-0e02b2c3d479",
  "download_endpoint": "/download/f47ac10b-58cc-4372-a567-0e02b2c3d479",
  "filename": "output.mp4",
  "size": 15728640,
  "scenario": "full_featured"
}

Complete Examples

1. Baseline (Simplest)

import requests
import base64

def encode_file(filepath):
    with open(filepath, 'rb') as f:
        return base64.b64encode(f.read()).decode('utf-8')

# Prepare inputs
image_b64 = encode_file('image.jpg')
audio_b64 = encode_file('audio.mp3')

# Make request
response = requests.post('http://localhost:5000/create_video_onestep', 
    json={
        'input_image': image_b64,
        'input_audio': audio_b64
    }
)

result = response.json()
if result['success']:
    # Download the video
    download_url = f"http://localhost:5000{result['download_endpoint']}"
    video = requests.get(download_url)
    with open('output.mp4', 'wb') as f:
        f.write(video.content)

2. With Chinese Subtitles

subtitle_b64 = encode_file('subtitles.srt')

response = requests.post('http://localhost:5000/create_video_onestep',
    json={
        'input_image': image_b64,
        'input_audio': audio_b64,
        'subtitle': subtitle_b64,
        'language': 'chinese',
        'background_box': True,
        'background_opacity': 0.2
    }
)

3. With Effects

# Zoom effects (randomly picks one)
response = requests.post('http://localhost:5000/create_video_onestep',
    json={
        'input_image': image_b64,
        'input_audio': audio_b64,
        'effects': ['zoom_in', 'zoom_out']  # Randomly chooses zoom_in OR zoom_out
    }
)

# Pan effects
response = requests.post('http://localhost:5000/create_video_onestep',
    json={
        'input_image': image_b64,
        'input_audio': audio_b64,
        'effects': ['pan_left']  # Pan from right to center
    }
)

# Let system choose randomly from all effects
response = requests.post('http://localhost:5000/create_video_onestep',
    json={
        'input_image': image_b64,
        'input_audio': audio_b64,
        'effects': ['random']  # System picks any available effect
    }
)

4. Full Featured (Effects + Subtitles)

response = requests.post('http://localhost:5000/create_video_onestep',
    json={
        'input_image': image_b64,
        'input_audio': audio_b64,
        'subtitle': subtitle_b64,
        'effects': ['zoom_in', 'zoom_out'],
        'language': 'chinese'
    }
)

5. Advanced Subtitle Customization

response = requests.post('http://localhost:5000/create_video_onestep',
    json={
        'input_image': image_b64,
        'input_audio': audio_b64,
        'subtitle': subtitle_b64,
        'language': 'chinese',
        'font_size': 48,                    # Custom font size
        'outline_color': '&H00FF0000',      # Blue outline
        'background_box': True,             # Show background
        'background_opacity': 0.3           # 30% transparent (dark background)
    }
)

Other Endpoints

Health Check

GET /health

Returns API status, FFmpeg version, and available endpoints.

Download Video

GET /download/{file_id}

Download the generated video file. Files expire after 1 hour.

Cleanup Expired Files

GET /cleanup

Manually trigger cleanup of expired files.

🔧 Authentication

The API supports two modes:

Default Mode (No Authentication)

By default, the API is open and requires no authentication.

Secure Mode

Set the AUTHENTICATION_KEY environment variable to enable authentication:

docker run -d \
  -e AUTHENTICATION_KEY=your-secure-uuid-here \
  -p 5000:5000 \
  betashow/video-generation-api:latest

Then include the key in your requests:

headers = {
    'Content-Type': 'application/json',
    'X-Authentication-Key': 'your-secure-uuid-here'
}

🎯 Features

Intelligent Processing: Automatically optimizes based on input parameters
Professional Subtitles: High-quality subtitle rendering (not FFmpeg filters)
Auto-Orientation: Detects portrait/landscape videos automatically
Cinematic Effects: Hollywood-style zoom and pan effects
Multi-Language: Supports Chinese and English with proper fonts
GPU Acceleration: Automatic GPU detection and usage when available

🎨 Advanced Subtitle Styling

Subtitle Background Transparency

⚠️ IMPORTANT: The background_opacity parameter controls transparency, not opacity!

Value	Visual Result	Description
0.0	Solid black	Completely opaque background
0.2	Dark background	Default - Good readability
0.5	Semi-transparent	50% see-through
0.7	Very transparent	Old default - quite see-through
1.0	No background	Completely transparent

Examples:

For darker, more readable subtitles: Use lower values (0.0 - 0.3)
For more transparent subtitles: Use higher values (0.5 - 1.0)
Recommended: 0.2 (the new default) provides excellent readability

# Dark, readable background (recommended)
'background_opacity': 0.2

# Solid black background
'background_opacity': 0.0

# Very transparent (hard to read)
'background_opacity': 0.8

Color Format (ASS/SSA Style)

The outline_color parameter uses ASS subtitle format: &HAABBGGRR where:

AA = Alpha (transparency): 00 = opaque, FF = transparent
BB = Blue component (00-FF)
GG = Green component (00-FF)
RR = Red component (00-FF)

Common Colors:

&H00000000 - Black (default)
&H00FFFFFF - White
&H000000FF - Red
&H0000FF00 - Green
&H00FF0000 - Blue
&H0000FFFF - Yellow
&H00FF00FF - Magenta

Font Size Guidelines

If not specified, font size is auto-calculated based on video resolution:

1080p Landscape: ~45px for Chinese, ~60px for English
1080p Portrait: ~21px for Chinese, ~30px for English
4K Videos: Proportionally larger

📋 Requirements

Docker
2GB+ RAM (4GB recommended)
10GB+ free disk space
GPU (optional, for faster processing)

🎬 Output Examples

See what this API can generate:

English Example:

Chinese Example:

Features Demonstrated:

✅ Professional subtitles with semi-transparent background
✅ Smooth zoom effects (Ken Burns effect)
✅ Perfect audio-visual synchronization
✅ High-quality 1080p video output
✅ Support for both English and Chinese

Both examples were generated using the "Full Featured" mode with subtitles and effects enabled.

🐳 Docker Image Details

The image includes:

Ubuntu 22.04 base
FFmpeg with GPU support
Python 3.10
Chinese fonts (LXGW WenKai Bold)
All required video processing libraries

📝 Notes

All file inputs must be Base64 encoded
Generated videos expire after 1 hour
The API returns relative download paths, not full URLs
This is designed for on-demand, disposable container usage

🚨 Important

This Docker image is designed for temporary, on-demand usage. The container can be destroyed and recreated as needed - all paths are relative and no persistent storage is required.

Ready to generate amazing videos? Start the container and make your first request!

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.0.1

Aug 8, 2025

This version

1.0.0

Aug 8, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

video_generation_api-1.0.0.tar.gz (58.1 kB view details)

Uploaded Aug 8, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

video_generation_api-1.0.0-py3-none-any.whl (29.6 kB view details)

Uploaded Aug 8, 2025 Python 3

File details

Details for the file video_generation_api-1.0.0.tar.gz.

File metadata

Download URL: video_generation_api-1.0.0.tar.gz
Upload date: Aug 8, 2025
Size: 58.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.9.23

File hashes

Hashes for video_generation_api-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`43ba523f8ad85beb248defd7ff9a92c9bf7974af8625597b4e5be1ba9ae35b48`
MD5	`b15e873627f8ecd5e056fd8998a6a726`
BLAKE2b-256	`a63ec8fefff22a661d563ba5b6d2495137d5cbcccf8c29153fc915d06f6a7239`

See more details on using hashes here.

File details

Details for the file video_generation_api-1.0.0-py3-none-any.whl.

File metadata

Download URL: video_generation_api-1.0.0-py3-none-any.whl
Upload date: Aug 8, 2025
Size: 29.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.9.23

File hashes

Hashes for video_generation_api-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e030db49393d73515a58db0cddc2b97bfd41785ef5fdd60cf7f3af7ae11ec603`
MD5	`05fa8d160feb3e79f4aba6d85c8b3203`
BLAKE2b-256	`80bafb1109b44ce8cb2ae085ffe4bfc75c6d2e2ab61dc4f45a97a7b6d07edfc5`

See more details on using hashes here.

video-generation-api 1.0.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Video Generation API v1.0

🚀 Quick Start

Pull and Run

🚀 Want to Deploy This on AWS?

📖 API Documentation

Core Endpoint: /create_video_onestep

Request Format

Processing Scenarios

Response Format

Complete Examples

Other Endpoints

Health Check

Download Video

Cleanup Expired Files

🔧 Authentication

Default Mode (No Authentication)

Secure Mode

🎯 Features

🎨 Advanced Subtitle Styling

Subtitle Background Transparency

Color Format (ASS/SSA Style)

Font Size Guidelines

📋 Requirements

🎬 Output Examples

🐳 Docker Image Details

📝 Notes

🚨 Important

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Core Endpoint: `/create_video_onestep`