LongSoraGen

LongSoraGen provides an OpenAI-compatible client API for generating Sora videos longer than a single request allows. It splits the request into segments and chains them so that each segment continues visually from the previous one.

1. Overview

LongSoraGen overcomes Sora's duration limitations by breaking down long video generation requests into multiple connected segments. The system uses AI-powered planning to create coherent narratives across segments and maintains visual continuity by using the last frame of each segment as a reference for the next.

Key Features

- Extended Duration Support: Generate videos longer than Sora's standard limits
- AI-Powered Segmentation: Uses GPT models to plan the prompt for each segment and the transitions between them
- Visual Continuity: Automatically extracts the last frame of each segment as the reference image for the next
- OpenAI-Compatible API: Drop-in replacement for the standard OpenAI client
- Async Support: Full async/await support through AsyncOpenAI
- Flexible Duration Planning: Validates that the target length can be built from the base durations (4s, 8s, 12s)

2. How It Works

LongSoraGen operates in three main stages:

AI Planning: Uses the OpenAI Responses API with GPT models to break down your base prompt into N coherent segments, each with its own refined prompt that maintains narrative continuity.
Sequential Generation: Generates each video segment in order:
- Creates the first segment using the original prompt
- Extracts the last frame from each completed segment
- Uses that frame as input_reference for the next segment to ensure visual consistency
Video Combination: Automatically combines all segments into a single output video using FFmpeg.

Duration validation uses dynamic programming to check that your requested total duration can be formed as a sum of Sora's base durations (4, 8, 12 seconds). For example:

16 seconds = 12 + 4 or 8 + 8 or 4 + 4 + 4 + 4
24 seconds = 12 + 12 or 12 + 8 + 4
20 seconds = 12 + 8 or 12 + 4 + 4
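The check described above can be sketched as a small coin-change style table. This is a minimal illustration, not the library's actual code: the function name `can_form_duration` and the constant `BASE_DURATIONS` are hypothetical, and only the base durations themselves come from this README.

```python
BASE_DURATIONS = (4, 8, 12)  # Sora's base segment lengths in seconds (from this README)


def can_form_duration(total: int, bases: tuple[int, ...] = BASE_DURATIONS) -> bool:
    """Return True if `total` is a sum of one or more values from `bases`.

    Hypothetical helper for illustration; the library's own validator may differ.
    """
    if total <= 0:
        return False
    # reachable[t] is True when exactly t seconds can be built from the bases.
    reachable = [False] * (total + 1)
    reachable[0] = True  # zero seconds needs no segments
    for t in range(1, total + 1):
        reachable[t] = any(t >= b and reachable[t - b] for b in bases)
    return reachable[total]


for seconds in (16, 20, 24, 15):
    print(seconds, can_form_duration(seconds))
```

With these base durations the table marks 16, 20, and 24 as formable and 15 as not formable.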

3. Installation

We highly recommend using uv to manage the environment:

Install dependencies:

uv sync

The environment will be installed in .venv. Activate it using:

source .venv/bin/activate

Set up OpenAI API Key:

export OPENAI_API_KEY='your-api-key-here'

4. Quick Start

Basic Example (Synchronous)

from pathlib import Path
from longsora import OpenAI

output_dir = Path("resources") / "case1"
prompt = "A woman is dancing in a bunch of trees."
model = "sora-2"
total_seconds = 24

if __name__ == "__main__":
    client = OpenAI()
    client.create_video(
        prompt=prompt,
        model=model,
        seconds=total_seconds,
        output_dir=output_dir,
        num_generations=3,
        verbose=True,
        save_segments=True,
        plan_model="gpt-5",
    )

Async Example

import asyncio
from pathlib import Path
from longsora import AsyncOpenAI

async def main():
    client = AsyncOpenAI()
    await client.create_video(
        prompt="A woman is dancing in a bunch of trees.",
        model="sora-2",
        seconds=16,
        output_dir=Path("resources") / "case2",
        num_generations=2,
        verbose=True,
        save_segments=True,
        plan_model="gpt-5",
    )

if __name__ == "__main__":
    asyncio.run(main())

5. API Reference

`client.create_video()`

Parameters:

| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| `prompt` | `str` | ✓ | - | Base prompt for video generation |
| `model` | `str` | ✗ | `"sora-2"` | Sora model to use |
| `seconds` | `int` | ✓ | - | Total video duration in seconds (must be formable from 4, 8, 12) |
| `output_dir` | `Path` | ✓ | - | Directory to save output and segments |
| `num_generations` | `int` | ✗ | `3` | Number of segments to split the video into |
| `plan_model` | `str` | ✗ | `"gpt-5"` | GPT model for segment planning |
| `verbose` | `bool` | ✗ | `True` | Enable detailed logging |
| `save_segments` | `bool` | ✗ | `True` | Save individual segments to disk |
| `size` | `str` | ✗ | - | Video resolution (e.g., "1080x1920") |
| `input_reference` | `FileTypes` | ✗ | - | Initial reference image/video |

Output:

The final combined video is saved as output.mp4 in the specified output_dir. If save_segments=True, individual segments are saved in output_dir/segments/:

segment_01.mp4, segment_02.mp4, etc.
segment_01_last.jpg, segment_02_last.jpg, etc. (last frame extractions)
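If you keep the segments and want to re-join them yourself, one common approach is FFmpeg's concat demuxer. The sketch below only builds the list file text and the command line. It assumes the segment naming shown above, and the helper names and the `concat.txt` file name are hypothetical. How LongSoraGen invokes FFmpeg internally is not documented here.

```python
from pathlib import Path


def concat_list_text(segment_paths: list[Path]) -> str:
    """Build the text of an FFmpeg concat list, one "file '<path>'" line per segment.

    Zero-padded names (segment_01.mp4, segment_02.mp4, ...) sort correctly as strings.
    Paths containing single quotes would need escaping, which this sketch skips.
    """
    ordered = sorted(segment_paths, key=lambda p: p.name)
    return "".join(f"file '{p.as_posix()}'\n" for p in ordered)


def concat_command(list_path: Path, output_path: Path) -> list[str]:
    """Return an ffmpeg argv that joins the listed files without re-encoding."""
    return [
        "ffmpeg", "-y",            # overwrite the output if it already exists
        "-f", "concat",            # use the concat demuxer
        "-safe", "0",              # allow absolute paths in the list file
        "-i", str(list_path),
        "-c", "copy",              # stream copy; assumes all segments share codec settings
        str(output_path),
    ]


segments = [Path("outputs/sunset/segments/segment_02.mp4"),
            Path("outputs/sunset/segments/segment_01.mp4")]
print(concat_list_text(segments), end="")
print(" ".join(concat_command(Path("concat.txt"), Path("outputs/sunset/output.mp4"))))
```

To run it for real, write the list text to the list file and pass the argv to `subprocess.run(..., check=True)`. Relative paths in the list are resolved relative to the list file's location, so absolute paths are the safer choice.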

6. Examples

Generate a 24-second video with 3 segments (8 seconds each)

from pathlib import Path
from longsora import OpenAI

client = OpenAI()
client.create_video(
    prompt="A sunset over the ocean with waves crashing",
    seconds=24,
    output_dir=Path("outputs/sunset"),
    num_generations=3,  # Will create 3×8s segments
)

Generate a 20-second video with a custom planning model

# Reuses `client` and `Path` from the previous example
client.create_video(
    prompt="A futuristic cityscape at night",
    model="sora-2",
    seconds=20,
    output_dir=Path("outputs/city"),
    num_generations=5,  # Will create 5×4s segments
    plan_model="gpt-4o",  # Use GPT-4o for planning
)
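The two examples above pair a total duration with a segment count (24s in 3 segments, 20s in 5 segments). As a rough illustration of which pairs can work, the sketch below splits a total as evenly as possible into exactly `num_segments` base durations. The even-split strategy and the name `split_evenly` are assumptions for illustration; this README does not state how the library distributes durations.

```python
def split_evenly(total: int, num_segments: int) -> list[int]:
    """Split `total` seconds into `num_segments` parts, each 4, 8, or 12 seconds.

    Hypothetical helper: works in units of 4 seconds, so each segment is 1 to 3 units.
    Raises ValueError when no such split exists.
    """
    unit = 4
    if num_segments <= 0 or total % unit != 0:
        raise ValueError(f"{total}s cannot be split into {num_segments} segments")
    units = total // unit
    # Each segment needs at least 1 unit (4s) and at most 3 units (12s).
    if not num_segments <= units <= 3 * num_segments:
        raise ValueError(f"{total}s cannot be split into {num_segments} segments")
    base, extra = divmod(units, num_segments)
    # `extra` segments get one additional unit; longer segments come first.
    return [(base + 1) * unit] * extra + [base * unit] * (num_segments - extra)


print(split_evenly(24, 3))
print(split_evenly(20, 5))
print(split_evenly(20, 2))
```

Under this assumption, 24 seconds in 3 segments gives three 8-second segments and 20 seconds in 5 segments gives five 4-second segments, matching the comments in the examples. A request such as 28 seconds in 2 segments fails because two segments can cover at most 24 seconds.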

7. Technical Details

Segment Planning

LongSoraGen uses the OpenAI Responses API to create intelligent segment prompts. The AI planner:

Analyzes your base prompt
Generates num_generations segment prompts that tell a cohesive story
Ensures each segment flows naturally into the next
Returns structured JSON with prompts and durations
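To make the last step concrete, here is a minimal sketch of validating a planner response before any video is generated. The JSON shape (`{"segments": [{"prompt": ..., "seconds": ...}]}`), the `SegmentPlan` class, and `parse_plan` are all hypothetical; the actual schema used by LongSoraGen is not shown in this README. It uses standard-library dataclasses to stay self-contained, although the project lists Pydantic as a dependency.

```python
import json
from dataclasses import dataclass

BASE_DURATIONS = (4, 8, 12)  # base segment lengths from this README


@dataclass(frozen=True)
class SegmentPlan:
    prompt: str
    seconds: int


def parse_plan(raw_json: str, total_seconds: int, num_generations: int) -> list[SegmentPlan]:
    """Parse an assumed planner response and check it against the request."""
    data = json.loads(raw_json)
    segments = [
        SegmentPlan(prompt=str(item["prompt"]).strip(), seconds=int(item["seconds"]))
        for item in data["segments"]
    ]
    if len(segments) != num_generations:
        raise ValueError(f"expected {num_generations} segments, got {len(segments)}")
    for index, segment in enumerate(segments, start=1):
        if not segment.prompt:
            raise ValueError(f"segment {index} has an empty prompt")
        if segment.seconds not in BASE_DURATIONS:
            raise ValueError(f"segment {index} has unsupported duration {segment.seconds}")
    planned = sum(segment.seconds for segment in segments)
    if planned != total_seconds:
        raise ValueError(f"segments add up to {planned}s, expected {total_seconds}s")
    return segments


raw = json.dumps({"segments": [
    {"prompt": "A dancer enters a forest clearing.", "seconds": 8},
    {"prompt": "She spins as leaves fall around her.", "seconds": 8},
]})
for plan in parse_plan(raw, total_seconds=16, num_generations=2):
    print(plan.seconds, plan.prompt)
```

Rejecting a malformed plan at this point is cheap compared with discovering the problem after several segments have already been generated.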

Visual Continuity

To ensure smooth transitions between segments:

After generating each segment, the last frame is extracted using OpenCV
This frame becomes the input_reference for the next segment
Sora uses this reference to maintain visual consistency
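The chaining described above can be sketched as a simple loop in which each segment's last frame becomes the reference for the next. The generator and frame extractor are passed in as plain callables so the sketch runs without any API access; `generate_chain` and both callable signatures are hypothetical stand-ins, not the library's internals.

```python
from typing import Callable, Optional

# (prompt, seconds, reference image or None) -> video bytes
GenerateFn = Callable[[str, int, Optional[bytes]], bytes]
# video bytes -> image bytes of the last frame
LastFrameFn = Callable[[bytes], bytes]


def generate_chain(
    plan: list[tuple[str, int]],
    generate: GenerateFn,
    extract_last_frame: LastFrameFn,
    initial_reference: Optional[bytes] = None,
) -> list[bytes]:
    """Generate segments in order, feeding each last frame into the next call."""
    videos: list[bytes] = []
    reference = initial_reference
    for prompt, seconds in plan:
        video = generate(prompt, seconds, reference)
        videos.append(video)
        # The next segment starts from where this one visually ended.
        reference = extract_last_frame(video)
    return videos


def fake_generate(prompt: str, seconds: int, reference: Optional[bytes]) -> bytes:
    # Stand-in for a Sora request; records what reference it was given.
    return f"{prompt}|{seconds}s|ref={reference!r}".encode()


def fake_last_frame(video: bytes) -> bytes:
    # Stand-in for frame extraction (the library uses OpenCV for this step).
    return b"last-frame"


for clip in generate_chain([("intro", 8), ("finale", 8)], fake_generate, fake_last_frame):
    print(clip.decode())
```

Because each call depends on the previous segment's output, this loop is inherently sequential, which is why segments are generated in order.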

Duration Validation

The system validates that your requested duration can be formed by combining Sora's base durations using a dynamic programming algorithm (coin change problem).

Valid durations include:

4, 8, 12 (base durations)
16, 20, 24, 28, 32... (combinations; every positive multiple of 4 can be formed)

Invalid durations:

1, 2, 3, 5, 6, 7, 9, 10, 11, 13, 14, 15... (anything that is not a multiple of 4)

8. Dependencies

The project uses:

OpenAI SDK: For Sora and GPT API access
FFmpeg: For video processing and combination
OpenCV: For frame extraction
httpx: For async HTTP requests
Pydantic: For data validation

See pyproject.toml for full dependency list.

9. License

MIT License

See LICENSE for details.

10. Acknowledgments

OpenAI for the Sora and GPT APIs
mshumer/sora-extend for inspiration on the prompt planning approach
FFmpeg for video processing capabilities

11. Citation

If you use this project in your research or applications, please cite:

@misc{longsoragen2025,
  author = {linkedlist771},
  title = {LongSoraGen: Extended Video Generation with OpenAI Sora},
  year = {2025},
  url = {https://github.com/linkedlist771/LongSoraGen}
}

12. Troubleshooting

Invalid Duration Error

If you get a ValueError about invalid duration:

Ensure your seconds parameter can be formed from 4, 8, and 12
Example: 15 seconds is invalid (cannot be formed), but 16 seconds is valid (12+4)

API Key Issues

Make sure your OPENAI_API_KEY environment variable is set:

export OPENAI_API_KEY='sk-...'

FFmpeg Not Found

Install FFmpeg:

# macOS
brew install ffmpeg

# Ubuntu/Debian
sudo apt-get install ffmpeg

# Windows
# Download from https://ffmpeg.org/download.html
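Both of the setup problems above can be caught before a long generation run starts. The following is a small preflight sketch using only the standard library. The function name `preflight_problems` is hypothetical and is not part of LongSoraGen.

```python
import os
import shutil
from typing import Callable, Mapping, Optional


def preflight_problems(
    env: Mapping[str, str] = os.environ,
    which: Callable[[str], Optional[str]] = shutil.which,
) -> list[str]:
    """Return a list of human-readable setup problems (empty when all is well)."""
    problems: list[str] = []
    if not env.get("OPENAI_API_KEY", "").strip():
        problems.append("OPENAI_API_KEY is not set")
    # shutil.which returns None when the executable is not on PATH.
    if which("ffmpeg") is None:
        problems.append("ffmpeg was not found on PATH")
    return problems


for problem in preflight_problems():
    print("Setup problem:", problem)
```

The `env` and `which` parameters exist only so the check can be exercised without touching the real environment.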

13. Contributing

Contributions are welcome! Please feel free to submit issues or pull requests.

14. Roadmap

Add support for custom segment duration distributions
Implement parallel segment generation where possible
Add video quality/style consistency controls
Support for audio continuation across segments
Web UI for easier interaction

