Skip to main content

HW accelerated video reading for ML Inference (CUDA version).

Project description

CeLux

CeLux is a high-performance Python library for video processing built on top of FFmpeg. It offers some of the fastest decode times for full HD video in the world, allowing for efficient and seamless video decoding directly into PyTorch tensors. CeLux gets its name from the latin words celer, and lux, meaning speed and light respectively.

Features

  • Ultra-Fast Video Decoding: Achieve lightning-fast decode times for full HD videos using hardware acceleration.
  • Direct Decoding to Tensors: Decode video frames directly into PyTorch tensors for immediate processing.
  • Hardware Acceleration Support: Utilize CUDA for GPU-accelerated decoding, significantly improving performance.
  • Easy Integration: Seamlessly integrates with existing Python workflows, making it easy to incorporate into your projects.
  • Supports Multiple Data Types: Handle video frames in uint8, float32, or float16 data types.

Table of Contents

Getting Started

Prerequisites

Before you begin, ensure you have met the following requirements:

  • FFmpeg: Required for audio/video processing functionalities.
  • PyTorch with CUDA Support: Required for tensor operations and GPU acceleration.
  • LibTorch: The PyTorch C++ library, necessary for building the project.
  • PyBind11: Used for creating Python bindings for C++ code.
  • C++17 Compiler: The project utilizes C++17 features.
  • CMake (Version 3.12 or higher): Used for building and managing the project configuration.
  • Vcpkg (Optional): For managing dependencies like FFmpeg on Windows.

Installation

Currently, celux can be installed by downloading the latest release package. Please follow these steps:

  1. Download the Latest Release:

    • Visit the Releases page and download the most recent .zip file, which includes all dependencies.
  2. Extract the Package:

    • Extract the contents of the .zip file to your desired location.
  3. Update Python Path:

    • Append the directory to your Python path to ensure Python can find the celux module.
    • Note: Make sure to import torch before importing celux in your Python scripts.

Building from Source

If you prefer to build celux from source, follow these steps:

  1. Clone the Repository:

    git clone https://github.com/Trentonom0r3/celux.git
    cd celux
    
  2. Configure the Project with CMake:

    cmake -B build -S . -DCMAKE_BUILD_TYPE=Release
    
    • Windows Users: If using Vcpkg, include the toolchain file:

      cmake -B build -S . -DCMAKE_BUILD_TYPE=Release -DCMAKE_TOOLCHAIN_FILE=<path_to_vcpkg>/scripts/buildsystems/vcpkg.cmake
      
  3. Build the Project:

    cmake --build build --config Release
    
  4. Install the Package:

    cd build
    cmake --install .
    
  5. Set Up Environment Variables:

    • Ensure that the FFmpeg binaries and any other dependencies are in your system's PATH.
    • Set the LD_LIBRARY_PATH or DYLD_LIBRARY_PATH on Unix systems if necessary.

Usage

Basic Example

Here's a simple example demonstrating how to use celux to read video frames and process them:

import torch  # Import torch first
import celux

def process_frame(frame):
    # Implement your frame processing logic here
    pass

# Ensure torch is imported before celux
with celux.VideoReader(
    "path/to/input/video.mp4",
    device="cpu",
    d_type="uint8"
) as reader:
    for frame in reader:
        # Frame will be in HWC format, uint8 data type, values in [0, 255]
        process_frame(frame)

Parameters:

  • device (str): Device to use. Can be "cpu" or "cuda".
  • dtype (str): Data type of the output frames ("uint8", "float", or "half").

Note: If you set dtype to "float" or "half", the frame values will be normalized between 0.0 and 1.0.

Contributing

Contributions are welcome! Please follow these steps to contribute to the project:

  1. Fork the Repository:

    • Click the "Fork" button at the top right of the repository page to create your own fork.
  2. Clone Your Fork:

    git clone https://github.com/your-username/celux.git
    cd celux
    
  3. Create a New Branch:

    git checkout -b feature/your-feature-name
    
  4. Make Your Changes:

    • Implement your feature or bugfix.
  5. Commit Your Changes:

    git commit -am "Add your commit message here"
    
  6. Push to Your Fork:

    git push origin feature/your-feature-name
    
  7. Submit a Pull Request:

    • Go to the original repository and click on "Pull Requests," then "New Pull Request."

Changelog

Version 0.2.6 (2024-10-15)

  • Pre-Release Update:
    • Removed Numpy support in favor of PyTorch tensors, gpu/cpu support provided.
    • Added NV12ToBGR, BGRToNV12, and NV12ToNV12 conversion modules.
    • Fixed a few small issues.
    • Updated documentation and examples.

Version 0.2.2 (2024-10-14)

  • Pre-Release Update:
    • Fixed a few small issues.
    • Made VideoReader and VideoWriter callable.
    • Created BGR onversions.
    • Added frame range (in/out) args.
    with VideoReader('input.mp4')([10, 20]) as reader:
      for frame in reader:
          print(f"Processing frame {frame}")
    

Version 0.2.1 (2024-10-13)

  • Pre-Release Update:
    • Adjusted Python bindings to use snake_case.
    • Added .pyi stub files to .whl.
    • Adjusted d_type args to (uint8, float32, float16).
    • Added github actions for new releases.
    • Added HW Accel Encoder support, direct encoding from numpy/Tensors.
    • Added has_audio property to VideoReader.get_properties().

Version 0.1.1 (2024-10-06)

  • Pre-Release Update:
    • Implemented support for multiple data types (uint8, float, half).
    • Provided example usage and basic documentation.

License

This project is licensed under the GNU Affero General Public License v3.0 (AGPL-3.0). See the LICENSE file for details.

Acknowledgments

  • FFmpeg: This project is built on top of the powerful FFmpeg library.
  • PyTorch: Utilizes PyTorch for tensor operations and CUDA support.
  • Vcpkg: Simplifies cross-platform dependency management.
  • NevermindNilas - for help testing, API suggestions + more.

FAQ

Q: Why do I need to import torch before celux?

A: celux depends on PyTorch, and importing torch first ensures that all the necessary CUDA context and resources are correctly initialized before celux uses them.

Q: Can I use celux without CUDA or GPU acceleration?

A: Yes, you can set useHardware=False when initializing VideoReader to use CPU decoding. However, performance may be significantly slower compared to GPU-accelerated decoding.

Q: What video formats are supported?

A: celux's goal is to support all video formats and codecs that are supported by FFmpeg. However, hardware-accelerated decoding may only be available for specific codecs like H.264 and HEVC. Currently, H264/H265/HEVC are the only codecs tested.

Q: How do I report a bug or request a feature?

A: Please open an issue on the GitHub Issues page with detailed information about the bug or feature request.

Roadmap

  • Additional Conversion Support:

    • Create additional conversion modules:
      • NV12ToBGR, BGRToNV12, NV12ToNV12(no change converter)
  • Audio Processing:

    • Introduce capabilities for audio extraction and processing.
  • Performance Enhancements:

    • Further optimize decoding performance and memory usage.
  • Cross-Platform Support:

    • Improve compatibility with different operating systems and hardware configurations.
  • Support for Additional Codecs:

    • Expand the range of supported video codecs.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

celux_cuda-0.2.6-py3-none-any.whl (16.6 kB view details)

Uploaded Python 3

File details

Details for the file celux_cuda-0.2.6-py3-none-any.whl.

File metadata

  • Download URL: celux_cuda-0.2.6-py3-none-any.whl
  • Upload date:
  • Size: 16.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for celux_cuda-0.2.6-py3-none-any.whl
Algorithm Hash digest
SHA256 2ce7fcb0151701004d3c205fc729b4a618f81b2d64bd25d36229f9370837fd54
MD5 fa123d28c3656259fc6bcd59aece17bb
BLAKE2b-256 e50c866ae1d34a18f5ddaac69835c45af71cdecb9f6e686902e06c897a04964a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page