Skip to main content

An animation and video rendering engine for audio-based and voice-based podcast videos.

Project description

Audim ✨

Documentation Author: Atanu Sarkar License: Apache 2.0 Citation

Audio Podcast Animation Engine

An animation and video rendering engine for audio-based and voice-based podcast videos.

| Features | Getting Started | Quick Links |

🚀 Demo

[!NOTE]

For this example, we have transformed a conversation between Grant Sanderson (from 3Blue1Brown) and Sal Khan (from Khan Academy) from YouTube into a visually engaging podcast video using Audim.

See docs/devblog/v0.0.7.md for more details on how this video was generated.

🔗 Quick Links

  1. Getting Started
    • See Setup and ensure you have setup correctly before usage.
    • For developers and contributors, see Development.
  2. API Documentation
    • See API Docs for the audim API documentation.
  3. Usage and Examples
    • See Usage for usage examples.
  4. Dev Blog
    • See Dev Blog for the development blog of the project to gain more insights into the project.
    • See Changelog for the changelog of the project.

🎯 Introduction

Audim is an engine for precise programmatic animation and rendering of podcast videos from audio-based and voice-based file recordings.

✨ Features

  • 💻 Precise programmatic animations.
  • 🎬 Rendering of videos with layout based scenes.
  • 📝 Generate subtitles and transcripts from audio/video files.
  • 🎤 From subtitle and scene elements to podcast video generation.

🚀 Getting Started

Prerequisites

  • 🐍 Python ≥ 3.10
  • 🖥️ Conda
  • 🎥 FFmpeg (optional, for faster video encoding)

Installation

1. Clone the repository:

git clone https://github.com/mratanusarkar/audim.git

2. Install FFmpeg locally (optional)

Using local FFmpeg is optional but recommended for speeding up the video encoding process.

On Ubuntu, install FFmpeg using:

sudo apt install ffmpeg libx264-dev

On Windows and other platforms, download and install FFmpeg from the official website:

3. Install uv and setup project environment:

[!IMPORTANT] If you are using conda base environment as the default base environment for your python projects, run the below command to activate the base environment. If not, skip this step and continue with the next step.

conda activate base
# Install uv
pip install uv

# Setup project environment
uv venv

source .venv/bin/activate   # on Linux
# .venv\Scripts\activate    # on Windows

uv pip install -e ".[dev,docs]"

4. Build and deploy documentation

You can build and serve the documentation by running:

uv pip install -e .[docs]
mkdocs serve

Code Quality

Before committing, please ensure that the code is formatted and styled correctly. Run the following commands to check and fix code style issues:

# Check and fix code style issues
ruff format .
ruff check --fix .

⚖️ License & Attribution

Audim is licensed under Apache 2.0. You can use it freely for personal and commercial projects.

Attribution is encouraged. If you use Audim, please:

  • Keep the default watermark in videos, OR
  • Add "Made with Audim" to video descriptions, OR
  • Link to this repo in your project documentation

See NOTICE file for complete attribution guidelines.

📄 Citation

If you use Audim in your project or research, please cite it as follows:

@software{audim,
  title = {Audim: Audio Podcast Animation Engine},
  author = {Sarkar, Atanu},
  year = {2025},
  url = {https://github.com/mratanusarkar/audim},
  version = {0.0.7}
}

You can also click the "Cite this repository" button on GitHub for other citation formats.

⚠️ Disclaimer

[!WARNING] Early Development Stage

  • This project is actively under development and may contain bugs or limitations.
  • While stable for basic use cases, the rendering engine requires further development and testing across diverse scenarios.
  • The API is subject to change, so keep an eye at the documentation for the latest updates.

[!TIP] We encourage you to:

  • Try Audim for your projects and podcast videos.
  • Report issues when encountered.
  • Feel free to raise a PR to contribute and improve the project.

Your feedback and contributions help make Audim better for everyone!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

audim-0.0.6.tar.gz (38.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

audim-0.0.6-py3-none-any.whl (42.5 kB view details)

Uploaded Python 3

File details

Details for the file audim-0.0.6.tar.gz.

File metadata

  • Download URL: audim-0.0.6.tar.gz
  • Upload date:
  • Size: 38.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.7

File hashes

Hashes for audim-0.0.6.tar.gz
Algorithm Hash digest
SHA256 bee47bb4e4569a607ea62fefa8c312c0d1edd3adf28094d4440efdec5fbeebe9
MD5 a71e8977ca4408487f69f90ee019c21e
BLAKE2b-256 b44c3b4d08f7022d9e8eb8dbdaa360b17dcd035e23580d0f9be06ddea988cf5a

See more details on using hashes here.

File details

Details for the file audim-0.0.6-py3-none-any.whl.

File metadata

  • Download URL: audim-0.0.6-py3-none-any.whl
  • Upload date:
  • Size: 42.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.7

File hashes

Hashes for audim-0.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 b90f24d3706ab6014b486c03e0f1b4aa78ec2cd8dc0f4e4c4acfbd856cdd190c
MD5 040a7e08f53c6e66ce834bc1e94ca537
BLAKE2b-256 bd377050fd0892f1b4349213bf21fd54515c70e8c741f759f6950222411a1237

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page