lipsync is a simple and updated Python library for lip synchronization, based on Wav2Lip. It synchronizes lips in videos and images based on provided audio, supports CPU/CUDA, and uses caching for faster processing.

These details have not been verified by PyPI

Project links

Project description

lipsync

lipsync is a Python library that moves lips in a video (or image) to match a given audio file. It is based on Wav2Lip, but many unneeded files and libraries have been removed, and the code has been updated to work with the latest versions of Python.

Features

Video lip synchronization
Synchronize lips in an existing video to match a new audio file.
Image lip animation
Provide a single image and an audio file to generate a talking video.
Runs on CPU and CUDA
You can choose whether to run on your CPU or a CUDA-enabled GPU for faster processing.
Caching
If you use the same video multiple times with different audio files, lipsync can cache frames and reuse them. This makes future runs much faster.

Pre-Trained Weights

lipsync works with two different pre-trained models:

Wav2Lip (Download wav2lip.pth)
- More accurate lip synchronization
- Lips in the result may appear somewhat blurred
Wav2Lip + GAN (Download wav2lip_gan.pth)
- Lips in the result are clearer
- Synchronization may be slightly less accurate

Installation

pip install lipsync

Usage Example

Below is a simple example in Python. This assumes you have the model weights (either wav2lip.pth or wav2lip_gan.pth) in a weights/ folder.

from lipsync import LipSync

lip = LipSync(
    model='wav2lip',
    checkpoint_path='weights/wav2lip.pth',
    nosmooth=True,
    device='cuda',
    cache_dir='cache',
    img_size=96,
    save_cache=True,
)

lip.sync(
    'source/person.mp4',
    'source/audio.wav',
    'result.mp4',
)

Important Parameters

model: 'wav2lip' or 'wav2lip_gan'
checkpoint_path: Path to the model weights (e.g., wav2lip.pth, wav2lip_gan.pth)
nosmooth: Set True to disable smoothing
device: 'cpu' or 'cuda'
cache_dir: Directory for saving frames
save_cache: Set True to save frames to cache_dir for faster re-runs

Ethical Use

Please be mindful when using lipsync. This library can generate videos that look convincing, so it could be used to spread disinformation or harm someone’s reputation. We encourage using it only for entertainment or scientific purposes, and always with respect and consent from any people involved.

License and Citation

The software can only be used for personal/research/non-commercial purposes. Please cite the following paper if you have use this code:

@inproceedings{10.1145/3394171.3413532,
    author = {Prajwal, K R and Mukhopadhyay, Rudrabha and Namboodiri, Vinay P. and Jawahar, C.V.},
    title = {A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild},
    year = {2020},
    isbn = {9781450379885},
    publisher = {Association for Computing Machinery},
    address = {New York, NY, USA},
    url = {https://doi.org/10.1145/3394171.3413532},
    doi = {10.1145/3394171.3413532},
    booktitle = {Proceedings of the 28th ACM International Conference on Multimedia},
    pages = {484–492},
    numpages = {9},
    keywords = {lip sync, talking face generation, video generation},
    location = {Seattle, WA, USA},
    series = {MM '20}
}

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.1.0

Dec 22, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lipsync-0.1.0.tar.gz (13.4 kB view details)

Uploaded Dec 22, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

lipsync-0.1.0-py3-none-any.whl (14.0 kB view details)

Uploaded Dec 22, 2024 Python 3

File details

Details for the file lipsync-0.1.0.tar.gz.

File metadata

Download URL: lipsync-0.1.0.tar.gz
Upload date: Dec 22, 2024
Size: 13.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.5.1 CPython/3.7.17 Linux/6.8.0-49-generic

File hashes

Hashes for lipsync-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`97f7ae48e81bcc42337bf0f2c2895b52a690462556c84f0994fddbb34604c8f6`
MD5	`f946c44410b90303845d00748c2054ec`
BLAKE2b-256	`d24bc82bcba2398bcbc5dd2b7ed368a9d929e5bc486c8f7d8c82e491d8718315`

See more details on using hashes here.

File details

Details for the file lipsync-0.1.0-py3-none-any.whl.

File metadata

Download URL: lipsync-0.1.0-py3-none-any.whl
Upload date: Dec 22, 2024
Size: 14.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.5.1 CPython/3.7.17 Linux/6.8.0-49-generic

File hashes

Hashes for lipsync-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d3a7fc3f945cd0ffc0a9119fa54129062231716f5eb49eb9ed1ad632d2f40a45`
MD5	`a3a75f502213aba3b0f173e06f68c92e`
BLAKE2b-256	`7b1e0ac4a5b3d40f1dbf0bf94c1d9e7d6b50e7ba631882dc6a9e6b8cbf54283d`

See more details on using hashes here.

lipsync 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

lipsync

Features

Pre-Trained Weights

Installation

Usage Example

Important Parameters

Ethical Use

License and Citation

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes