AI model that understands text and humanoids.
Project description
OmniHuman
AI model that understands text and humanoids.
Installation
pip install omnihuman
or install editable from source
git clone https://github.com/mdsrqbl/omnihuman.git
cd omnihuman
pip install -e .
Usage
import omnihuman
import PIL.Image
text = "Raise both hands and clap overhead."
frames = omnihuman.read_frames("path/to/image.jpg") # (n_frames, channels, height, width)
# model = omnihuman.OmniHuman()
# frames = model.video_generation(text, frames)
PIL.Image.fromarray(frames[-1].permute(1,2,0).numpy()).show()
Full documentation is available at omnihuman.readTheDocs.io.
Citation & License
@misc{mdsr2024omnihuman,
author = {Mudassar Iqbal},
title = {OmniHuman: AI model that understands text and humanoids.},
year = {2024},
publisher = {GitHub},
howpublished = {\url{https://github.com/mdsrqbl/omnihuman}}
}
This project is licensed under Apache License 2.0 - see the LICENSE file for details.
You are permitted to use the library & models, create modified versions, or incorporate pieces of the code into your own work. Your product or research, whether commercial or non-commercial, must provide appropriate credit to the original author(s) by citing this repository & research papers.
Stay tuned for research papers!
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
omnihuman-0.1.1.tar.gz
(7.4 kB
view hashes)
Built Distribution
Close
Hashes for omnihuman-0.1.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | ba20b29792ab3c3ba3432c52f7972997292d019f34e70e1cc39a7e22e90f29e2 |
|
MD5 | 656550bbe86af93b9f58812d4537a265 |
|
BLAKE2b-256 | 8aa091fd3ee94aa47b4c7152039cc60f3ba387e636a0f07765eb093f872c6658 |