Portable C++ avatar runtime — Python bindings via pybind11. Powers the bitHuman Essence pipeline cross-platform.

These details have not been verified by PyPI

Project links

Project description

bithuman

This is the Python flavor of Layer 3: a platform-specific library for app developers. It wraps the Layer 1 libessence engine. For the CLI tool see docs/CLI.md.

┌─────────────────────────────────────────────────────────────┐
│ Layer 3: Platform-specific libraries (app developers)       │
│   - Python wheel       pip install bithuman    ◄──── you are here
│   - Swift package      SwiftPM Bithuman                     │
│   - Kotlin AAR         ai.bithuman:sdk                      │
│   - (future) Rust crate, JS/TS, Go, ...                     │
└─────────────────────────────────────────────────────────────┘
                          ▼ embeds
┌─────────────────────────────────────────────────────────────┐
│ Layer 2: bithuman CLI (end-user tool)                       │
│   - one cross-platform binary on macOS / Linux / Windows    │
│   - brew install bithuman · curl-pipe installer             │
└─────────────────────────────────────────────────────────────┘
                          ▼ links
┌─────────────────────────────────────────────────────────────┐
│ Layer 1: libessence engine (cross-platform C++ core)        │
│   - portable C ABI, same source on every target             │
│   - macOS · iOS · Android · Linux · Windows                 │
│   - never imported directly by app developers               │
└─────────────────────────────────────────────────────────────┘

Python bindings for the bitHuman SDK — the portable C++ avatar engine (libessence) that powers our cross-platform lipsync pipeline. The wheel ships a native pybind11 module that talks directly to libessence, so you get the same per-frame cost as our Swift and Kotlin clients with none of the GIL noise.

On an Apple M5 with 24 GB unified memory we measure ~640 FPS sustained compose (1.56 ms/frame mean, 2.03 ms p99) for a 1248×704 avatar, with ~206 MB peak RSS end-to-end. Cold load is ~14 ms for the fixture and ~400 ms for the first compose tick (lazy ONNX init).

This package is namespace-isolated from the v0 bithuman SDK; you can install both side-by-side.

Install

pip install bithuman

Current PyPI version: 0.1.1 (matches the libessence-v0.1.1 tag).

Compatibility

Platforms: macOS arm64, Linux x86_64, Linux arm64 — all ship as wheels. Windows is tracked for a follow-up.
Python: 3.10 – 3.13 (cp310, cp311, cp312, cp313). CPython only.
ABI: wraps libessence ABI v4 (auth + auto-fit canvas).
Auth: ships with live heartbeat against api.bithuman.ai baked into libessence. Avatar.load(api_secret=...) is the entry point; BITHUMAN_API_SECRET env var works too. Set BITHUMAN_UNMETERED=1 for dev / parity-test runs.

What you get

The package exposes three API tiers (all importable from bithuman):

Tier	Types	Use when…
Async	`AsyncAvatar`, `AudioChunk`, `VideoControl`, `VideoFrame`	Hosting a service / parity with legacy `AsyncBithuman`
Sync facade	`Avatar`, `ComposedFrame`, `EP`	Offline / batch / CLI rendering
Low-level	`Fixture`, `Runtime`, `EP_CPU`/`EP_AUTO`/`EP_COREML`/`EP_NNAPI`/`EP_QNN`	Direct C ABI access, custom audio pipeline

Error types: BithumanError (base), TokenError / TokenExpiredError / TokenValidationError / TokenRequestError / AccountStatusError (auth), ModelError / ModelNotFoundError / ModelLoadError / ModelSecurityError / ExpressionModelNotSupported (fixture), RuntimeNotReadyError.

Version info: bithuman.__version__ (Python package), bithuman.__core_version__ (linked libessence), bithuman.__abi_version__.

Quickstart

from bithuman import Avatar

with Avatar.load("model.imx") as avatar:
    for frame in avatar.compose("speech.wav"):
        # frame.bgr is a (H, W, 3) uint8 numpy array in BGR pixel order
        cv2.imshow("avatar", frame.bgr)
        cv2.waitKey(40)

Avatar.compose accepts a 16 kHz float32 mono numpy array OR a path to any WAV / MP3 / FLAC / OGG file (decoded and resampled via soundfile when needed).

CLI

A essence-render console script ships with the wheel:

pip install 'bithuman[cli]'

essence-render \
  --model ~/.cache/bithuman/models/sample-avatar.imx \
  --audio speech.wav \
  --output out.mp4

Pass --output - to stream raw BGR24 frames to stdout (handy for piping into a separate ffmpeg pipeline or a custom encoder). Other flags:

Flag	Default	Description
`--fps`	25	Output FPS for the MP4 container.
`--quality`	80	libx264 quality 1..100 (higher = better).
`--ep`	`cpu`	Execution provider hint (`cpu`/`auto`/`coreml`/…).
`--threads`	1	ORT intra-op thread count.
`--no-audio`	–	Skip audio muxing; produce a silent video.

Example end-to-end run (5 s sine sweep):

essence-render 0.1.0: model=sample-avatar.imx audio=sine_sweep_5s.wav ep=cpu threads=1
essence-render: loaded fixture in 14.9 ms — 1248x704 @ 25 fps, 183 clusters, 202 src frames
essence-render: composed 122 frames in 1.83s (14.96 ms/frame, 66.8 fps)
essence-render: wrote /tmp/sine_sweep_5s.mp4

(Throughput here is bounded by H.264 encode, not Essence inference. Use --output - if you want to measure raw compose speed.)

Low-level API

If you need finer control or want to swap in a custom audio pipeline, the C ABI is exposed directly:

import numpy as np
from bithuman import Fixture, Runtime, EP_CPU

fx = Fixture("model.imx", preferred_ep=EP_CPU, intra_op_threads=1)
rt = Runtime(fx)
pcm = np.fromfile("speech.f32", dtype=np.float32)  # 16 kHz mono float32
cluster_idx, bgr = rt.tick_compose(pcm, frame_idx_hint=-1)
# bgr.shape == (fx.frame_height, fx.frame_width, 3), dtype uint8

Pass the entire pcm buffer to each tick_compose call; the runtime maintains an internal cursor and advances one tick per call until the audio is exhausted.

Build from source

You need the prebuilt parent C++ archive at cpp/build/libessence.a (run the parent CMake build first), plus the runtime deps from Homebrew (onnxruntime, webp, ffmpeg, hdf5, jpeg-turbo).

cd cpp/bindings/python
uv pip install -e '.[cli,test]' --no-build-isolation

The CMake glue links the prebuilt static archive directly — it does NOT re-run the parent build, so iterate on bindings without paying the C++ rebuild cost.

Performance

Measured with tests/bench.py against the v1 compose path (audio → composited BGR frame) on Apple M5 24 GB:

Metric	Value
Fixture load	14 ms
First compose (lazy v1 init)	399 ms
Steady-state mean	1.56 ms / frame
p50	1.54 ms
p99	2.03 ms
Sustained throughput	641 FPS
Peak RSS (proc)	206 MB

Linux wheels

Pre-built manylinux_2_28 wheels ship for x86_64 + aarch64 across cp310 through cp313 — 8 wheels in total, all auditwheel-repaired with the full dep tree bundled (ORT, FFmpeg, HDF5, libjpeg-turbo, libwebp, libcurl, OpenSSL).

To rebuild them locally:

# One-time: build the dep-baked Docker images (~10 min each).
docker build --platform linux/amd64 -t libessence/manylinux-x86_64:0.1 \
    -f scripts/Dockerfile.manylinux-x86_64 scripts/
docker build --platform linux/arm64/v8 -t libessence/manylinux-aarch64:0.1 \
    -f scripts/Dockerfile.manylinux-aarch64 scripts/

# Per wheel build (~2 min):
docker run --rm --platform linux/amd64 -v "$REPO":/src \
    -e PYTAG=cp311 -e ARCH_INSIDE=x86_64 \
    libessence/manylinux-x86_64:0.1 \
    bash /src/cpp/bindings/python/scripts/build-wheel-in-container.sh

Limitations

Windows wheels not yet built — tracked for v0.2.
The CLI's output framerate is fixed at 25 fps to match the model's internal rate. Pass --output - and pipe to your own encoder if you need temporal resampling.
preferred_ep=COREML/NNAPI/QNN is accepted but currently no-ops to CPU in the v0.1 build.

License

Commercial. Contact hello@bithuman.ai.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.19.1

May 18, 2026

1.19.0

May 18, 2026

1.18.5

May 18, 2026

1.18.4

May 16, 2026

1.18.3 yanked

May 16, 2026

Reason this release was yanked:

bug identified

1.18.1 yanked

May 16, 2026

Reason this release was yanked:

bug identified

1.18.0 yanked

May 16, 2026

Reason this release was yanked:

the C++ runtime has a deeper incompatibility that prevents it from loading any production .imx file.

1.17.6 yanked

May 16, 2026

Reason this release was yanked:

the C++ runtime has a deeper incompatibility that prevents it from loading any production .imx file.

1.17.5 yanked

May 15, 2026

Reason this release was yanked:

bug identified

1.17.4 yanked

May 15, 2026

Reason this release was yanked:

bug identified

1.17.3 yanked

May 15, 2026

Reason this release was yanked:

C++ pybind11 bindings with broken dependencies and incompatible API. Use bithuman==1.11.3 (Python SDK).

1.17.2 yanked

May 15, 2026

Reason this release was yanked:

C++ pybind11 bindings with broken dependencies and incompatible API. Use bithuman==1.11.3 (Python SDK).

1.17.1 yanked

May 15, 2026

Reason this release was yanked:

C++ pybind11 bindings with broken dependencies and incompatible API. Use bithuman==1.11.3 (Python SDK).

1.17.0 yanked

May 15, 2026

Reason this release was yanked:

C++ pybind11 bindings with broken dependencies and incompatible API. Use bithuman==1.11.3 (Python SDK).

1.15.2 yanked

May 13, 2026

Reason this release was yanked:

C++ pybind11 bindings with broken dependencies and incompatible API. Use bithuman==1.11.3 (Python SDK).

1.15.1 yanked

May 13, 2026

Reason this release was yanked:

C++ pybind11 bindings with broken dependencies and incompatible API. Use bithuman==1.11.3 (Python SDK).

1.15.0 yanked

May 13, 2026

Reason this release was yanked:

C++ pybind11 bindings with broken dependencies and incompatible API. Use bithuman==1.11.3 (Python SDK).

1.14.0 yanked

May 13, 2026

Reason this release was yanked:

C++ pybind11 bindings with broken dependencies and incompatible API. Use bithuman==1.11.3 (Python SDK).

1.13.0 yanked

May 13, 2026

Reason this release was yanked:

C++ pybind11 bindings with broken dependencies and incompatible API. Use bithuman==1.11.3 (Python SDK).

1.12.4 yanked

May 12, 2026

Reason this release was yanked:

C++ pybind11 bindings with broken dependencies and incompatible API. Use bithuman==1.11.3 (Python SDK).

1.12.3 yanked

May 12, 2026

Reason this release was yanked:

C++ pybind11 bindings with broken dependencies and incompatible API. Use bithuman==1.11.3 (Python SDK).

1.12.2 yanked

May 12, 2026

Reason this release was yanked:

C++ pybind11 bindings with broken dependencies and incompatible API. Use bithuman==1.11.3 (Python SDK).

1.12.1 yanked

May 12, 2026

Reason this release was yanked:

C++ pybind11 bindings with broken dependencies and incompatible API. Use bithuman==1.11.3 (Python SDK).

This version

1.12.0 yanked

May 12, 2026

Reason this release was yanked:

C++ pybind11 bindings with broken dependencies and incompatible API. Use bithuman==1.11.3 (Python SDK).

1.11.3

May 12, 2026

1.11.2

May 7, 2026

1.11.1

May 7, 2026

1.11.0

May 6, 2026

1.10.9

May 5, 2026

1.10.8

Apr 30, 2026

1.10.7

Apr 29, 2026

1.10.6

Apr 21, 2026

1.10.5

Apr 21, 2026

1.10.4

Apr 21, 2026

1.10.3

Apr 21, 2026

1.10.0

Apr 21, 2026

1.9.0

Apr 20, 2026

1.8.5

Apr 20, 2026

1.8.4

Apr 18, 2026

1.8.3

Apr 17, 2026

1.7.9

Apr 2, 2026

1.7.8

Mar 27, 2026

1.7.7

Mar 23, 2026

1.7.6

Mar 8, 2026

1.7.5

Mar 5, 2026

1.7.4

Feb 27, 2026

1.7.3

Feb 19, 2026

1.7.2

Feb 19, 2026

1.7.0

Feb 16, 2026

1.6.7

Feb 10, 2026

1.6.6

Feb 10, 2026

1.5.0

Feb 9, 2026

0.8.1

Jan 27, 2026

0.8.0

Jan 27, 2026

0.1.3 yanked

May 12, 2026

Reason this release was yanked:

C++ pybind11 bindings with broken dependencies and incompatible API. Use bithuman==1.11.3 (Python SDK).

0.1.2 yanked

May 12, 2026

Reason this release was yanked:

C++ pybind11 bindings with broken dependencies and incompatible API. Use bithuman==1.11.3 (Python SDK).

0.1.1 yanked

May 12, 2026

Reason this release was yanked:

C++ pybind11 bindings with broken dependencies and incompatible API. Use bithuman==1.11.3 (Python SDK).

0.1.0 yanked

May 12, 2026

Reason this release was yanked:

C++ pybind11 bindings with broken dependencies and incompatible API. Use bithuman==1.11.3 (Python SDK).

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

bithuman-1.12.0-cp313-cp313-macosx_26_0_arm64.whl (183.7 kB view details)

Uploaded May 12, 2026 CPython 3.13macOS 26.0+ ARM64

bithuman-1.12.0-cp311-cp311-manylinux_2_28_x86_64.whl (14.5 MB view details)

Uploaded May 12, 2026 CPython 3.11manylinux: glibc 2.28+ x86-64

bithuman-1.12.0-cp311-cp311-manylinux_2_28_aarch64.whl (13.4 MB view details)

Uploaded May 12, 2026 CPython 3.11manylinux: glibc 2.28+ ARM64

File details

Details for the file bithuman-1.12.0-cp313-cp313-macosx_26_0_arm64.whl.

File metadata

Download URL: bithuman-1.12.0-cp313-cp313-macosx_26_0_arm64.whl
Upload date: May 12, 2026
Size: 183.7 kB
Tags: CPython 3.13, macOS 26.0+ ARM64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.9

File hashes

Hashes for bithuman-1.12.0-cp313-cp313-macosx_26_0_arm64.whl
Algorithm	Hash digest
SHA256	`9a56639b32e9010d952e0237ef256d82d6759a30c52967d7494c0cac660cb482`
MD5	`cc23fa876a9f50fa287b280127058cfe`
BLAKE2b-256	`8a36b17be047aa1b818d9965f1ba27793544bfd83f2251a1e8332104dbf17f8c`

See more details on using hashes here.

File details

Details for the file bithuman-1.12.0-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

Download URL: bithuman-1.12.0-cp311-cp311-manylinux_2_28_x86_64.whl
Upload date: May 12, 2026
Size: 14.5 MB
Tags: CPython 3.11, manylinux: glibc 2.28+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for bithuman-1.12.0-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm	Hash digest
SHA256	`49bb78086f8837e809913ca5aed2e09c59d51e2fbbb3ed743a5cc58d5deef7e1`
MD5	`b45af153dd59c12c235e32ed277dad66`
BLAKE2b-256	`89239704ebf36e89626e96efd71fb46cfa2e3cd0355e1ab65e0a968676634407`

See more details on using hashes here.

File details

Details for the file bithuman-1.12.0-cp311-cp311-manylinux_2_28_aarch64.whl.

File metadata

Download URL: bithuman-1.12.0-cp311-cp311-manylinux_2_28_aarch64.whl
Upload date: May 12, 2026
Size: 13.4 MB
Tags: CPython 3.11, manylinux: glibc 2.28+ ARM64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.9

File hashes

Hashes for bithuman-1.12.0-cp311-cp311-manylinux_2_28_aarch64.whl
Algorithm	Hash digest
SHA256	`c71dee28409dfb688bdbcbb6e18e516aab78a80ed2e9dae32bf1dc839d3456f3`
MD5	`9757d9be777e8d54e0d2c7d4d6fe6428`
BLAKE2b-256	`07855c3fb6e91a89f25a6e7336da1abab262ab3be5c8e2fe5f1d66656dbb270e`

See more details on using hashes here.

bithuman 1.12.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

bithuman

Install

Compatibility

What you get

Quickstart

CLI

Low-level API

Build from source

Performance

Linux wheels

Limitations

License

See also

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distributions

File details

File metadata

File hashes

File details

File metadata

File hashes

File details

File metadata

File hashes