Simultaneous Speech-to-Speech Translation Without Aligned Data.

Hibiki-Zero

Hibiki-Zero is a real-time, multilingual speech translation model. It translates French, Spanish, Portuguese and German speech into English accurately, with low latency, high audio quality, and voice transfer.

https://github.com/user-attachments/assets/d533ec45-8d5e-4e41-886a-0b2d198be6f3

🤗 Hugging Face Model Card | ⚙️ Tech report | 📄 Paper | 🎧 More samples

Requirements

Hibiki-Zero is a 3B-parameter model and requires an NVIDIA GPU to run: 8 GB VRAM should work, 12 GB is safe.
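
To check a machine against this guidance before installing, a minimal sketch along the following lines may help. It assumes nvidia-smi is on the PATH. The vram_verdict helper and its 8000 / 12000 MiB cutoffs are illustrative choices approximating the 8 GB and 12 GB figures above, not part of Hibiki-Zero.

```shell
# Hypothetical helper: classify a GPU memory size (in MiB) against the
# guidance above. The cutoffs are approximations chosen for this sketch.
vram_verdict() {
  mib="$1"
  # Anything that is not a plain non-negative integer (e.g. "[N/A]") is unknown.
  case "$mib" in
    ''|*[!0-9]*) echo "unknown"; return ;;
  esac
  if [ "$mib" -ge 12000 ]; then
    echo "safe"
  elif [ "$mib" -ge 8000 ]; then
    echo "should work"
  else
    echo "likely too small"
  fi
}

# Query the total memory of each visible GPU, one value per line, in MiB.
if command -v nvidia-smi >/dev/null 2>&1; then
  nvidia-smi --query-gpu=memory.total --format=csv,noheader,nounits |
    while read -r mib; do
      echo "GPU with ${mib} MiB: $(vram_verdict "$mib")"
    done
else
  echo "nvidia-smi not found; cannot check GPU memory"
fi
```

The reported total is what the card has, not what is currently free, so other processes using the GPU can still cause out-of-memory errors.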

Run the server

Hibiki-Zero comes with a server that lets you interact with the model in real time. To start it, run:

uvx -p 3.13 hibiki-zero serve [--gradio-tunnel]

Then go to the URL displayed to try out Hibiki-Zero. The --gradio-tunnel flag will forward the server to a public URL that you can access from anywhere.

If you don't have uv, you must first install hibiki-zero with pip install hibiki-zero and then run the server with hibiki-zero serve [--gradio-tunnel].

Run inference

If you'd like to run Hibiki-Zero on existing audio files, run:

uvx -p 3.13 hibiki-zero generate [--file /path/to/my/audio.wav --file /path/to/another/audio.mp3]

Batch inference is supported, meaning you can run the model on multiple audio files at the same time.
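
For a whole directory of recordings, the repeated --file arguments can be assembled with a small shell function. The sketch below is a dry run: it only prints the command it would run. The name print_generate_cmd and the restriction to .wav files are assumptions made for illustration.

```shell
# Hypothetical helper: print a generate command with one --file argument
# per .wav file found directly inside the given directory.
print_generate_cmd() {
  dir="$1"
  set --                      # reset this function's own argument list
  for f in "$dir"/*.wav; do
    [ -e "$f" ] || continue   # skip the unexpanded pattern when nothing matches
    set -- "$@" --file "$f"
  done
  # Print only; paths are not re-quoted, so treat this as a preview.
  echo uvx -p 3.13 hibiki-zero generate "$@"
}

# Preview the command for the current directory.
print_generate_cmd .
```

To actually run the batch, drop the echo so that the quoted "$@" is passed straight to uvx; this keeps paths containing spaces intact.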

Local development

We recommend using uv: run anything in this repository with uv run. For example:

uv run some_file.py
or
uv run hibiki-zero serve

If you use pip, run pip install -e . before executing Python commands.

Download files

Source Distribution

hibiki_zero-0.0.4.tar.gz (7.2 MB)

Uploaded Source

Built Distribution

hibiki_zero-0.0.4-py3-none-any.whl (3.6 MB)

Uploaded Python 3

File details

Details for the file hibiki_zero-0.0.4.tar.gz.

File metadata

  • Download URL: hibiki_zero-0.0.4.tar.gz
  • Size: 7.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for hibiki_zero-0.0.4.tar.gz
Algorithm     Hash digest
SHA256        4749b84079e1e4aad98c87bbf90d550cf22b186effdc578fbae3a89e93894df4
MD5           725fc1159500bee46c69f2405366360d
BLAKE2b-256   a575f593c72623ff1cb0227398d443b637038f62ee908319ef84c8a9affc4bff

Provenance

The following attestation bundles were made for hibiki_zero-0.0.4.tar.gz:

Publisher: publish-package.yml on kyutai-labs/hibiki-zero

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file hibiki_zero-0.0.4-py3-none-any.whl.

File metadata

  • Download URL: hibiki_zero-0.0.4-py3-none-any.whl
  • Size: 3.6 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for hibiki_zero-0.0.4-py3-none-any.whl
Algorithm     Hash digest
SHA256        91787bc1ca3a01f84f5e92fa861bd7cdd3c4b19d1941741d7a7b9e6a0becf65c
MD5           89503ba2da2b0988d27fd46ff5f32aa6
BLAKE2b-256   e1338d8e32671b85efb58ed579818839825ee3a2575c0e9616688fea55908005

Provenance

The following attestation bundles were made for hibiki_zero-0.0.4-py3-none-any.whl:

Publisher: publish-package.yml on kyutai-labs/hibiki-zero
