LunaVox tooling CLI for model setup, conversion, quantization, and build workflows.

Project description

🌌 LunaVox: High-Performance C++ Inference Engine for Qwen3-TTS

Version Platform CoreML C++

LunaVox is a high-performance C++ inference engine built specifically for Qwen3-TTS. A streamlined architecture and deep hardware optimization deliver stable, low-latency TTS for embedded devices, desktop apps, and servers alike.

🚀 Key Features

Lightweight runtime — ONNX Runtime + a custom llama.cpp wrapper, no heavy Python required at inference time.
Native multi-language — automatic language detection across Chinese, English, Japanese, Korean, Russian, German, French, Italian, Spanish, Portuguese.
Unified Voice API — one engine.synthesize(text, voice, params) covers Base, Voice Cloning, Custom Voice, and Voice Design.
HTTP + WebSocket serving (lunavox serve): FastAPI app with POST /v1/synth and streaming WS /v1/stream — see serve guide.
Desktop GUI (lunavox gui): customtkinter app (Synthesize / Library / Settings) driving the same in-process engine.
Profile-driven CLI — layered ~/.lunavox/config.toml / env / flag precedence so lunavox --profile quality synth … is a one-liner.
Cross-platform hardware acceleration — CUDA (NVIDIA), CoreML/Metal (Apple), DML (DirectX 12), and Vulkan.

🛠️ Requirements

Windows 10/11 (VS 2022/2025), Linux Ubuntu 22.04+ (GCC ≥ 9.0), or macOS 12+ on Apple Silicon
CMake 3.16+ (Ninja recommended) and a compatible C++17 compiler
Python 3.10+ for the CLI and conversion toolchain

📊 Performance

Configuration	TTFB (ms)	RTF	Peak RAM	VRAM	Speedup
Official PyTorch Baseline (CPU)	—	5.066	5.06 GB	—	1.00×
Official PyTorch Baseline (GPU)	—	3.788	1.59 GB	2.29 GB	1.34×
LunaVox (Full CPU)	1248	0.858	1.19 GB	—	5.90×
LunaVox (CUDA 13)	175	0.213	1.41 GB	1.33 GB	23.78×
LunaVox (Vulkan + DML)	194	0.152	0.97 GB	1.00 GB	33.33×

Model Qwen3-TTS-12Hz-0.6B-Base with ref/ref_0.6B.json cloning on Intel i9-12900K + RTX 3090 / Windows 11 — 5 warmup + 100 measurement runs on a fixed 25-word English sentence. Full per-run distribution in benchmark/report.md; detailed analysis in Windows performance report.

📦 Install & quick start

pip install lunavox               # core CLI + GUI + HTTP/WebSocket server (default)
pip install "lunavox[convert]"    # + source → GGUF conversion toolchain (heavy, optional)

lunavox bootstrap                 # one-key: pull model + libs + build + smoke test

Prefer step-by-step? Run lunavox model pull, lunavox build libs, then lunavox build --clean. For CUDA see CUDA on Windows. Full command reference: CLI manual.

🎙️ Synthesis

lunavox synth "Hello from LunaVox." -o out.wav                       # base voice
lunavox synth "…" --voice clone  --ref ref/ref_0.6B.json -o out.wav  # voice cloning
lunavox synth "…" --voice custom --speaker Vivian --instruct "…"     # catalog speaker
lunavox synth "…" --voice design --instruct "A warm, calm narrator." # text-to-voice design
lunavox gui                                                          # desktop GUI

The standalone ./build/lunavox-cli works the same way in Python-free environments. Full mode documentation: usage tutorial.

Embedded Python

from lunavox.runtime import Engine, SynthesisParams, Voice

with Engine("models/base_small") as engine:
    result = engine.synthesize(
        "Hello from LunaVox.",
        voice=Voice.clone_file("ref/ref_0.6B.json"),
        params=SynthesisParams(temperature=0.7),
    )
    print(f"RTF {result.stats.rtf:.3f}")  # result.audio is a float32 [-1, 1] mono array

📈 Observability

--stats-json report.json — RTF + memory breakdown per synthesis
logs/latest.log — build and runtime output
-j N — CPU thread count (default 4)

📖 Documentation

Full bilingual docs — guide, CLI reference, technical details, benchmarks, Python API — are published at https://lux-luna.github.io/LunaVox/. Local preview:

pip install -e ".[dev]"
mkdocs serve

Release history: CHANGELOG.md.

🙏 Acknowledgements

Qwen3-TTS — base weights and architecture
onnxruntime — audio decoding backend
llama.cpp — LLM sequence prediction core

Project details

Release history Release notifications | RSS feed

This version

2.2.2

Apr 17, 2026

2.2.1

Apr 17, 2026

2.2.0

Apr 15, 2026

2.1.6

Mar 27, 2026

2.1.5

Mar 27, 2026

2.1.4

Mar 27, 2026

2.1.3

Mar 27, 2026

2.1.2

Mar 27, 2026

2.1.1

Mar 27, 2026

2.1.0

Mar 27, 2026

2.0.0

Mar 27, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lunavox-2.2.2.tar.gz (325.7 kB view details)

Uploaded Apr 17, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

lunavox-2.2.2-py3-none-any.whl (344.3 kB view details)

Uploaded Apr 17, 2026 Python 3

File details

Details for the file lunavox-2.2.2.tar.gz.

File metadata

Download URL: lunavox-2.2.2.tar.gz
Upload date: Apr 17, 2026
Size: 325.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.11

File hashes

Hashes for lunavox-2.2.2.tar.gz
Algorithm	Hash digest
SHA256	`4c693c4743745a44778189d7a9833542c0b341bff5a12ebbe38ce560dd241069`
MD5	`e50f7ae0a36d9ecc642ec141fd4eec8b`
BLAKE2b-256	`f61985f8676f90050ae561787c6e4945413efec5ebcd7f57f6aef5fa38fd4fb0`

See more details on using hashes here.

File details

Details for the file lunavox-2.2.2-py3-none-any.whl.

File metadata

Download URL: lunavox-2.2.2-py3-none-any.whl
Upload date: Apr 17, 2026
Size: 344.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.11

File hashes

Hashes for lunavox-2.2.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`da3bc4f16766c0a7b3c34bb3b2db93ea6410a34def555f048686cffeb7740b84`
MD5	`7d9642672f6554843cc0003186579b7e`
BLAKE2b-256	`dcfeced912387b53f4a252a5e7142c816fbce9f3dd354d23926510390399ec92`

See more details on using hashes here.

lunavox 2.2.2

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

🌌 LunaVox: High-Performance C++ Inference Engine for Qwen3-TTS

🚀 Key Features

🛠️ Requirements

📊 Performance

📦 Install & quick start

🎙️ Synthesis

Embedded Python

📈 Observability

📖 Documentation

🙏 Acknowledgements

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes