Skip to main content

The fastest inference framework to run AI on CPUs

Project description

Trillim

Trillim is a local AI stack for CPUs. It gives you a CLI, a Python SDK, and a FastAPI server for running Trillim-formatted LLM bundles, plus optional speech-to-text and text-to-speech support.

Trillim supports both BitNet-style ternary bundles and Bonsai 1-bit (binary) bundles through the same managed model store and runtime surfaces.

DarkNet and the quantization tooling bundled with the package do the heavy inference work. The Python package is the orchestration layer around those binaries.

Install

  • Python 3.12 or newer is required.
  • Linux wheels target glibc >= 2.27.
  • uv is the recommended installer.
  • Voice features require the optional voice extra.

Platform guides:

If you install Trillim with uv, prefix CLI commands with uv run.

Quick Start

Install the package:

uv add trillim

Pull a model and chat with it:

uv run trillim pull Trillim/BitNet-TRNQ
uv run trillim chat Trillim/BitNet-TRNQ

Start the local API server:

uv run trillim serve Trillim/BitNet-TRNQ

Use the Python SDK synchronously through Runtime:

from trillim import LLM, Runtime

with Runtime(LLM("Trillim/BitNet-TRNQ")) as runtime:
    reply = runtime.llm.chat(
        [{"role": "user", "content": "Give me one sentence about local CPU inference."}]
    )
    print(reply)

Common Workflows

Pull and Inspect Bundles

trillim models lists bundles published by the Trillim Hugging Face organization. trillim list lists what you already have locally.

uv run trillim models
uv run trillim list

Quantize a Local Model or Adapter

trillim quantize takes raw local filesystem paths and publishes the output under ~/.trillim/models/Local/.

# Quantize a model bundle
uv run trillim quantize /path/to/model

# Quantize a LoRA adapter against its base model
uv run trillim quantize /path/to/base-model /path/to/adapter

Qwen3-based Bonsai checkpoints quantize into binary bundles, but Trillim still manages them under the same Local/...-TRNQ store naming and load flow.

Use an Adapter

chat accepts an optional second positional argument for the adapter store ID:

uv run trillim chat Trillim/BitNet-TRNQ Trillim/BitNet-GenZ-LoRA-TRNQ

Enable Voice Support

Install the extra first:

uv add "trillim[voice]"

Then start the voice-enabled server:

uv run trillim serve Trillim/BitNet-TRNQ --voice

Documentation

Learn

Extend and Serve

Advanced

License

For the short license summary, see What Is Trillim?. Full terms are in LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

trillim-0.8.0-py3-none-win_arm64.whl (1.5 MB view details)

Uploaded Python 3Windows ARM64

trillim-0.8.0-py3-none-win_amd64.whl (1.6 MB view details)

Uploaded Python 3Windows x86-64

trillim-0.8.0-py3-none-manylinux_2_27_x86_64.manylinux2014_x86_64.whl (5.0 MB view details)

Uploaded Python 3manylinux: glibc 2.27+ x86-64

trillim-0.8.0-py3-none-manylinux_2_27_aarch64.manylinux2014_aarch64.whl (5.0 MB view details)

Uploaded Python 3manylinux: glibc 2.27+ ARM64

trillim-0.8.0-py3-none-macosx_11_0_x86_64.whl (1.3 MB view details)

Uploaded Python 3macOS 11.0+ x86-64

trillim-0.8.0-py3-none-macosx_11_0_arm64.whl (1.3 MB view details)

Uploaded Python 3macOS 11.0+ ARM64

File details

Details for the file trillim-0.8.0-py3-none-win_arm64.whl.

File metadata

  • Download URL: trillim-0.8.0-py3-none-win_arm64.whl
  • Upload date:
  • Size: 1.5 MB
  • Tags: Python 3, Windows ARM64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for trillim-0.8.0-py3-none-win_arm64.whl
Algorithm Hash digest
SHA256 7dd6ef767b88959c1ab48604bb3d6fcaf793ea6f357c69ddb3f5ba2d4d15b70c
MD5 d7c89c8e064bccd5366a22d925e12d4a
BLAKE2b-256 ac8fb9dadd79bc2be90cee536d1650d43087b9eda311c90c18873202690663c3

See more details on using hashes here.

File details

Details for the file trillim-0.8.0-py3-none-win_amd64.whl.

File metadata

  • Download URL: trillim-0.8.0-py3-none-win_amd64.whl
  • Upload date:
  • Size: 1.6 MB
  • Tags: Python 3, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for trillim-0.8.0-py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 8b5b6b3ec25bf7f1ca6cefa3f1dd6beeb3ebfc36f8e61e3db8e95bbe4f88adda
MD5 b5f4e1f316d80c2e9dfc1e5bbe16e28d
BLAKE2b-256 20cb439e1a4fd53fcfa8f35629fff6ec3535dc054708592e3567d8f5dd2e5fd0

See more details on using hashes here.

File details

Details for the file trillim-0.8.0-py3-none-manylinux_2_27_x86_64.manylinux2014_x86_64.whl.

File metadata

  • Download URL: trillim-0.8.0-py3-none-manylinux_2_27_x86_64.manylinux2014_x86_64.whl
  • Upload date:
  • Size: 5.0 MB
  • Tags: Python 3, manylinux: glibc 2.27+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for trillim-0.8.0-py3-none-manylinux_2_27_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 c6852bf0657037994b2d206d309a30922e2333c5978e175b3f78b04a8090ebf5
MD5 30d247b9d16c4f45228d8c00362f687a
BLAKE2b-256 2b39a30418dd9def1e0921a377c6306dfd076a67e25a09b155ba18b79d7806bb

See more details on using hashes here.

File details

Details for the file trillim-0.8.0-py3-none-manylinux_2_27_aarch64.manylinux2014_aarch64.whl.

File metadata

  • Download URL: trillim-0.8.0-py3-none-manylinux_2_27_aarch64.manylinux2014_aarch64.whl
  • Upload date:
  • Size: 5.0 MB
  • Tags: Python 3, manylinux: glibc 2.27+ ARM64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for trillim-0.8.0-py3-none-manylinux_2_27_aarch64.manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 05ecb442a794cd98856f04b5e6c5f2ede609c1e8dc037435a362dd2b837c1378
MD5 67de28f7c78de16831ade753edbb76e9
BLAKE2b-256 c30ea6c4bbbddf5746252d31d36171ba65de408f4ef081b9658bda99b90cc09f

See more details on using hashes here.

File details

Details for the file trillim-0.8.0-py3-none-macosx_11_0_x86_64.whl.

File metadata

  • Download URL: trillim-0.8.0-py3-none-macosx_11_0_x86_64.whl
  • Upload date:
  • Size: 1.3 MB
  • Tags: Python 3, macOS 11.0+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for trillim-0.8.0-py3-none-macosx_11_0_x86_64.whl
Algorithm Hash digest
SHA256 da2dd0bc79988e492b31b4abfbd22fa8b1eaaefb56769254f3f9e6eb3b2fc403
MD5 f43b076fce5f4af174dda7e8b4cf0be5
BLAKE2b-256 b5eaf33875a553e36b41919ea0352a07af7a49438127225a72022331a816f44c

See more details on using hashes here.

File details

Details for the file trillim-0.8.0-py3-none-macosx_11_0_arm64.whl.

File metadata

  • Download URL: trillim-0.8.0-py3-none-macosx_11_0_arm64.whl
  • Upload date:
  • Size: 1.3 MB
  • Tags: Python 3, macOS 11.0+ ARM64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for trillim-0.8.0-py3-none-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 b76385f8d903d6cf80b85bb7ec930b34869c62d3ce853a10aff760b3d37a15da
MD5 cc1b4dfed4cc914b71659cc64f943731
BLAKE2b-256 a1e36ff2e93802849209fe792419397d5b0d94f045061d5fb3fafac5088c6d00

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page