The fastest inference framework to run AI on CPUs

Trillim

Trillim is a local AI stack for CPUs. It gives you a CLI, a Python SDK, and a FastAPI server for running Trillim-formatted LLM bundles, plus optional speech-to-text and text-to-speech support.

Trillim supports both BitNet-style ternary bundles and Bonsai 1-bit (binary) bundles through the same managed model store and runtime surfaces.
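
To make the ternary versus binary distinction concrete, here is a minimal sketch of the two weight encodings in plain Python. It uses the generic absmean and sign-based schemes as an illustration only; the function names are hypothetical, and the actual on-disk layout and scaling used by Trillim bundles are not described here and may differ.

```python
def quantize_ternary(weights):
    """Absmean ternary quantization: scale by mean |w|, round, clip to {-1, 0, 1}."""
    # Illustrative only: not the Trillim bundle format.
    scale = sum(abs(w) for w in weights) / len(weights) or 1.0
    codes = [max(-1, min(1, round(w / scale))) for w in weights]
    return codes, scale


def quantize_binary(weights):
    """Sign-based binary quantization: every weight becomes -1 or +1."""
    # Illustrative only: real 1-bit schemes often use per-group scales.
    scale = sum(abs(w) for w in weights) / len(weights) or 1.0
    codes = [1 if w >= 0 else -1 for w in weights]
    return codes, scale


def dequantize(codes, scale):
    """Recover approximate weights from the integer codes and the shared scale."""
    return [code * scale for code in codes]
```

A ternary code can represent "no contribution" with 0, while a binary code always keeps a sign, which is why the two bundle types need different kernels even though they share the same store and load flow.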

DarkNet and the quantization tooling bundled with the package do the heavy inference work. The Python package is the orchestration layer around those binaries.

Install

  • Python 3.12 or newer is required.
  • Linux wheels target glibc >= 2.27.
  • uv is the recommended installer.
  • Voice features require the optional voice extra.
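
The first two requirements above can be checked before installing. The following is a rough preflight sketch that uses only the Python standard library; the helper names are hypothetical and are not part of Trillim.

```python
import platform
import sys

MIN_PYTHON = (3, 12)  # from the requirement list above
MIN_GLIBC = (2, 27)   # Linux wheels target glibc >= 2.27


def parse_version(text):
    """Turn a string such as '2.35' into (2, 35), ignoring non-numeric parts."""
    parts = []
    for piece in text.split("."):
        digits = "".join(ch for ch in piece if ch.isdigit())
        if digits:
            parts.append(int(digits))
    return tuple(parts)


def check_environment(python_version=None, libc=None):
    """Return a list of problems; an empty list means the checks passed."""
    python_version = tuple(python_version or sys.version_info[:2])
    libc = libc if libc is not None else platform.libc_ver()
    problems = []
    if python_version < MIN_PYTHON:
        problems.append("Python 3.12 or newer is required")
    name, version = libc
    # Only glibc systems are checked; other platforms report an empty name.
    if name == "glibc" and version and parse_version(version) < MIN_GLIBC:
        problems.append("glibc 2.27 or newer is required for the Linux wheels")
    return problems
```

Calling `check_environment()` with no arguments inspects the running interpreter; passing explicit values makes the function easy to test.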

If you install Trillim with uv, prefix CLI commands with uv run.

Quick Start

Install the package:

uv add trillim

Pull a model and chat with it:

uv run trillim pull Trillim/BitNet-TRNQ
uv run trillim chat Trillim/BitNet-TRNQ

Start the local API server:

uv run trillim serve Trillim/BitNet-TRNQ

Use the Python SDK synchronously through Runtime:

from trillim import LLM, Runtime

with Runtime(LLM("Trillim/BitNet-TRNQ")) as runtime:
    reply = runtime.llm.chat(
        [{"role": "user", "content": "Give me one sentence about local CPU inference."}]
    )
    print(reply)

Common Workflows

Pull and Inspect Bundles

trillim models lists the bundles published by the Trillim Hugging Face organization, while trillim list shows the bundles you already have locally.

uv run trillim models
uv run trillim list

Quantize a Local Model or Adapter

trillim quantize takes raw local filesystem paths and publishes the output under ~/.trillim/models/Local/.

# Quantize a model bundle
uv run trillim quantize /path/to/model

# Quantize a LoRA adapter against its base model
uv run trillim quantize /path/to/base-model /path/to/adapter

Qwen3-based Bonsai checkpoints quantize into binary bundles, but Trillim still manages them under the same Local/...-TRNQ store naming and load flow.
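
To see what quantization has produced, you can inspect the output directory directly. The sketch below assumes that each bundle is a direct subdirectory of ~/.trillim/models/Local/; that layout is an assumption, and `list_local_bundles` is a hypothetical helper, not a Trillim API. The trillim list command remains the supported way to inspect the store.

```python
from pathlib import Path


def list_local_bundles(store_root=None):
    """Return the sorted names of subdirectories under the Local store root."""
    # Default root taken from the text above; the per-bundle layout is assumed.
    if store_root:
        root = Path(store_root)
    else:
        root = Path.home() / ".trillim" / "models" / "Local"
    if not root.is_dir():
        return []  # nothing has been quantized yet, or the store lives elsewhere
    return sorted(entry.name for entry in root.iterdir() if entry.is_dir())


if __name__ == "__main__":
    for name in list_local_bundles():
        print(f"Local/{name}")
```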

Use an Adapter

chat accepts an optional second positional argument for the adapter store ID:

uv run trillim chat Trillim/BitNet-TRNQ Trillim/BitNet-GenZ-LoRA-TRNQ
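
If you launch chat sessions from a script, the optional adapter argument maps naturally onto an optional function parameter. This is a small sketch that only assembles the command line shown above; `build_chat_command` is a hypothetical helper, not part of the Trillim SDK.

```python
def build_chat_command(model_id, adapter_id=None, use_uv=True):
    """Assemble the argv list for `trillim chat`, with an optional adapter ID."""
    command = ["uv", "run"] if use_uv else []
    command += ["trillim", "chat", model_id]
    if adapter_id:
        # The adapter store ID is the optional second positional argument.
        command.append(adapter_id)
    return command


# To start the session, pass the list to subprocess.run(command, check=True).
```

Building the command as a list rather than a single string avoids shell quoting problems when a store ID contains unusual characters.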

Enable Voice Support

Install the extra first:

uv add "trillim[voice]"

Then start the voice-enabled server:

uv run trillim serve Trillim/BitNet-TRNQ --voice

License

For the short license summary, see "What Is Trillim?". Full terms are in LICENSE.

Download files

Source Distributions

No source distribution files are available for this release.

Built Distributions

  • trillim-0.8.1-py3-none-win_arm64.whl (1.5 MB): Python 3, Windows ARM64
  • trillim-0.8.1-py3-none-win_amd64.whl (1.6 MB): Python 3, Windows x86-64
  • trillim-0.8.1-py3-none-manylinux_2_27_x86_64.manylinux2014_x86_64.whl (5.0 MB): Python 3, manylinux (glibc 2.27+), x86-64
  • trillim-0.8.1-py3-none-manylinux_2_27_aarch64.manylinux2014_aarch64.whl (5.0 MB): Python 3, manylinux (glibc 2.27+), ARM64
  • trillim-0.8.1-py3-none-macosx_11_0_x86_64.whl (1.3 MB): Python 3, macOS 11.0+, x86-64
  • trillim-0.8.1-py3-none-macosx_11_0_arm64.whl (1.3 MB): Python 3, macOS 11.0+, ARM64
