Skip to main content

The fastest inference framework to run AI on CPUs

Project description

Trillim

Trillim is a local AI stack for CPUs. It gives you a CLI, a Python SDK, and a FastAPI server for running Trillim-formatted LLM bundles, plus optional speech-to-text and text-to-speech support.

Trillim supports both BitNet-style ternary bundles and PrismML's Bonsai (1-bit and ternary) bundles through the same managed model store and runtime surfaces.

DarkNet and the quantization tooling bundled with the package do the heavy inference work. The Python package is the orchestration layer around those binaries.

Install

  • Python 3.12 or newer is required.
  • Linux wheels target glibc >= 2.27.
  • uv is the recommended installer.
  • Voice features require the optional voice extra.

Platform guides:

If you install Trillim with uv, prefix CLI commands with uv run.

Quick Start

Install the package:

uv add trillim

Pull a model and chat with it:

uv run trillim pull Trillim/BitNet-TRNQ
uv run trillim chat Trillim/BitNet-TRNQ

Start the local API server:

uv run trillim serve Trillim/BitNet-TRNQ

Use the Python SDK synchronously through Runtime:

from trillim import LLM, Runtime

with Runtime(LLM("Trillim/BitNet-TRNQ")) as runtime:
    reply = runtime.llm.chat(
        [{"role": "user", "content": "Give me one sentence about local CPU inference."}]
    )
    print(reply)

Common Workflows

Pull and Inspect Bundles

trillim models lists bundles published by the Trillim Hugging Face organization. trillim list lists what you already have locally.

uv run trillim models
uv run trillim list

Quantize a Local Model or Adapter

trillim quantize takes raw local filesystem paths and publishes the output under ~/.trillim/models/Local/.

# Quantize a model bundle
uv run trillim quantize /path/to/model

# Quantize a LoRA adapter against its base model
uv run trillim quantize /path/to/base-model /path/to/adapter

Qwen3-based Bonsai checkpoints quantize into binary or grouped-ternary bundles, but Trillim still manages them under the same Local/...-TRNQ store naming and load flow.

Use an Adapter

chat accepts an optional second positional argument for the adapter store ID:

uv run trillim chat Trillim/BitNet-TRNQ Trillim/BitNet-GenZ-LoRA-TRNQ

Enable Voice Support

Install the extra first:

uv add "trillim[voice]"

Then start the voice-enabled server:

uv run trillim serve Trillim/BitNet-TRNQ --voice

Documentation

Learn

Extend and Serve

Advanced

License

For the short license summary, see What Is Trillim?. Full terms are in LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

trillim-0.9.0-py3-none-win_arm64.whl (1.5 MB view details)

Uploaded Python 3Windows ARM64

trillim-0.9.0-py3-none-win_amd64.whl (1.6 MB view details)

Uploaded Python 3Windows x86-64

trillim-0.9.0-py3-none-manylinux_2_27_x86_64.manylinux2014_x86_64.whl (5.0 MB view details)

Uploaded Python 3manylinux: glibc 2.27+ x86-64

trillim-0.9.0-py3-none-manylinux_2_27_aarch64.manylinux2014_aarch64.whl (5.1 MB view details)

Uploaded Python 3manylinux: glibc 2.27+ ARM64

trillim-0.9.0-py3-none-macosx_11_0_x86_64.whl (1.3 MB view details)

Uploaded Python 3macOS 11.0+ x86-64

trillim-0.9.0-py3-none-macosx_11_0_arm64.whl (1.3 MB view details)

Uploaded Python 3macOS 11.0+ ARM64

File details

Details for the file trillim-0.9.0-py3-none-win_arm64.whl.

File metadata

  • Download URL: trillim-0.9.0-py3-none-win_arm64.whl
  • Upload date:
  • Size: 1.5 MB
  • Tags: Python 3, Windows ARM64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.6 {"installer":{"name":"uv","version":"0.11.6","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for trillim-0.9.0-py3-none-win_arm64.whl
Algorithm Hash digest
SHA256 974b740d5950567836afdd38d8f25891967b7a306ead054d600aa776048f10c0
MD5 89c83279998e53cdefebfe10977c570e
BLAKE2b-256 0ccfa501b6e47e47f0bbe242d460a30f09a2ad4c4b1e1e8d819ff49272d587d9

See more details on using hashes here.

File details

Details for the file trillim-0.9.0-py3-none-win_amd64.whl.

File metadata

  • Download URL: trillim-0.9.0-py3-none-win_amd64.whl
  • Upload date:
  • Size: 1.6 MB
  • Tags: Python 3, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.6 {"installer":{"name":"uv","version":"0.11.6","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for trillim-0.9.0-py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 07310224dfce056ede14abd5f9edee5a9fc20126df242e6c8723aff2abe047d7
MD5 dfe62f6e2b42f0a98cc181e8ba38a737
BLAKE2b-256 09cdef47b23e904c121b249576e427c7cca1f11598da17862081ba4f681f9a15

See more details on using hashes here.

File details

Details for the file trillim-0.9.0-py3-none-manylinux_2_27_x86_64.manylinux2014_x86_64.whl.

File metadata

  • Download URL: trillim-0.9.0-py3-none-manylinux_2_27_x86_64.manylinux2014_x86_64.whl
  • Upload date:
  • Size: 5.0 MB
  • Tags: Python 3, manylinux: glibc 2.27+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.6 {"installer":{"name":"uv","version":"0.11.6","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for trillim-0.9.0-py3-none-manylinux_2_27_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 83ad03692fda180b340eefe18d01cf9cc3cde9bb1b4112494986fa3b6f9691ac
MD5 57daa7093bf5c4af0188ddf39a9fd3bc
BLAKE2b-256 2027002d40868ddb9449f79124abbcad16b061c5bc7418113f9452076d14a7e1

See more details on using hashes here.

File details

Details for the file trillim-0.9.0-py3-none-manylinux_2_27_aarch64.manylinux2014_aarch64.whl.

File metadata

  • Download URL: trillim-0.9.0-py3-none-manylinux_2_27_aarch64.manylinux2014_aarch64.whl
  • Upload date:
  • Size: 5.1 MB
  • Tags: Python 3, manylinux: glibc 2.27+ ARM64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.6 {"installer":{"name":"uv","version":"0.11.6","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for trillim-0.9.0-py3-none-manylinux_2_27_aarch64.manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 91022b32b6ec7fe353a9fbcc8af478e99ae4110c2120978f0ebd15dc8057047f
MD5 1a0c0de71724426797016399a09e234a
BLAKE2b-256 3a192fc1ead182d1b39ac8da0b0327dd734ec0c528127f72c25d578c38fe9678

See more details on using hashes here.

File details

Details for the file trillim-0.9.0-py3-none-macosx_11_0_x86_64.whl.

File metadata

  • Download URL: trillim-0.9.0-py3-none-macosx_11_0_x86_64.whl
  • Upload date:
  • Size: 1.3 MB
  • Tags: Python 3, macOS 11.0+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.6 {"installer":{"name":"uv","version":"0.11.6","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for trillim-0.9.0-py3-none-macosx_11_0_x86_64.whl
Algorithm Hash digest
SHA256 e7078c9fe2a1024f5fd05a60aa9d6fc8e0b1da6c0acec617f91f6119ed5e478b
MD5 4300dfdefdd4556d61d8584bb3b8a048
BLAKE2b-256 a8ede14524707bc91c6e19c469dd1d43883a081cac6ec83258a8e30230a69234

See more details on using hashes here.

File details

Details for the file trillim-0.9.0-py3-none-macosx_11_0_arm64.whl.

File metadata

  • Download URL: trillim-0.9.0-py3-none-macosx_11_0_arm64.whl
  • Upload date:
  • Size: 1.3 MB
  • Tags: Python 3, macOS 11.0+ ARM64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.6 {"installer":{"name":"uv","version":"0.11.6","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for trillim-0.9.0-py3-none-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 556f3db5a1e008368363bada6f65673ff5657e7518da160c9ecd75f822a9f6f7
MD5 d2884723602338633b72c5c1b0a71716
BLAKE2b-256 6b85b9d3070cadb00c2f4abccd0109d74c4e97ba00c3da7a3277261d9b3b53ac

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page