JAX library for optimization and export of models for use with the UZU inference engine.
Project description
lalamo
A set of tools for adapting Large Language Models to on-device inference using the uzu inference engine.
Quick Start
To get the list of supported models, run:
uv run lalamo list-models
To convert a model, run:
uv run lalamo convert MODEL_REPO --precision float16
After that, you can find the converted model in the models folder. For more options see uv run lalamo convert --help.
Model Support
To add support for a new model, write the corresponding ModelSpec, as shown in the example below:
ModelSpec(
vendor="Google",
family="Gemma-3",
name="Gemma-3-1B-Instruct",
size="1B",
quantization=None,
repo="google/gemma-3-1b-it",
config_type=HFGemma3TextConfig,
config_file_name="config.json",
weights_file_names=huggingface_weight_files(1),
weights_type=WeightsType.SAFETENSORS,
tokenizer_files=HUGGINGFACE_TOKENIZER_FILES,
use_cases=tuple(),
)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file lalamo-0.2.7.tar.gz.
File metadata
- Download URL: lalamo-0.2.7.tar.gz
- Upload date:
- Size: 43.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: uv/0.8.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4c8eb0eb950dcbfe0710f4124a236a119829655adb3a1eff05fc46b994d2466b
|
|
| MD5 |
9bcc1f3259fe2327b0b50868556a61f4
|
|
| BLAKE2b-256 |
2843bc157e2808069fe0c2e15fee14163327b70b1a347f21d49d1fcc8eda558e
|
File details
Details for the file lalamo-0.2.7-py3-none-any.whl.
File metadata
- Download URL: lalamo-0.2.7-py3-none-any.whl
- Upload date:
- Size: 63.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: uv/0.8.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
78b17243ac7a033861d0c9226089724fceadca4498d2b80df12e5c3a5c3d66c1
|
|
| MD5 |
0a8f03d09e8de9d0593a5b3c527a4c6e
|
|
| BLAKE2b-256 |
40927d8ff0825d5523d9ddc12cf8ad103a7a234e10e3fbbf1a41c279dfecadb5
|