JAX library for optimization and export of models for use with the UZU inference engine.

Mirai

lalamo

A set of tools for adapting Large Language Models to on-device inference using the uzu inference engine.

Quick Start

To get the list of supported models, run:

uv run lalamo list-models

To convert a model, run:

uv run lalamo convert MODEL_REPO --precision float16

The converted model is written to the models folder. For more options, run uv run lalamo convert --help.
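When conversion is driven from a script rather than typed by hand, the two commands above can be wrapped in a few lines of Python. This is a minimal sketch, not part of lalamo: the helper names lalamo_command, convert_command, and run are hypothetical, and the only CLI arguments used are the ones shown in this section (list-models, convert, --precision).

```python
import subprocess


def lalamo_command(*args: str) -> list[str]:
    """Build the argument list for a `uv run lalamo ...` invocation."""
    return ["uv", "run", "lalamo", *args]


def convert_command(model_repo: str, precision: str = "float16") -> list[str]:
    """Build the `convert` invocation shown in the Quick Start."""
    return lalamo_command("convert", model_repo, "--precision", precision)


def run(command: list[str]) -> None:
    """Run the command; check=True raises CalledProcessError on a non-zero exit."""
    subprocess.run(command, check=True)
```

For example, run(lalamo_command("list-models")) lists the supported models, and run(convert_command("google/gemma-3-1b-it")) converts the Gemma repository used in the ModelSpec example. Passing the arguments as a list avoids shell quoting issues with repository names.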

Model Support

To add support for a new model, write the corresponding ModelSpec, as shown in the example below:

ModelSpec(
    vendor="Google",
    family="Gemma-3",
    name="Gemma-3-1B-Instruct",
    size="1B",
    quantization=None,
    repo="google/gemma-3-1b-it",
    config_type=HFGemma3TextConfig,
    config_file_name="config.json",
    weights_file_names=huggingface_weight_files(1),
    weights_type=WeightsType.SAFETENSORS,
    tokenizer_files=HUGGINGFACE_TOKENIZER_FILES,
    use_cases=tuple(),
)

Download files

Source Distribution

lalamo-0.2.2.tar.gz (43.4 kB)

Uploaded Source

Built Distribution

lalamo-0.2.2-py3-none-any.whl (63.1 kB)

Uploaded Python 3

File details

Details for the file lalamo-0.2.2.tar.gz.

File metadata

  • Download URL: lalamo-0.2.2.tar.gz
  • Size: 43.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.8.14

File hashes

Hashes for lalamo-0.2.2.tar.gz
Algorithm    Hash digest
SHA256       af7c924aab5cacb109c062405bd341b2da5d9d1352f83e9eb04338e16d886433
MD5          dd5d67f8df02f41a2d417b288a7a6701
BLAKE2b-256  1edc17c281059670a5bb7714d9e680dd711e891975af0361967bc0390521732a
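
A downloaded file can be checked against the published SHA256 digest before it is installed. The following is a minimal sketch using only the Python standard library; the function names sha256_of and matches_published_hash are hypothetical, and the location of the downloaded file is an assumption left to the caller.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as f:
        # iter() with a sentinel stops when read() returns empty bytes (EOF).
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: Path, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with a published hex digest, ignoring case."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For the source distribution, the call would be matches_published_hash(Path("lalamo-0.2.2.tar.gz"), "af7c924aab5cacb109c062405bd341b2da5d9d1352f83e9eb04338e16d886433"), assuming the archive sits in the current directory.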

File details

Details for the file lalamo-0.2.2-py3-none-any.whl.

File metadata

  • Download URL: lalamo-0.2.2-py3-none-any.whl
  • Size: 63.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.8.14

File hashes

Hashes for lalamo-0.2.2-py3-none-any.whl
Algorithm    Hash digest
SHA256       a2e3c050c610b22a4c939b0c3fb15c999aada21a279a0e66e6d7efe84baed93d
MD5          bf84637bff66ea03d978f88ee35a9658
BLAKE2b-256  dff146b41e3e4d92827ab0edbd8d24fc4f422c24f296d5b7e29e9127d8fbbd41
