Skip to main content

ISIRO Runtime - BF16 LLM inference efficiency layer. https://isiro.ai

Project description

isiro-runtime

The ISIRO Runtime executes .TIC model artifacts with ~30% BF16 memory traffic reduction. Bit-exact. No quantization.

Full release coming soon at isiro.ai.

Install

pip install isiro-runtime

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

isiro_runtime-0.0.1.tar.gz (774 Bytes view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

isiro_runtime-0.0.1-py2.py3-none-any.whl (1.3 kB view details)

Uploaded Python 2Python 3

File details

Details for the file isiro_runtime-0.0.1.tar.gz.

File metadata

  • Download URL: isiro_runtime-0.0.1.tar.gz
  • Upload date:
  • Size: 774 Bytes
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for isiro_runtime-0.0.1.tar.gz
Algorithm Hash digest
SHA256 a0a8ec76ba8f0fc2fc38a323556aa1e9b28c90e106934ee32c735fa833d4f9c9
MD5 1f04782836a260eb9e0985cfd313c0ed
BLAKE2b-256 c5cc9fcb2ef1eb3634f4485a196d9fb9e7fe32bd17e9f2dfb85fb5b6d1fea241

See more details on using hashes here.

File details

Details for the file isiro_runtime-0.0.1-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for isiro_runtime-0.0.1-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 96f1755ffdc2f9ac14ad61cecf0556beeb8ef99b243c2b07f4183981a6138fcc
MD5 1f546f426c64e8e373eb4ab51e0418cc
BLAKE2b-256 3a258be4ac08da4f0d1cfe5b6f1ab59b333c70565a482d34e2551eccf118f298

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page