ISIRO Runtime - BF16 LLM inference efficiency layer. https://isiro.ai
Project description
isiro-runtime
The ISIRO Runtime executes .TIC model artifacts with roughly 30% less BF16
memory traffic. Execution is bit-exact and uses no quantization.
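To put the stated ~30% figure in perspective, here is a rough back-of-the-envelope sketch. It assumes memory traffic is dominated by reading every BF16 weight once per pass, and it uses a hypothetical 7-billion-parameter model. Neither assumption comes from the project, and the helper names are illustrative only.

```python
BYTES_PER_BF16 = 2  # a BF16 value is 16 bits wide


def bf16_weight_bytes(num_params):
    """Bytes needed to read every BF16 weight once."""
    return num_params * BYTES_PER_BF16


def reduced_traffic_bytes(baseline_bytes, reduction=0.30):
    """Traffic left after applying a fractional reduction (0.30 = the ~30% claim)."""
    return baseline_bytes * (1.0 - reduction)


if __name__ == "__main__":
    params = 7_000_000_000  # hypothetical model size, not from the project
    baseline = bf16_weight_bytes(params)
    reduced = reduced_traffic_bytes(baseline)
    print(f"baseline: {baseline / 1e9:.1f} GB per weight pass")
    print(f"reduced:  {reduced / 1e9:.1f} GB per weight pass")
```

Under these assumptions, the baseline is 14.0 GB per weight pass and the reduced figure is 9.8 GB. Real savings depend on the workload and on how the runtime achieves the reduction, which this page does not describe.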
Full release coming soon at isiro.ai.
Install
pip install isiro-runtime
Download files
Source Distribution
isiro_runtime-0.0.1.tar.gz (774 Bytes)
Built Distribution
isiro_runtime-0.0.1-py2.py3-none-any.whl (1.3 kB)
File details
Details for the file isiro_runtime-0.0.1.tar.gz.
File metadata
- Download URL: isiro_runtime-0.0.1.tar.gz
- Upload date:
- Size: 774 Bytes
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | a0a8ec76ba8f0fc2fc38a323556aa1e9b28c90e106934ee32c735fa833d4f9c9 |
| MD5 | 1f04782836a260eb9e0985cfd313c0ed |
| BLAKE2b-256 | c5cc9fcb2ef1eb3634f4485a196d9fb9e7fe32bd17e9f2dfb85fb5b6d1fea241 |
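The published digests can be used to check a downloaded file before installing it. The following is a minimal sketch using Python's standard `hashlib`. The local file path and the helper names `sha256_of` and `matches_published_digest` are assumptions for illustration and are not part of isiro-runtime.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed above for isiro_runtime-0.0.1.tar.gz
EXPECTED_SHA256 = "a0a8ec76ba8f0fc2fc38a323556aa1e9b28c90e106934ee32c735fa833d4f9c9"


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path, expected=EXPECTED_SHA256):
    """True if the file's SHA256 equals the expected hex digest."""
    return sha256_of(path) == expected.lower()


if __name__ == "__main__":
    # Assumed location of the downloaded archive (hypothetical path).
    archive = Path("isiro_runtime-0.0.1.tar.gz")
    if archive.exists():
        print("OK" if matches_published_digest(archive) else "MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

The same check applies to the wheel by substituting its file name and its SHA256 digest.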
File details
Details for the file isiro_runtime-0.0.1-py2.py3-none-any.whl.
File metadata
- Download URL: isiro_runtime-0.0.1-py2.py3-none-any.whl
- Upload date:
- Size: 1.3 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 96f1755ffdc2f9ac14ad61cecf0556beeb8ef99b243c2b07f4183981a6138fcc |
| MD5 | 1f546f426c64e8e373eb4ab51e0418cc |
| BLAKE2b-256 | 3a258be4ac08da4f0d1cfe5b6f1ab59b333c70565a482d34e2551eccf118f298 |