OnnxSlim: A Toolkit to Help Optimize Onnx Model
OnnxSlim
OnnxSlim helps you slim your ONNX model: fewer operators, the same accuracy, and better inference speed.
- 🚀 OnnxSlim is merged into mnn-llm, increasing performance by 5%
- 🚀 Ranked 1st in the AICAS 2024 LLM inference optimization challenge held by Arm and T-head
- 🚀 OnnxSlim is merged into ultralytics ❤️❤️❤️
- 🚀 OnnxSlim is merged into transformers.js 🤗🤗🤗
Installation
Using Prebuilt
pip install onnxslim
Install From Source
pip install git+https://github.com/inisis/OnnxSlim@main
Install From Local
git clone https://github.com/inisis/OnnxSlim && cd OnnxSlim/
pip install .
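Whichever install route is used, it can be worth confirming that the package is visible to the interpreter. The following is a minimal sketch using only the Python standard library; the helper name `installed_version` is hypothetical and is not part of OnnxSlim.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package="onnxslim"):
    """Return the installed version string, or None if the package is absent.

    `installed_version` is an illustrative helper, not an OnnxSlim API.
    """
    try:
        return version(package)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version()
    print(f"onnxslim {found}" if found else "onnxslim is not installed")
```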
How to use
onnxslim your_onnx_model slimmed_onnx_model
For more options, run onnxslim -h or refer to our examples.
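The command line form shown above can also be driven from a script. The sketch below assumes only the documented invocation (input path followed by output path) and that the onnxslim executable is on PATH; the helper names `build_slim_command` and `slim_model` are hypothetical wrappers, not functions provided by OnnxSlim.

```python
import shutil
import subprocess


def build_slim_command(input_model, output_model):
    """Build the argument list for: onnxslim your_onnx_model slimmed_onnx_model."""
    return ["onnxslim", str(input_model), str(output_model)]


def slim_model(input_model, output_model):
    """Run the onnxslim CLI and raise if it is missing or exits with an error."""
    if shutil.which("onnxslim") is None:
        raise FileNotFoundError(
            "onnxslim executable not found; install it first (pip install onnxslim)"
        )
    # check=True raises subprocess.CalledProcessError on a non-zero exit code.
    return subprocess.run(build_slim_command(input_model, output_model), check=True)
```

Calling `slim_model("your_onnx_model.onnx", "slimmed_onnx_model.onnx")` is then equivalent to typing the command by hand. Any further flags should be taken from `onnxslim -h` rather than guessed.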
Contact
Discord: https://discord.gg/nRw2Fd3VUS
QQ Group: 873569894
Source Distribution
onnxslim-0.1.37.tar.gz (119.7 kB)
Built Distribution
onnxslim-0.1.37-py3-none-any.whl (141.7 kB)
File details
Details for the file onnxslim-0.1.37.tar.gz.
File metadata
- Download URL: onnxslim-0.1.37.tar.gz
- Upload date:
- Size: 119.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.20
File hashes
Algorithm | Hash digest
---|---
SHA256 | 96ff49d554edda22db0c8de3a7cd83fad4db1be8a86e9f1848e517a147bb832c
MD5 | 6abc7fe7a530ad6dbd5d0673103fc1de
BLAKE2b-256 | 88c3c2eaf8522d4d594900057093280a032be38b9f8e6d7950bef2d02ef01b3e
File details
Details for the file onnxslim-0.1.37-py3-none-any.whl.
File metadata
- Download URL: onnxslim-0.1.37-py3-none-any.whl
- Upload date:
- Size: 141.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.20
File hashes
Algorithm | Hash digest
---|---
SHA256 | 7fa53d5f4ac984b58948502197fc910e110ea21d9981986f2a18c991ce82e11b
MD5 | 488b48760dcc2890861ee887b4729690
BLAKE2b-256 | ddf36e8fc1739fc93f6f0c5a161e554c2dc98f612a36436cd756005f6fd16bea
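The SHA256 digests listed for the two files can be checked against a downloaded copy before installing it. The following is a minimal sketch using Python's standard `hashlib`; the helper names `sha256_of_file` and `matches_published_digest` are hypothetical, and the local file path is whatever location the archive was saved to.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops once read() returns an empty bytes object.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path, expected_hex):
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For the source distribution, the expected value would be the SHA256 entry from its table, passed as the second argument to `matches_published_digest` along with the path to the downloaded onnxslim-0.1.37.tar.gz.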