OnnxSlim: A Toolkit to Help Optimize Onnx Model
Project description
OnnxSlim
OnnxSlim can help you slim your onnx model, with less operators, but same accuracy, better inference speed.
- 🚀 OnnxSlim is merged to mnn-llm, performance increased by 5%
- 🚀 Rank 1st in the AICAS 2024 LLM inference optimization challenge held by Arm and T-head
- 🚀 OnnxSlim is merged into ultralytics ❤️❤️❤️
- 🚀 OnnxSlim is merged into transformers.js 🤗🤗🤗
Installation
Using Prebuilt
pip install onnxslim
Install From Source
pip install git+https://github.com/inisis/OnnxSlim@main
Install From Local
git clone https://github.com/inisis/OnnxSlim && cd OnnxSlim/
pip install .
How to use
onnxslim your_onnx_model slimmed_onnx_model
For more usage, see onnxslim -h or refer to our examples
References
Contact
Discord: https://discord.gg/nRw2Fd3VUS QQ Group: 873569894
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
onnxslim-0.1.36.tar.gz
(119.2 kB
view details)
Built Distribution
onnxslim-0.1.36-py3-none-any.whl
(140.6 kB
view details)
File details
Details for the file onnxslim-0.1.36.tar.gz
.
File metadata
- Download URL: onnxslim-0.1.36.tar.gz
- Upload date:
- Size: 119.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.20
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c1203696ab9f88f9020c4ee3e749efb7b2edd30ebdb4d99769443a56a4489458 |
|
MD5 | b302998e9c663881f0cf472601b01ce0 |
|
BLAKE2b-256 | a69eb01acbf0a2149b53eb030f00d3cc397ab8fd4c2ced01552a0d7d55c1337e |
File details
Details for the file onnxslim-0.1.36-py3-none-any.whl
.
File metadata
- Download URL: onnxslim-0.1.36-py3-none-any.whl
- Upload date:
- Size: 140.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.20
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2f4775c1d42ffb72baa816282caed40690e1c458fd644759b41bd71ffc217b8c |
|
MD5 | b1d3c94788894ca27e44ffcea79c46a2 |
|
BLAKE2b-256 | 486f94482970e96df5189d9c947a8cb56da9f8763af36f7bc281ee3026f46734 |