OnnxSlim: A Toolkit to Help Optimize Onnx Model
Project description
OnnxSlim
OnnxSlim can help you slim your onnx model, with less operators, but same accuracy, better inference speed.
- 🚀 OnnxSlim is merged to mnn-llm, performance increased by 5%
- 🚀 Rank 1st in the AICAS 2024 LLM inference optimization challenge held by Arm and T-head
- 🚀 OnnxSlim is merged into ultralytics ❤️❤️❤️
- 🚀 OnnxSlim is merged into transformers.js 🤗🤗🤗
Installation
Using Prebuilt
pip install onnxslim
Install From Source
pip install git+https://github.com/inisis/OnnxSlim@main
Install From Local
git clone https://github.com/inisis/OnnxSlim && cd OnnxSlim/
pip install .
How to use
onnxslim your_onnx_model slimmed_onnx_model
For more usage, see onnxslim -h or refer to our examples
References
Contact
Discord: https://discord.gg/nRw2Fd3VUS QQ Group: 873569894
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
onnxslim-0.1.38.tar.gz
(119.7 kB
view details)
Built Distribution
onnxslim-0.1.38-py3-none-any.whl
(141.7 kB
view details)
File details
Details for the file onnxslim-0.1.38.tar.gz
.
File metadata
- Download URL: onnxslim-0.1.38.tar.gz
- Upload date:
- Size: 119.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.20
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a632a8dbe0dd5c71bfaf5e565cbb7459f1d8a83921689dd09e642c3ce6acf5fc |
|
MD5 | 8c8582a7f40beda438f67dbff7881b63 |
|
BLAKE2b-256 | 6f57d705a087816a60af00d87e53851821bd8ae572fb1695d3577bb79e8a7ada |
File details
Details for the file onnxslim-0.1.38-py3-none-any.whl
.
File metadata
- Download URL: onnxslim-0.1.38-py3-none-any.whl
- Upload date:
- Size: 141.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.20
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 51f589db63c8ea7232d219b440b65a7ed5ef61d56be1f504f069bd6a2ee72d4e |
|
MD5 | 19eb45bf58384628356299135f37afcc |
|
BLAKE2b-256 | a479ca30e66bf56f917139a19c1b73b8272b80554d936c97cce2003680a8e11c |