A very simple tool that compresses the overall size of the ONNX model by aggregating duplicate constant values as much as possible. Simple Constant value Shrink for ONNX.
Project description
scs4onnx
A very simple tool that compresses the overall size of the ONNX model by aggregating duplicate constant values as much as possible. Simple Constant value Shrink for ONNX.
Key concept
- If the same constant tensor is found by scanning the entire graph for Constant values, it is aggregated into a single constant tensor.
- Ignore scalar values.
- Ignore variables.
1. Setup
### option
$ echo export PATH="~/.local/bin:$PATH" >> ~/.bashrc \
&& source ~/.bashrc
### run
$ pip install -U onnx \
&& python3 -m pip install -U onnx_graphsurgeon --index-url https://pypi.ngc.nvidia.com \
&& pip install -U scs4onnx
2. Usage
$ scs4onnx -h
usage: scs4onnx [-h] [--mode {shrink,npy}] input_onnx_file_path output_onnx_file_path
positional arguments:
input_onnx_file_path
Input onnx file path.
output_onnx_file_path
Output onnx file path.
optional arguments:
-h, --help show this help message and exit
--mode {shrink,npy} Constant Value Compression Mode.
shrink: Share constant values inside the model as much as possible.
The model size is slightly larger because
some shared constant values remain inside the model,
but performance is maximized.
npy: Outputs constant values used repeatedly in the model to an
external file .npy. Instead of the smallest model body size,
the file loading overhead is greater.
Default: shrink
3. CLI Execution
$ scs4onnx input.onnx output.onnx --mode=shrink
4. In-script Execution
from scs4onnx import shrinking
shrunk_graph, npy_file_paths = shrinking('input.onnx', 'output.onnx', mode='npy')
5. Sample
5-1. shrink
mode sample
-
297.8MB -> 67.4MB
5-2. npy
mode sample
-
297.8MB -> 21.3MB
5. Reference
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
scs4onnx-1.0.2.tar.gz
(5.6 kB
view details)
Built Distribution
File details
Details for the file scs4onnx-1.0.2.tar.gz
.
File metadata
- Download URL: scs4onnx-1.0.2.tar.gz
- Upload date:
- Size: 5.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.10.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 701d0c8dc42138f2e8ddf6a8f0326f2f4f7913de21f24a554041900c3b290cb8 |
|
MD5 | a4324a1428834e35334ca4ac3ec82e9c |
|
BLAKE2b-256 | d3e6bf80c6bbb0f94224d8666bd5c39246b5536c371cb4654d1db7fab18ea11f |
File details
Details for the file scs4onnx-1.0.2-py3-none-any.whl
.
File metadata
- Download URL: scs4onnx-1.0.2-py3-none-any.whl
- Upload date:
- Size: 6.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.10.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 37dc0ff9016df24cf7d66c4621736fecdd675c89a4806771a2d89b15d4c601a6 |
|
MD5 | 52c3cb2d9c15ba5fcf477611e0df4cdf |
|
BLAKE2b-256 | fddb9ae8f132bd4738a0b348bd046800301897c7a4ccce14ab372fa5a044f43f |