Skip to main content

A very simple tool that compresses the overall size of the ONNX model by aggregating duplicate constant values as much as possible. Simple Constant value Shrink for ONNX.

Project description

scs4onnx

A very simple tool that compresses the overall size of the ONNX model by aggregating duplicate constant values as much as possible. Simple Constant value Shrink for ONNX.

Downloads GitHub PyPI

Key concept

  1. If the same constant tensor is found by scanning the entire graph for Constant values, it is aggregated into a single constant tensor.
  2. Ignore scalar values.
  3. Ignore variables.

1. Setup

### option
$ echo export PATH="~/.local/bin:$PATH" >> ~/.bashrc \
&& source ~/.bashrc

### run
$ pip install -U onnx \
&& python3 -m pip install -U onnx_graphsurgeon --index-url https://pypi.ngc.nvidia.com \
&& pip install -U scs4onnx

2. Usage

$ scs4onnx -h

usage: scs4onnx [-h] [--mode {shrink,npy}] [--non_verbose] input_onnx_file_path output_onnx_file_path

positional arguments:
  input_onnx_file_path
                        Input onnx file path.
  output_onnx_file_path
                        Output onnx file path.

optional arguments:
  -h, --help            show this help message and exit
  --mode {shrink,npy}   Constant Value Compression Mode.
                        shrink: Share constant values inside the model as much as possible.
                                The model size is slightly larger because
                                some shared constant values remain inside the model,
                                but performance is maximized.
                        npy:    Outputs constant values used repeatedly in the model to an
                                external file .npy. Instead of the smallest model body size,
                                the file loading overhead is greater.
                        Default: shrink
  --non_verbose         Do not show all information logs. Only error logs are displayed.

3. CLI Execution

$ scs4onnx input.onnx output.onnx --mode shrink

image

4. In-script Execution

from scs4onnx import shrinking

shrunk_graph, npy_file_paths = shrinking('input.onnx', 'output.onnx', mode='npy')

image

5. Sample

5-1. shrink mode sample

  • 297.8MB -> 67.4MB

    image image

5-2. npy mode sample

  • 297.8MB -> 21.3MB

    image image

5. Reference

  1. https://docs.nvidia.com/deeplearning/tensorrt/onnx-graphsurgeon/docs/index.html
  2. https://github.com/NVIDIA/TensorRT/tree/main/tools/onnx-graphsurgeon

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scs4onnx-1.0.4.tar.gz (5.8 kB view details)

Uploaded Source

Built Distribution

scs4onnx-1.0.4-py3-none-any.whl (6.7 kB view details)

Uploaded Python 3

File details

Details for the file scs4onnx-1.0.4.tar.gz.

File metadata

  • Download URL: scs4onnx-1.0.4.tar.gz
  • Upload date:
  • Size: 5.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.10.4

File hashes

Hashes for scs4onnx-1.0.4.tar.gz
Algorithm Hash digest
SHA256 8f9f9d95e415346ccb02c82c4443719f457f15eb11fad16a6e53f7cdec2df80b
MD5 77a0120ff6a10e2244a17059992d8586
BLAKE2b-256 d43fc66e4ffdd41ffb1b1f00c3677fb480c3cc531dec8e811e5b6c274b223d00

See more details on using hashes here.

File details

Details for the file scs4onnx-1.0.4-py3-none-any.whl.

File metadata

  • Download URL: scs4onnx-1.0.4-py3-none-any.whl
  • Upload date:
  • Size: 6.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.10.4

File hashes

Hashes for scs4onnx-1.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 178e9250e107c276d169f2f5b3bf5b600fd13ab414efe1498556d95e1e9bc738
MD5 11d766b759960c828b88b16b3d9d8333
BLAKE2b-256 356019fcbfda743083057063039eebd52b5a677854d93820e8629a9ecfdc51f7

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page