emmtrix ONNX-to-C Code Generator

emmtrix ONNX-to-C Code Generator (emx-onnx-cgen) compiles ONNX models to portable, deterministic C code for deeply embedded systems. The generated code is designed to run without dynamic memory allocation, operating-system services, or external runtimes, making it suitable for safety-critical and resource-constrained targets.

Key characteristics:

  • No dynamic memory allocation (malloc, free, heap usage)
  • Static, compile-time known memory layout for parameters, activations, and temporaries
  • Deterministic control flow (explicit loops, no hidden dispatch or callbacks)
  • No OS dependencies, using only standard C headers (for example, stdint.h and stddef.h)
  • Single-threaded execution model
  • Bitwise-stable code generation for reproducible builds
  • Readable, auditable C code suitable for certification and code reviews
  • Generated C output format spec: docs/output-format.md
  • Designed for bare-metal and RTOS-based systems
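As a hedged illustration of these properties, a kernel in this style might look like the following sketch. The identifiers, macros, and layout here are hypothetical, not the tool's actual emitted output; the point is the pattern: weights in `static const` arrays, a compile-time known memory footprint, and an explicit loop nest with no allocation, callbacks, or OS calls.

```c
#include <stddef.h>

/* Hypothetical sketch of the generated-code style described above.
 * All sizes are compile-time constants; there is no heap and no I/O. */

#define IN_DIM  4
#define OUT_DIM 2

static const float dense_weight[OUT_DIM][IN_DIM] = {
    {0.5f, -0.5f, 0.25f, 0.0f},
    {1.0f,  0.0f, -1.0f, 0.5f},
};
static const float dense_bias[OUT_DIM] = {0.1f, -0.1f};

/* One fully-connected layer as an explicit, auditable loop nest. */
void dense_forward(const float input[IN_DIM], float output[OUT_DIM]) {
    for (size_t o = 0; o < OUT_DIM; ++o) {
        float acc = dense_bias[o];
        for (size_t i = 0; i < IN_DIM; ++i) {
            acc += dense_weight[o][i] * input[i];
        }
        output[o] = acc;
    }
}
```

Because every shape and buffer is fixed at compile time, stack and static memory usage can be bounded by inspection, which is what makes this style reviewable for certification.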

For PyTorch models, see the related project emx-pytorch-cgen.

Goals

  • Correctness-first compilation with outputs comparable to ONNX Runtime.
  • Deterministic and reproducible C code generation.
  • Clean, pass-based compiler architecture (import → normalize → optimize → lower → emit).
  • Minimal C runtime with explicit, predictable data movement.

Non-goals

  • Aggressive performance optimizations in generated C.
  • Implicit runtime dependencies or dynamic loading.
  • Training/backpropagation support.

Features

  • CLI for ONNX-to-C compilation and verification.
  • Deterministic codegen with explicit tensor shapes and loop nests.
  • Minimal C runtime templates in src/emx_onnx_cgen/templates/.
  • ONNX Runtime comparison for end-to-end validation.
  • Official ONNX operator coverage tracking.
  • Support for a wide range of ONNX operators (see SUPPORT_OPS.md).
  • Supported data types:
    • bfloat16, float16, float, double
    • int8, uint8, int16, uint16, int32, uint32, int64, uint64
    • bool
    • string (fixed-size '\0'-terminated C strings; see docs/output-format.md)
    • sequence(<tensor type>) (fixed-capacity tensor sequences with presence/length metadata; see docs/output-format.md)
    • optional(<tensor type>) (optional tensors represented via an extra _Bool <name>_present flag; see docs/output-format.md)
    • Not supported: float8*, float4e2m1, int2/int4, uint2/uint4, complex64/complex128, and ONNX map/sparse_tensor/opaque value types.
  • Optional support for dynamic dimensions using C99 variable-length arrays (VLAs), when the target compiler supports them.
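To make the optional(&lt;tensor type&gt;) convention above concrete, here is a hedged sketch of how a consumer of an optional tensor could look, following the extra `_Bool <name>_present` flag described in docs/output-format.md. The function and buffer names are illustrative only.

```c
#include <stdbool.h>
#include <stddef.h>

/* Sketch of the optional-tensor convention: the value buffer is always
 * passed, and a separate _Bool <name>_present flag says whether it
 * holds valid data. Names here are hypothetical. */

#define LEN 3

/* Copies the optional input when present, otherwise writes a default. */
void fill_or_default(const float opt_in[LEN], _Bool opt_in_present,
                     float out[LEN]) {
    for (size_t i = 0; i < LEN; ++i) {
        out[i] = opt_in_present ? opt_in[i] : 0.0f;
    }
}
```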

Usage Scenarios

1. Fully Embedded, Standalone C Firmware

The generated C code can be embedded directly into a bare-metal C firmware or application where all model weights and parameters are compiled into the C source.

Typical characteristics:

  • No file system or OS required.
  • All weights stored as static const arrays in flash/ROM.
  • Deterministic memory usage with no runtime allocation.
  • Suitable for:
    • Microcontrollers
    • Safety-critical firmware
    • Systems with strict certification requirements

This scenario is enabled via --large-weight-threshold 0, forcing all weights to be embedded directly into the generated C code.

2. Embedded or Host C/C++ Application with External Weights

The generated C code can be embedded into C or C++ applications where large model weights are stored externally and loaded from a binary file at runtime.

Typical characteristics:

  • Code and control logic compiled into the application.
  • Large constant tensors packed into a separate .bin file.
  • Explicit, generated loader functions handle weight initialization.
  • Suitable for:
    • Embedded Linux or RTOS systems
    • Applications with limited flash but available external storage
    • Larger models where code size must be minimized

This scenario is enabled automatically once the cumulative weight size exceeds --large-weight-threshold (default: 102400 bytes).
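The external-weights pattern can be sketched as follows. This is an assumption-laden illustration, not the generator's actual loader code: the function name, buffer name, and error convention are hypothetical, but the shape is the same, namely a generated loader that fills static buffers from the companion `.bin` blob before the first inference.

```c
#include <stdio.h>
#include <stddef.h>

/* Hedged sketch of scenario 2: large constant tensors live in a
 * companion binary file, and a loader fills static buffers at startup.
 * After loading, the buffers are treated as read-only. */

#define W_COUNT 8

static float conv_weight[W_COUNT];

/* Returns 0 on success, nonzero if the blob is truncated. */
int load_weights(FILE *blob) {
    size_t got = fread(conv_weight, sizeof(float), W_COUNT, blob);
    return got == W_COUNT ? 0 : 1;
}

float weight_at(size_t i) { return conv_weight[i]; }
```

The explicit return code keeps failure handling deterministic: a truncated or missing weight file is detected once at initialization rather than surfacing mid-inference.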

3. Target-Optimized Code Generation via emmtrix Source-to-Source Tooling

In both of the above scenarios, the generated C code can serve as input to emmtrix source-to-source compilation and optimization tools, enabling target-specific optimizations while preserving functional correctness.

Examples of applied transformations include:

  • Kernel fusion and loop restructuring
  • Memory layout optimization and buffer reuse
  • Reduction of internal temporary memory
  • Utilization of SIMD / vector instruction sets
  • Offloading of large weights to external memory
  • Dynamic loading of weights or activations via DMA

This workflow allows a clear separation between:

  • Correctness-first, deterministic ONNX lowering, and
  • Target-specific performance and memory optimization,

while keeping the generated C code readable, auditable, and traceable.

The generated C code is intentionally structured to make such transformations explicit and analyzable, rather than relying on opaque backend-specific code generation.
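As an example of why explicit loop nests are analyzable, consider kernel fusion, the first transformation listed above. The sketch below is illustrative (not produced by any emmtrix tool): the unfused form mirrors the per-operator loops a correctness-first lowering would emit, and the fused form is the kind of rewrite a source-to-source optimizer can derive from it, eliminating the intermediate buffer while preserving results.

```c
#include <stddef.h>

#define N 4

/* Unfused: Add followed by Relu, with a temporary tensor between them,
 * as a straightforward per-operator lowering would emit. */
void add_relu_unfused(const float a[N], const float b[N], float out[N]) {
    float tmp[N];
    for (size_t i = 0; i < N; ++i) tmp[i] = a[i] + b[i];
    for (size_t i = 0; i < N; ++i) out[i] = tmp[i] > 0.0f ? tmp[i] : 0.0f;
}

/* Fused: one loop, no intermediate tensor, bitwise-identical results. */
void add_relu_fused(const float a[N], const float b[N], float out[N]) {
    for (size_t i = 0; i < N; ++i) {
        float s = a[i] + b[i];
        out[i] = s > 0.0f ? s : 0.0f;
    }
}
```

Because both forms are plain C with visible data flow, the equivalence of the rewrite can be checked mechanically, which is the traceability property the text describes.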

Installation

Install the package directly from PyPI (recommended):

pip install emx-onnx-cgen

Minimum Python version: 3.11.

Local release builds (matching GitHub release workflows)

To reproduce the release artifacts locally with the same settings as the GitHub workflows:

  • Linux (uses the same Docker image as .github/workflows/linux-release.yml):

    ./tools/local_release_linux.sh
    
  • Windows (matches the install/build steps from .github/workflows/windows-release.yml):

    powershell -ExecutionPolicy Bypass -File .\tools\local_release_windows.ps1
    

    If the exact pinned onnxruntime version from requirements-ci.txt is not available on your package index, the script falls back to a compatible version (onnxruntime>=<pinned>,<2). To enforce strict pin-only behavior, use:

    powershell -ExecutionPolicy Bypass -File .\tools\local_release_windows.ps1 -DisableOnnxruntimeFallback
    

Both scripts produce the same archive names used by the release workflows:

  • emx-onnx-cgen-linux-amd64.tar.gz
  • emx-onnx-cgen-windows-amd64.zip

Required at runtime (both compile and verify):

  • onnx
  • numpy
  • jinja2

Optional for verification and tests:

  • onnxruntime
  • A C compiler (cc, gcc, or clang, or one selected via --cc)
  • pytest for ONNX backend compliance smoke tests

Quickstart

Compile an ONNX model into a C source file:

emx-onnx-cgen compile path/to/model.onnx build/model.c

Verify an ONNX model end-to-end against ONNX Runtime (default):

emx-onnx-cgen verify path/to/model.onnx

If a model cannot be code-generated without representative concrete input shapes, pass them explicitly to both commands:

emx-onnx-cgen compile path/to/model.onnx build/model.c \
  --shape-inference-shapes "x=1x3x224x224;size=[1,3,224,224]"

emx-onnx-cgen verify path/to/model.onnx \
  --shape-inference-shapes "x=1x3x224x224;size=[1,3,224,224]"

--test-data-dir is verification input/output data only. It does not change the generated C code.

Use emx-onnx-cgen as an importable ONNX backend:

import onnx

import emx_onnx_cgen.onnx_backend as emx_backend

model = onnx.load("path/to/model.onnx")
rep = emx_backend.prepare(model)
outputs = rep.run(inputs)

The backend module is emx_onnx_cgen.onnx_backend. It compiles the ONNX model to C on demand, builds a temporary executable, and runs that executable through the standard ONNX backend interface.

You can also call it directly, without going through the backend prepare/run interface:

import onnx

from emx_onnx_cgen.onnx_backend import run_model

model = onnx.load("path/to/model.onnx")
outputs = run_model(model, inputs)

CLI Reference

emx-onnx-cgen provides two subcommands: compile and verify.

Common options

These options are accepted by both compile and verify:

  • --model-base-dir: Base directory for resolving the model path (and related paths).
  • --color: Colorize CLI output (auto, always, never; default: auto).
  • --large-weight-threshold: Store weights in a binary file once the cumulative byte size exceeds this threshold (default: 102400; set to 0 to disable).
  • --large-temp-threshold: Mark local arrays larger than this threshold as static (default: 1024). This applies to generated model temporaries and to generated testbench input/output buffers.
  • --fp32-accumulation-strategy: Accumulation strategy for float32 inputs (simple uses float32, fp64 uses double; default: fp64).
  • --fp16-accumulation-strategy: Accumulation strategy for float16 inputs (simple uses float16, fp32 uses float; default: fp32).
  • --replicate-ort-bugs: Compatibility switch for verification/debugging. Enables emulation of known behavior differences of the ONNX Runtime version pinned in requirements-ci.txt.
  • --shape-inference-shapes: Explicit shape/value specs used only to concretize dynamic shapes during code generation. Use name=2x3x4 for ordinary tensors, name=[1,3,224,224] for shape-carrying 1D tensors, and name=seq[4]:8x16 for sequences. The option must be passed explicitly and behaves identically for compile and verify.
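The accumulation-strategy options above can be illustrated with a minimal sketch. This is an assumption about what the strategies amount to, based only on the option descriptions ("simple uses float32, fp64 uses double"); the generated code's real structure may differ. With fp64 accumulation, float32 partial sums are kept in double and rounded back to float once at the end, which reduces rounding drift on long reductions.

```c
#include <stddef.h>

/* Sketch of --fp32-accumulation-strategy fp64 (the default):
 * accumulate float32 inputs in double, round once at the end. */
float sum_fp64_acc(const float *x, size_t n) {
    double acc = 0.0;
    for (size_t i = 0; i < n; ++i) acc += (double)x[i];
    return (float)acc;
}

/* Sketch of --fp32-accumulation-strategy simple:
 * accumulate directly in float32, rounding at every step. */
float sum_simple(const float *x, size_t n) {
    float acc = 0.0f;
    for (size_t i = 0; i < n; ++i) acc += x[i];
    return acc;
}
```

For short sums of exactly representable values the two agree; the strategies diverge on long reductions of values with different magnitudes.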

compile

emx-onnx-cgen compile <model.onnx> <output.c> [options]

Options:

  • --model-name: Override the generated model name (default: output file stem).
  • --emit-testbench: Emit a JSON-producing main() testbench for validation.
  • --testbench-file: Emit the testbench into a separate C file at the given path (implies --emit-testbench). If not set, the testbench is embedded in the main output C file (legacy behavior).
  • --emit-data-file: Emit constant data arrays into a companion _data C file.
  • --no-restrict-arrays: Disable restrict qualifiers on generated array parameters.

verify

emx-onnx-cgen verify <model.onnx> [options]

Options:

  • --cc: Explicit C compiler command for building the testbench binary.
  • --sanitize: Enable sanitizer instrumentation when compiling the verification binary (-fsanitize=address,undefined). If EMX_ENABLE_SANITIZE is set, it overrides this flag.
  • --per-node-accuracy: Also compare intermediate tensor outputs and print max error per node.
  • --max-ulp: Maximum allowed ULP distance for floating outputs (default: 100).
  • --atol-eps: Absolute tolerance as a multiple of machine epsilon for floating outputs (default: 1.0).
  • --runtime: Runtime backend for verification (onnxruntime or onnx-reference, default: onnxruntime).
  • --replicate-ort-bugs: Verification-only compatibility mode to reproduce known behavior differences of the ONNX Runtime version pinned in requirements-ci.txt.
  • --temp-dir-root: Root directory in which to create a temporary verification directory (default: system temp dir).
  • --temp-dir: Exact directory to use for temporary verification files (default: create a temporary directory).
  • --keep-temp-dir: Keep the temporary verification directory instead of deleting it.

How verification works:

  1. Compile with a testbench: the compiler is invoked with --emit-testbench, generating a C program that runs the model and prints inputs/outputs as JSON.
  2. Build and execute: the testbench is compiled with the selected C compiler (--cc, CC, or a detected cc/gcc/clang) and executed in a temporary directory.
  3. Run runtime backend: the JSON inputs from the testbench are fed to the selected runtime (onnxruntime or onnx-reference) using the same model. The compiler no longer ships a Python runtime evaluator.
  4. Compare outputs: floating outputs are compared by maximum ULP distance. Differences up to --atol-eps × machine epsilon of the evaluated floating-point type are first treated as equal; for values with a larger absolute difference, the ULP distance is computed and the maximum is reported. Non-floating outputs must match exactly. Missing outputs or mismatches are treated as failures.
  5. ORT unsupported models: when using onnxruntime, if ORT reports NOT_IMPLEMENTED, verification is skipped with a warning (exit code 0).
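The ULP comparison in step 4 can be sketched as follows. This is a generic illustration of the technique, not the verifier's actual code; in particular, its NaN and infinity handling may differ. The idea is to map each float's bit pattern onto a scale where adjacent representable floats differ by exactly 1, then take the absolute difference.

```c
#include <stdint.h>
#include <string.h>

/* Map a float's sign-magnitude bit pattern onto a monotonic integer
 * scale: +0 and -0 both map to 0, and each step of 1 on the scale is
 * one representable float (one ULP). */
static int64_t float_to_ordered(float f) {
    int32_t bits;
    memcpy(&bits, &f, sizeof bits);
    /* Negative floats sort backwards in raw bits; reflect them. */
    return bits >= 0 ? (int64_t)bits : (int64_t)INT32_MIN - (int64_t)bits;
}

/* Number of representable floats between a and b. */
int64_t ulp_distance(float a, float b) {
    int64_t d = float_to_ordered(a) - float_to_ordered(b);
    return d < 0 ? -d : d;
}
```

A ULP bound like --max-ulp 100 therefore scales with the magnitude of the compared values, unlike a fixed absolute tolerance, which is why the verifier combines it with the --atol-eps cutoff for near-zero values.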

Official ONNX test coverage

  • ONNX_SUPPORT.md: overview of ONNX models and their current verification status.
  • ONNX_ERRORS.md: summary of the most common verification outcomes and failure reasons.
  • SUPPORT_OPS.md: list of ONNX operators and whether they are currently supported.

Related Projects

  • emx-pytorch-cgen: the companion code generator for PyTorch models.

Maintained by

This project is maintained by emmtrix.
