JIT unit-stripping decorator: write Pint-annotated code, run at float speed

unit-jit

We love explicit tracking of physical units in code, but do not want to pay the runtime overhead in hot loops. unit-jit solves this with a single decorator: write your functions against Pint as usual, and let unit-jit strip the unit machinery at JIT compile time so every call runs on plain floats.

from pint import Quantity, UnitRegistry
from unit_jit import unit_jit

ureg = UnitRegistry()

@unit_jit
def velocity(d: Quantity, t: Quantity) -> Quantity:
    return d / t

velocity(10 * ureg.m, 2 * ureg.s)   # first call: unit inference + fast
velocity(10 * ureg.m, 2 * ureg.s)   # fast (pure float internally)
velocity(10 * ureg.cm, 2 * ureg.s)  # fast and fine: same dimension, different unit
velocity(10 * ureg.m, 2 * ureg.m)   # TypeError: wrong dimension for arg 1

On the first call, unit-jit abstract-interprets the function body with the input units, checks dimensional correctness across all branches, infers return units, and caches a CST-rewritten version that operates on raw floats. All subsequent calls convert arguments to SI floats at the boundary, run the rewritten pure-float version, and wrap the result back into a Quantity with the cached units.
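
The boundary behaviour described above can be pictured with a toy model. This is a minimal sketch, not unit-jit's implementation: Qty, SI_FACTORS and strip_units are hypothetical names, the unit table is hand-written, and the return unit is passed in explicitly, whereas unit-jit infers it from the function body.

```python
from dataclasses import dataclass
from functools import wraps

# Hypothetical mini unit table: unit name -> (factor to SI, dimension label).
SI_FACTORS = {"m": (1.0, "length"), "cm": (0.01, "length"), "s": (1.0, "time")}


@dataclass(frozen=True)
class Qty:
    """Toy stand-in for a Pint Quantity: a magnitude plus a unit name."""
    magnitude: float
    unit: str

    def to_si(self) -> float:
        return self.magnitude * SI_FACTORS[self.unit][0]

    @property
    def dimension(self) -> str:
        return SI_FACTORS[self.unit][1]


def strip_units(return_unit: str):
    """Convert arguments to SI floats, run the float body, tag the result."""
    def decorate(float_body):
        expected = None  # argument dimensions, cached on the first call

        @wraps(float_body)
        def wrapper(*args: Qty):
            nonlocal expected
            dims = tuple(a.dimension for a in args)
            if expected is None:
                expected = dims
            elif dims != expected:
                raise TypeError(f"expected dimensions {expected}, got {dims}")
            raw = float_body(*(a.to_si() for a in args))  # pure floats inside
            return raw, return_unit  # toy "wrapped" result: (value, unit)
        return wrapper
    return decorate


@strip_units(return_unit="m/s")
def velocity(d: float, t: float) -> float:
    return d / t


print(velocity(Qty(10, "m"), Qty(2, "s")))  # (5.0, 'm/s')
```

A later call with centimetres passes the guard because only the dimension is cached, while passing a length as the second argument raises TypeError.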

Benchmark

Both functions below are identical in structure. simulate_pint runs with full Pint overhead on every call; simulate_fast runs as plain floats on every call after the first.

import time

import numpy as np
from pint import Quantity, UnitRegistry
from unit_jit import unit_jit

ureg = UnitRegistry()


@unit_jit
def simulate_fast(t: Quantity) -> Quantity:
    mrna  = 10.0 * ureg.nmol / ureg.L        # 10 nM initial concentration
    dt    =  1.0 * ureg.s                    # 1 s timestep
    delta = np.log(2) / (5.0 * ureg.min)     # half-life 5 min (E. coli mRNA)
    n = int(t / dt)
    out = np.empty(n)
    for i in range(n):
        mrna = mrna - delta * mrna * dt
        out[i] = mrna.to_base_units().magnitude
    return out * ureg.mol / ureg.m**3


def simulate_pint(t: Quantity) -> Quantity:
    mrna  = 10.0 * ureg.nmol / ureg.L
    dt    =  1.0 * ureg.s
    delta = np.log(2) / (5.0 * ureg.min)
    n = int(t / dt)
    out = np.empty(n)
    for i in range(n):
        mrna = mrna - delta * mrna * dt
        out[i] = mrna.to_base_units().magnitude
    return out * ureg.mol / ureg.m**3


T, repeats = 10 * ureg.min, 300

t0 = time.perf_counter()
for _ in range(repeats): simulate_pint(T)
t_pint = time.perf_counter() - t0

t0 = time.perf_counter()
for _ in range(repeats): simulate_fast(T)
t_fast = time.perf_counter() - t0

print(f"plain Pint: {t_pint / repeats * 1e3:.2f} ms per call")
print(f"unit_jit:   {t_fast / repeats * 1e3:.2f} ms per call  ({t_pint / t_fast:.0f}x vs Pint)")

Result on an Apple M3 Pro (600 steps, 300 repetitions):

plain Pint: 22.39 ms per call
unit_jit:    0.08 ms per call  (292x vs Pint)

The speedup scales with loop length: the longer the loop, the more Pint overhead is avoided per call.
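
A rough per-step figure follows from the reported timings. The arithmetic below uses only the numbers above and attributes the whole call time to the 600 loop iterations, so it ignores fixed per-call costs and should be read as an approximation.

```python
steps = 600
pint_ms, fast_ms = 22.39, 0.08  # per-call timings reported above

# Convert milliseconds per call into microseconds per loop iteration.
pint_us_per_step = pint_ms / steps * 1e3
fast_us_per_step = fast_ms / steps * 1e3

print(f"plain Pint: {pint_us_per_step:.1f} us per step")   # 37.3
print(f"unit_jit:   {fast_us_per_step:.2f} us per step")   # 0.13
```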

How it works

Unit inference: on the first call, all @unit_jit functions in the module are rewritten together. The function body is abstract-interpreted with the input units: dimensional errors (e.g. adding meters to seconds) are caught across all branches, and return units are inferred. If source is unavailable, the function falls back to running as plain Pint on every call.
Eager snapshot: Quantity attributes on objects (e.g. self.params.alpha) are pre-converted to SI floats once at boundary entry. Attribute access inside the loop is then a plain dict lookup.
Fast zone: a thread-local flag marks the outermost @unit_jit frame. Inner @unit_jit calls skip boundary conversion entirely.
Return wrapping: the SI unit of the return value is determined by abstract interpretation and cached. For NamedTuple returns, each field's unit is tracked independently and the result is reconstructed as the same NamedTuple type with all fields wrapped back as Quantity objects. The registry is captured from the first call's arguments, so results always belong to the same registry that produced them.
Lazy callee inference: when a @unit_jit function calls a method that is not yet inferred, including plain (non-decorated) methods and abstract methods implemented in subclasses, the inferrer analyses the callee recursively at inference time to resolve its return unit. The result is not written to global state; it is used only to complete the caller's unit chain.
Dimension guard: argument dimensions are cached from the first call; any later call with a different dimension raises TypeError immediately.

The right entry point is the outermost function that owns the hot loop, not the leaf functions it calls.
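
The fast-zone idea can be illustrated with a thread-local flag. The sketch below is an assumption about the general pattern, not unit-jit's code: boundary, _zone and conversions are hypothetical names, and appending to a list stands in for the actual unit conversion.

```python
import threading
from functools import wraps

_zone = threading.local()  # per-thread state: are we inside a boundary?
conversions = []           # records which calls paid the boundary cost


def boundary(func):
    """Toy decorator: only the outermost decorated call converts."""
    @wraps(func)
    def wrapper(*args):
        if getattr(_zone, "active", False):
            return func(*args)             # inner call: already in the fast zone
        _zone.active = True
        conversions.append(func.__name__)  # stand-in for unit conversion
        try:
            return func(*args)
        finally:
            _zone.active = False           # leave the zone even on exceptions
    return wrapper


@boundary
def rate(x: float) -> float:
    return -0.5 * x


@boundary
def simulate(x: float, n: int) -> float:
    for _ in range(n):
        x = x + rate(x) * 0.1  # rate() runs inside simulate's fast zone
    return x


simulate(1.0, 100)
print(conversions)  # ['simulate']
```

The 100 inner calls to rate never touch the boundary, which is why decorating the function that owns the loop pays off more than decorating only the leaves.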

Installation

uv add unit-jit  # or: pip install unit-jit

From source:

git clone https://github.com/BioDisCo/unit-jit && cd unit-jit
uv sync --extra dev  # or: pip install -e ".[dev]"

Usage

Scalar loop

The primary use case is a tight loop over scalars. unit_jit rewrites the function body so that all Pint calls disappear: ureg.nmol / ureg.L becomes the corresponding SI float, .to_base_units() is stripped, and arithmetic runs on plain floats. The result is wrapped back into a Quantity with the inferred units.

import numpy as np
from pint import Quantity, UnitRegistry
from unit_jit import unit_jit

ureg = UnitRegistry()

@unit_jit
def simulate(t: Quantity) -> Quantity:
    mrna  = 10.0 * ureg.nmol / ureg.L        # 10 nM initial concentration
    dt    =  1.0 * ureg.s                    # 1 s timestep
    delta = np.log(2) / (5.0 * ureg.min)     # half-life 5 min (E. coli mRNA)
    n = int(t / dt)
    out = np.empty(n)
    for i in range(n):
        mrna = mrna - delta * mrna * dt
        out[i] = mrna.to_base_units().magnitude
    return out * ureg.mol / ureg.m**3
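
As a sanity check on the numerics, the same recurrence can be written by hand in SI floats. This is a hand-converted sketch: the name simulate_floats is hypothetical, and a plain list replaces the NumPy array so that the example needs only the standard library.

```python
import math


def simulate_floats(t_seconds: float) -> list[float]:
    mrna = 10.0 * 1e-9 / 1e-3            # 10 nmol/L expressed in mol/m**3
    dt = 1.0                             # s
    delta = math.log(2) / (5.0 * 60.0)   # 1/s, half-life of 5 min
    n = int(t_seconds / dt)
    out = []
    for _ in range(n):
        mrna = mrna - delta * mrna * dt  # explicit Euler step
        out.append(mrna)
    return out


trace = simulate_floats(10 * 60.0)
print(len(trace))         # 600
print(trace[299] / 1e-5)  # close to 0.5: one half-life has elapsed
```

After 300 steps the concentration is within one percent of half the initial 1e-5 mol/m**3; the small deviation is the discretisation error of the Euler step.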

NumPy array argument

When the argument is a Quantity wrapping a NumPy array, unit_jit converts it to the underlying SI ndarray at the boundary. The function body then runs on plain NumPy, and the result is wrapped back.

import numpy as np
from pint import Quantity, UnitRegistry
from unit_jit import unit_jit

ureg = UnitRegistry()

@unit_jit
def path_total(path: Quantity) -> Quantity:
    return np.sum(path)

path = np.array([1.0, 2.0, 3.0]) * ureg.m
path_total(path)   # first call: inference + fast; returns 6.0 m as Quantity
path_total(path)   # fast

Multiple Quantity array arguments work the same way: each is converted to its SI ndarray independently, and the operation runs without any Pint overhead.

@unit_jit
def speeds(distances: Quantity, times: Quantity) -> Quantity:
    return distances / times

d = np.array([10.0, 20.0, 30.0]) * ureg.m
t = np.array([2.0,  4.0,  5.0]) * ureg.s
speeds(d, t)   # returns [5., 5., 6.] m/s as Quantity

Class with Quantity attributes

unit_jit can be applied to individual methods or to the whole class at once. When applied to a class, it decorates all non-dunder methods automatically.

unit_jit snapshots all Quantity attributes on self once at the outermost boundary entry, replacing them with SI floats. Inner methods skip boundary conversion entirely, so there is no double-conversion overhead.

from dataclasses import dataclass

import numpy as np
from pint import Quantity, UnitRegistry
from unit_jit import unit_jit

ureg = UnitRegistry()

@dataclass
class Params:
    alpha: Quantity   # [mol/L/s]
    delta: Quantity   # [1/s]

@unit_jit
class Model:
    def __init__(self, params: Params) -> None:
        self.params = params

    def rate(self, mrna: Quantity) -> Quantity:
        return self.params.alpha - self.params.delta * mrna

    def simulate(self, t: Quantity) -> Quantity:  # entry point: owns the hot loop
        dt   = 10.0 * ureg.s
        mrna = self.params.alpha / self.params.delta
        n    = int(t / dt)
        out  = np.empty(n)
        for i in range(n):
            mrna = mrna + self.rate(mrna) * dt
            out[i] = mrna.to_base_units().magnitude
        return out * ureg.mol / ureg.m**3

simulate is the entry point: it owns the hot loop and is where boundary conversion happens. rate is an inner call, so it receives plain floats directly and its rewritten body runs without any Pint calls.
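
The eager snapshot can be pictured as a one-off recursive walk over the object's attributes. The sketch below is illustrative only: Qty and snapshot are hypothetical names, and unit-jit's actual snapshot mechanism may differ.

```python
from dataclasses import dataclass
from types import SimpleNamespace


@dataclass(frozen=True)
class Qty:
    """Toy stand-in for a Quantity: magnitude times a factor to SI."""
    magnitude: float
    si_factor: float


def snapshot(obj):
    """Return a float view of obj, converting every Qty leaf exactly once."""
    if isinstance(obj, Qty):
        return obj.magnitude * obj.si_factor
    if hasattr(obj, "__dict__"):
        return SimpleNamespace(**{k: snapshot(v) for k, v in vars(obj).items()})
    return obj  # plain values pass through unchanged


@dataclass
class Params:
    alpha: Qty
    delta: Qty


class Model:
    def __init__(self, params: Params) -> None:
        self.params = params


model = Model(Params(alpha=Qty(2.0, 1e-3), delta=Qty(0.5, 1.0)))
view = snapshot(model)  # done once, before the hot loop starts
print(view.params.alpha, view.params.delta)
```

Inside the loop, view.params.alpha is then an ordinary float attribute, so no conversion work is repeated per iteration.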

NamedTuple return values

A @unit_jit function can return a NamedTuple of Quantity fields. The inferrer tracks each field's unit independently; on the way out, the result is reconstructed as the same NamedTuple type with all fields wrapped back as Quantity objects.

from typing import NamedTuple

from pint import Quantity, UnitRegistry
from unit_jit import unit_jit

ureg = UnitRegistry()

class StepResult(NamedTuple):
    time: Quantity
    state: Quantity

class Integrator:
    def __init__(self, dt: Quantity, decay: Quantity) -> None:
        self.dt = dt
        self.decay = decay

    @unit_jit
    def step(self, t: Quantity, x: Quantity) -> StepResult:
        return StepResult(
            time=t + self.dt,
            state=x * (1.0 - self.decay * self.dt),
        )

integrator = Integrator(dt=0.1 * ureg.s, decay=0.5 / ureg.s)  # avoid shadowing the sys module name
result = integrator.step(0.0 * ureg.s, 1.0 * ureg.mol / ureg.L)
result.time   # Quantity in [time]
result.state  # Quantity in [substance / volume]
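
Per-field wrapping of a NamedTuple can be sketched as follows. The helper wrap_fields is hypothetical, and a (value, unit) pair stands in for a real Quantity so that the example runs without Pint.

```python
from typing import NamedTuple


class StepResult(NamedTuple):
    time: float
    state: float


def wrap_fields(raw: tuple, field_units: dict[str, str]):
    """Rebuild the same NamedTuple type with a (value, unit) pair per field."""
    cls = type(raw)
    return cls(**{name: (getattr(raw, name), field_units[name])
                  for name in cls._fields})


raw = StepResult(time=0.1, state=950.0)      # pure-float result in SI
units = {"time": "s", "state": "mol/m**3"}   # cached once, one unit per field
wrapped = wrap_fields(raw, units)
print(type(wrapped).__name__, wrapped.time, wrapped.state)
```

The values correspond to the call above in SI: 0.95 mol/L is 950 mol/m**3.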

Abstract and plain helper methods

@unit_jit functions can call methods that are not themselves decorated, including abstract methods whose concrete implementation lives in a subclass. The inferrer analyses the callee lazily at inference time to determine its return unit, without requiring @unit_jit on the callee.

from abc import ABC, abstractmethod
from typing import cast

from pint import Quantity, UnitRegistry
from unit_jit import unit_jit

ureg = UnitRegistry()

class KineticBase(ABC):
    def __init__(self, volume: Quantity) -> None:
        self.volume = volume

    @abstractmethod
    def propensity(self, n: Quantity) -> Quantity: ...

    @unit_jit
    def total_rate(self, n: Quantity) -> Quantity:
        # propensity is abstract here; its return unit is inferred from the
        # concrete override at inference time
        return cast("Quantity", self.propensity(n) * self.volume)

class ConcreteModel(KineticBase):
    def __init__(self, alpha: Quantity, volume: Quantity) -> None:
        super().__init__(volume)
        self.alpha = alpha

    def propensity(self, n: Quantity) -> Quantity:  # plain method, no @unit_jit needed
        return cast("Quantity", self.alpha * n)

model = ConcreteModel(
    alpha=2.0 / ureg.s / (ureg.mol / ureg.L),
    volume=1.0 * ureg.L,
)
model.total_rate(3.0 * ureg.mol / ureg.L)  # returns Quantity in [volume / time]

The same mechanism applies to plain (non-abstract) helper methods: any method called from within a @unit_jit function is analysed lazily if its return unit is not yet known.
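
To see why an undecorated helper can still be analysed, it helps to run the methods on dimensions instead of numbers. The sketch below is a simplified stand-in for abstract interpretation: Dim is a hypothetical class that tracks exponents through operator overloading, whereas unit-jit analyses the source of the callee.

```python
from collections import Counter


class Dim:
    """Abstract value: a dimension as a map from base name to exponent."""

    def __init__(self, **exps: int) -> None:
        self.exps = {k: v for k, v in exps.items() if v != 0}

    def __mul__(self, other: "Dim") -> "Dim":
        total = Counter(self.exps)
        total.update(other.exps)  # adds exponents, keeps negative ones
        return Dim(**total)

    def __truediv__(self, other: "Dim") -> "Dim":
        return self * Dim(**{k: -v for k, v in other.exps.items()})

    def __eq__(self, other: object) -> bool:
        return isinstance(other, Dim) and self.exps == other.exps

    def __repr__(self) -> str:
        return f"Dim({self.exps})"


class ConcreteModel:
    def __init__(self, alpha: Dim, volume: Dim) -> None:
        self.alpha, self.volume = alpha, volume

    def propensity(self, n: Dim) -> Dim:  # plain helper, analysed on demand
        return self.alpha * n

    def total_rate(self, n: Dim) -> Dim:
        return self.propensity(n) * self.volume


conc = Dim(substance=1, length=-3)  # concentration
model = ConcreteModel(alpha=Dim(time=-1) / conc, volume=Dim(length=3))
print(model.total_rate(conc))       # dimension of volume / time
```

Evaluating total_rate forces propensity to be evaluated on the way, which mirrors how the callee's return unit is resolved only to complete the caller's unit chain.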

Pre-compilation with input_args

By default, unit inference runs on the first call. Pass input_args to the decorator to trigger it at decoration time instead, so every subsequent call is immediately fast.

import numpy as np
from pint import Quantity, UnitRegistry
from unit_jit import unit_jit

ureg = UnitRegistry()

@unit_jit(input_args=(ureg.m, ureg.s))
def velocity(d: Quantity, t: Quantity) -> Quantity:
    return d / t

velocity(10 * ureg.m, 2 * ureg.s)  # already fast: inference already ran

Bare units like ureg.m are treated as 1 * ureg.m. Full Quantity values work too, and so do Quantity wrapping NumPy arrays:

@unit_jit(input_args=(np.array([1.0, 2.0, 3.0]) * ureg.m,))
def path_total(path: Quantity) -> Quantity:
    return np.sum(path)

path_total(np.array([10.0, 20.0]) * ureg.m)  # already fast

Note: input_args compiles every @unit_jit function registered in the module up to that point, so definition order matters. Decorate the callees first, then decorate the entry point with input_args; an entry point defined after the @unit_jit functions it calls is therefore fine.
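
The difference between lazy and eager triggering can be sketched with a decorator that works both bare and with keyword arguments. Everything here is hypothetical (toy_jit, compiled), and appending a name to a list stands in for inference and rewriting.

```python
from functools import wraps

compiled = []  # names of functions whose (toy) inference has run


def toy_jit(func=None, *, input_args=None):
    """Usable as @toy_jit or as @toy_jit(input_args=...)."""
    def decorate(f):
        def infer(args):
            if f.__name__ not in compiled:
                compiled.append(f.__name__)  # stand-in for inference + rewrite

        if input_args is not None:
            infer(input_args)                # eager: at decoration time

        @wraps(f)
        def wrapper(*args):
            infer(args)                      # lazy: no-op if already done
            return f(*args)
        return wrapper

    return decorate(func) if func is not None else decorate


@toy_jit
def lazy(d, t):
    return d / t


@toy_jit(input_args=(1.0, 1.0))
def eager(d, t):
    return d / t


print(compiled)  # ['eager']: only the eager function is ready so far
lazy(10.0, 2.0)
print(compiled)  # ['eager', 'lazy']
```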

Debugging

To inspect what code actually runs after rewriting, use get_rewritten_source. It triggers compilation if needed and returns the rewritten function source as a string.

import numpy as np
from pint import Quantity, UnitRegistry
from unit_jit import unit_jit, get_rewritten_source

ureg = UnitRegistry()

@unit_jit
def simulate(t: Quantity) -> Quantity:
    mrna  = 10.0 * ureg.nmol / ureg.L        # 10 nM initial concentration
    dt    =  1.0 * ureg.s                    # 1 s timestep
    delta = np.log(2) / (5.0 * ureg.min)     # half-life 5 min (E. coli mRNA)
    n = int(t / dt)
    out = np.empty(n)
    for i in range(n):
        mrna = mrna - delta * mrna * dt
        out[i] = mrna.to_base_units().magnitude
    return out * ureg.mol / ureg.m**3

simulate(10 * ureg.min)  # trigger inference and compilation
print(get_rewritten_source(simulate))

Output:

def simulate(t: Quantity) -> Quantity:
    mrna  = 10.0 * 1e-09 / 0.0010000000000000002
    dt    =  1.0 * 1.0
    delta = np.log(2) / (5.0 * 60.0)
    n = int(t / dt)
    out = np.empty(n)
    for i in range(n):
        mrna = mrna - delta * mrna * dt
        out[i] = mrna
    return out * 1.0 / 1.0 ** 3

All ureg unit references are replaced by their SI float values (ureg.nmol / ureg.L becomes 1e-9 / 0.001, where 0.001 is printed as 0.0010000000000000002 because of floating-point round-off; ureg.min becomes 60.0; ureg.mol / ureg.m**3 becomes 1.0 / 1.0**3), .to_base_units().magnitude is stripped, and the arithmetic is otherwise unchanged.

get_rewritten_source shows only what runs in the rewritten version. The boundary is not shown: arguments arrive as plain SI floats (so t is a float in seconds, not a Quantity), and the raw return value is wrapped back into a Quantity by the runtime using the inferred units.
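
The kind of rewrite shown above can be imitated with the standard library. The sketch below swaps in Python's ast module (Python 3.9+ for ast.unparse) for the CST rewriting that unit-jit describes, so it does not preserve formatting; the SI table and the StripUnits class are hypothetical and cover only the units used in this example.

```python
import ast

# Hypothetical SI factors for the units used in the example above.
SI = {"nmol": 1e-9, "L": 1e-3, "s": 1.0, "min": 60.0, "mol": 1.0, "m": 1.0}


class StripUnits(ast.NodeTransformer):
    def visit_Attribute(self, node: ast.Attribute) -> ast.AST:
        self.generic_visit(node)  # rewrite children first
        # ureg.<unit>  ->  float constant
        if (isinstance(node.value, ast.Name) and node.value.id == "ureg"
                and node.attr in SI):
            return ast.copy_location(ast.Constant(value=SI[node.attr]), node)
        # <expr>.magnitude  ->  <expr>
        if node.attr == "magnitude":
            return node.value
        return node

    def visit_Call(self, node: ast.Call) -> ast.AST:
        self.generic_visit(node)
        # <expr>.to_base_units()  ->  <expr>
        if isinstance(node.func, ast.Attribute) and node.func.attr == "to_base_units":
            return node.func.value
        return node


def strip(source: str) -> str:
    tree = StripUnits().visit(ast.parse(source))
    return ast.unparse(ast.fix_missing_locations(tree))


rewritten = strip("out = (5.0 * ureg.min).to_base_units().magnitude")
print(rewritten)  # out = 5.0 * 60.0
```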

Numba integration

For functions with a pure float/NumPy inner loop, use_numba=True additionally compiles the rewritten function with Numba, giving a further speedup on top of the Pint stripping.

import numpy as np
from pint import Quantity, UnitRegistry
from unit_jit import unit_jit

ureg = UnitRegistry()

@unit_jit(use_numba=True)
def simulate(t: Quantity) -> Quantity:
    mrna  = 10.0 * ureg.nmol / ureg.L        # 10 nM initial concentration
    dt    =  1.0 * ureg.s                    # 1 s timestep
    delta = np.log(2) / (5.0 * ureg.min)     # half-life 5 min (E. coli mRNA)
    n = int(t / dt)
    out = np.empty(n)
    for i in range(n):
        mrna = mrna - delta * mrna * dt
        out[i] = mrna.to_base_units().magnitude
    return out * ureg.mol / ureg.m**3

simulate(10 * ureg.min)  # 1st call: unit inference + compilation
simulate(10 * ureg.min)  # 2nd call: triggers Numba compilation
simulate(10 * ureg.min)  # 3rd call onwards: Numba-compiled float loop

Two calls are needed before reaching full speed: the first runs unit inference and CST rewriting, and the second triggers Numba's own JIT compilation. From the third call on, the full pipeline runs at native speed.

On the same mRNA decay benchmark (Apple M3 Pro, 600 steps, 300 repetitions):

plain Pint:        23.10 ms per call
unit_jit:           0.08 ms per call   (291x vs Pint)
unit_jit + Numba:   0.01 ms per call  (1687x vs Pint)

The additional speedup of roughly 6x on top of unit_jit (1687x versus 291x) comes from Numba compiling the inner loop to native code. The gain grows with loop complexity and body size. Numba is imported lazily and is only required when use_numba=True is set.

use_numba=True requires the rewritten function body to be pure float/NumPy with no calls back into Python-wrapped @unit_jit methods, as Numba cannot compile through the Python wrapper. It is best suited for self-contained leaf functions.

pintrs compatibility

unit-jit also works with pintrs, a Rust-backed drop-in replacement for Pint. No configuration is needed: if pintrs is installed, its Quantity, UnitRegistry, and Unit types are detected automatically alongside pint's.

import pintrs
from unit_jit import unit_jit

ureg = pintrs.UnitRegistry()

@unit_jit
def velocity(d: pintrs.Quantity, t: pintrs.Quantity) -> pintrs.Quantity:
    return d / t

velocity(10 * ureg.m, 2 * ureg.s)  # returns a pintrs Quantity

The registry is captured from the first call's arguments, so results are always wrapped in the same pintrs registry and interoperate naturally with the rest of your quantities.

Running tests

pytest

Feedback

If you find this library useful, feel free to drop a message; hearing about your experience would be very welcome. If you have suggestions or run into a problem, don't hesitate to open an issue.

License

Apache-2.0

Release history

0.4.10   Apr 7, 2026
0.4.9    Apr 7, 2026 (this version)
0.4.8    Mar 28, 2026
0.4.7    Mar 28, 2026
0.4.6    Mar 28, 2026
0.4.5    Mar 27, 2026
0.4.4    Mar 27, 2026
0.4.3    Mar 26, 2026
0.4.2    Mar 24, 2026
0.4.1    Mar 24, 2026
0.4.0    Mar 24, 2026
0.3.0    Mar 21, 2026
0.2.0    Mar 21, 2026
0.1.0    Mar 21, 2026
