Classical ML Inference Compiler — compiles trained ML models into optimized native inference binaries

These details have not been verified by PyPI

Project description

Timber

Compile classical ML models to native C. Serve them in microseconds.

Documentation · Changelog · PyPI · Technical Paper

Timber takes a trained tree-based model — XGBoost, LightGBM, scikit-learn, CatBoost, or ONNX — runs it through a multi-pass optimizing compiler, and emits a self-contained C99 inference binary with zero runtime dependencies. A built-in HTTP server (Ollama-compatible API) lets you serve any model — local file or remote URL — in one command.

~2 µs single-sample inference · ~336× faster than Python XGBoost · ~48 KB artifact · zero runtime dependencies

See it in action

Point Timber at any URL and it downloads, compiles, and serves in one command — no setup required.

$ timber serve https://raw.githubusercontent.com/kossisoroyce/timber/main/examples/breast_cancer_model.json

  ✓ Downloaded              breast_cancer_model.json
  ✓ Format detected         xgboost
  ✓ Parsed model            50 trees · 30 features · binary:logistic
  ✓ Optimized               3/5 passes applied
  ✓ Generated C99           169 lines
  ✓ Compiled binary         47.9 KB

  Serving    breast_cancer_model
  Endpoint   http://localhost:11434
  Framework  xgboost  ·  50 trees  ·  30 features

  POST  http://localhost:11434/api/predict
  GET   http://localhost:11434/api/models
  GET   http://localhost:11434/api/health

Predict immediately:

$ curl -s http://localhost:11434/api/predict \
    -H 'Content-Type: application/json' \
    -d '{"model": "breast_cancer_model", "inputs": [[1.799e+01, 1.038e+01, 1.228e+02, 1.001e+03, 0.1184, 0.2776, 0.3001, 0.1471, 0.2419, 0.07871, 1.095, 0.9053, 8.589, 153.4, 0.006399, 0.04904, 0.05373, 0.01587, 0.03003, 0.006193, 2.538e+01, 1.733e+01, 1.846e+02, 2.019e+03, 0.1622, 0.6656, 0.7119, 0.2654, 0.4601, 0.1189]]}'

{"model": "breast_cancer_model", "outputs": [[0.9971]], "n_samples": 1}

Or load from a local file and serve by name:

$ timber load fraud_model.json --name fraud-detector
$ timber serve fraud-detector

Who is this for?
How it works
Quick Start
Supported Formats
Performance
Runtime Comparison
API Reference
CLI Reference
Examples
Limitations
Roadmap
Development
Citation
Community & Governance
License

Who is this for?

Timber is built for teams that need fast, predictable, and portable inference:

Fraud & risk teams — run classical models in sub-millisecond transaction paths without Python overhead
Edge & IoT deployments — ship a ~48 KB C artifact to gateways, microcontrollers, or ARM Cortex-M targets
Regulated industries — finance, healthcare, and automotive teams that need deterministic, auditable inference artifacts
Platform & infra teams — eliminate the Python model-serving stack from your critical path entirely

How it works

  ┌─────────────────────────────────────────────────────────┐
  │                     timber load                         │
  │                                                         │
  │  Model file  ──►  Parser  ──►  Timber IR  ──►  Optimizer│
  │  (.json/.pkl/                  (typed AST)   (dead-leaf  │
  │   .txt/.onnx)                               elim, quant, │
  │                                              branch-sort) │
  │                                     │                    │
  │                                     ▼                    │
  │                               C99 Emitter                │
  │                                     │                    │
  │                    ┌────────────────┼────────────────┐   │
  │                    ▼                ▼                ▼   │
  │               model.c         model.h        model_data.c│
  │               (inference)     (public API)   (tree data)  │
  │                    │                                     │
  │                    └──► gcc / clang ──► model.so         │
  └─────────────────────────────────────────────────────────┘
                              │
                              ▼
                      timber serve <name>
                   http://localhost:11434/api/predict

The compiler pipeline:

Parse — reads the native model format into a framework-agnostic Timber IR
Optimize — dead-leaf elimination, threshold quantization, constant-feature folding, branch sorting
Emit — generates deterministic, portable C99 with no dynamic allocation and no recursion
Compile — gcc/clang produces a shared library loaded via ctypes
Serve — an Ollama-compatible HTTP API wraps the binary for drop-in integration

Quick Start

pip install timber-compiler

Serve any model directly from a URL — no pre-download step:

timber serve https://yourhost.com/models/fraud_model.json

Or load a local model and serve by name:

timber load fraud_model.json --name fraud-detector
timber serve fraud-detector

Predict:

curl -s http://localhost:11434/api/predict \
  -H "Content-Type: application/json" \
  -d '{"model": "fraud-detector", "inputs": [[1.2, 0.4, 3.1, 0.9]]}'

{"model": "fraud-detector", "outputs": [[0.031]], "latency_us": 1.8}

That's it. No model server configuration, no Python runtime in the hot path.

Supported Formats

Framework	File format	Notes
XGBoost	`.json`	All objectives; multiclass, binary, regression
LightGBM	`.txt`, `.model`, `.lgb`	All objectives including multiclass
scikit-learn	`.pkl`, `.pickle`	GradientBoostingClassifier/Regressor, RandomForest, ExtraTrees, DecisionTree, Pipeline
ONNX	`.onnx`	`TreeEnsembleClassifier` and `TreeEnsembleRegressor` ML operators
CatBoost	`.json`	JSON export (`save_model(..., format='json')`)

Performance

Benchmarks run on Apple M2 Pro · 16 GB RAM · macOS · XGBoost binary classifier · 50 trees · max depth 4 · 30 features (sklearn breast_cancer) · 10,000 timed iterations after 1,000 warmup.

Runtime	Single-sample latency	Throughput	Speedup vs Python
Timber (native C)	~2 µs	~500,000 / sec	336×
ONNX Runtime	~80–150 µs	~10,000 / sec	~5×
Treelite (compiled)	~10–30 µs	~50,000 / sec	~20×
Python XGBoost	~670 µs	~1,500 / sec	1× (baseline)
Python scikit-learn	~900 µs	~1,100 / sec	0.7×

Latency is in-process (not HTTP round-trip). Network overhead adds ~50–200 µs depending on your stack.

Reproduce these numbers

python benchmarks/run_benchmarks.py --output benchmarks/results.json
python benchmarks/render_table.py   --input  benchmarks/results.json

See benchmarks/ for full methodology, hardware capture script, and optional ONNX Runtime / Treelite / lleaves comparisons.

Runtime Comparison

	Timber	Python serving	ONNX Runtime	Treelite	lleaves
Latency	~2 µs	100s of µs–ms	~100 µs	~10–30 µs	~50 µs
Runtime deps	None	Python + framework	ONNX Runtime libs	Treelite runtime	Python + LightGBM
Artifact size	~48 KB	50–200+ MB process	MBs	MB-scale	Python env
Formats	5	Each framework only	ONNX only	GBDTs	LightGBM only
C export	Yes (C99)	No	No	Yes	No
Edge / embedded	Yes	No	Partial	Partial	No
Audit / MISRA	Roadmap	No	No	No	No

API Reference

Timber's server exposes an Ollama-compatible REST API on http://localhost:11434 by default.

Endpoint	Method	Body / Params	Description
`/api/predict`	POST	`{"model": str, "inputs": [[float]]}`	Run inference
`/api/generate`	POST	same as `/api/predict`	Ollama alias
`/api/models`	GET	—	List all loaded models
`/api/model/:name`	GET	—	Model metadata & schema
`/api/health`	GET	—	Health check

Example — batch inference:

curl -s http://localhost:11434/api/predict \
  -H "Content-Type: application/json" \
  -d '{
    "model": "fraud-detector",
    "inputs": [
      [1.2, 0.4, 3.1, 0.9],
      [0.1, 2.3, 1.0, 4.4]
    ]
  }'

CLI Reference

timber load   <path> --name <name>   Compile and register a model
timber serve  <name> [--port N]      Start the inference server
timber list                          List registered models
timber inspect <name>                Show model IR summary and schema
timber validate <name>               Run numerical validation against source
timber bench  <name>                 Benchmark latency and throughput
timber pull   <url>  --name <name>   Download and compile from URL
timber remove <name>                 Remove a model from the registry

Examples

Runnable end-to-end examples live in examples/:

python examples/quickstart_xgboost.py   # trains, compiles, and benchmarks
python examples/quickstart_lightgbm.py
python examples/quickstart_sklearn.py

Each script trains a model, saves it, runs timber load, and validates predictions against the source framework.

Limitations

ONNX — currently supports TreeEnsembleClassifier / TreeEnsembleRegressor operators only
CatBoost — requires JSON export (save_model(..., format='json')); native binary format not supported
scikit-learn — major estimators and Pipeline wrappers are supported; uncommon custom estimators may require a custom front-end
Pickle — follow standard pickle security hygiene; only load artifacts from trusted sources
XGBoost — JSON model format is the primary path; binary booster format is not supported
MISRA-C / safety certification — deterministic output is guaranteed but formal MISRA-C compliance is on the roadmap, not yet certified

Roadmap

Status	Item
✅	XGBoost, LightGBM, scikit-learn, CatBoost, ONNX front-ends
✅	Multi-pass IR optimizer (dead-leaf, quantization, branch sort, scaler fusion)
✅	C99 emitter with WebAssembly target
✅	Ollama-compatible HTTP inference server
✅	PyPI packaging with OIDC trusted publishing
🔄	Remote model registry (`timber pull` from hosted model library)
🔄	Broader ONNX operator support (linear, SVM, normalizers)
🔄	ARM Cortex-M / RISC-V embedded deployment profiles
🔄	MISRA-C compliant output mode for automotive/aerospace
🔄	Richer benchmark matrices and public reproducibility report
🔲	LLVM IR target for hardware-specific optimization
🔲	Differential privacy inference mode

Development

git clone https://github.com/kossisoroyce/timber.git
cd timber
pip install -e ".[dev]"
pytest tests/ -v                    # 146 tests
ruff check timber/                  # linting

The test suite covers parsers, IR, optimizer passes, C99 emission, WebAssembly emission, numerical accuracy (± 1e-4), and end-to-end compilation for all supported frameworks.

See CONTRIBUTING.md for the full development guide.

Citation

If you use Timber in research or production, please cite the accompanying technical paper:

@misc{royce2026timber,
  title        = {Timber: Compiling Classical Machine Learning Models to Native Inference Binaries},
  author       = {Kossiso Royce},
  year         = {2026},
  howpublished = {GitHub repository and technical paper},
  institution  = {Electricsheep Africa},
  url          = {https://github.com/kossisoroyce/timber}
}

The full paper is available at paper/timber_paper.pdf.

Community & Governance

Contributing: CONTRIBUTING.md
Code of conduct: CODE_OF_CONDUCT.md
Security policy: SECURITY.md
Changelog: CHANGELOG.md

Bugs and feature requests: open an issue. Questions: start a discussion.

Support the Project

Timber is developed and maintained by Electricsheep Africa. If Timber saves your team engineering time, consider supporting continued development:

We're also building a hosted library of compiled models — if you have high-performance models you'd like to contribute, get in touch.

License

Apache-2.0 — see LICENSE for the full text.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.6.0

Apr 16, 2026

0.5.0

Mar 13, 2026

0.4.0

Mar 4, 2026

0.3.0 yanked

Mar 4, 2026

Reason this release was yanked:

Multi threading implementation showed degradation in performance. Http overhed even in C is not worth it. We will implement alternatives in managed cloud.

This version

0.2.0

Mar 3, 2026

0.1.0

Feb 27, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

timber_compiler-0.2.0.tar.gz (884.0 kB view details)

Uploaded Mar 3, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

timber_compiler-0.2.0-py3-none-any.whl (77.7 kB view details)

Uploaded Mar 3, 2026 Python 3

File details

Details for the file timber_compiler-0.2.0.tar.gz.

File metadata

Download URL: timber_compiler-0.2.0.tar.gz
Upload date: Mar 3, 2026
Size: 884.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for timber_compiler-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`e4ad9a3ee2a14d35b7f71d448283ed1cc3a7d0228b241463f35735a43d81b6d3`
MD5	`c8e936ae9c1a50ff9266d5384be5d2ce`
BLAKE2b-256	`0e4e572f18336a713619ac4763c7943b7b8e3fc837d337de45e714fdcec74b71`

See more details on using hashes here.

Provenance

The following attestation bundles were made for timber_compiler-0.2.0.tar.gz:

Publisher: release.yml on kossisoroyce/timber

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: timber_compiler-0.2.0.tar.gz
- Subject digest: e4ad9a3ee2a14d35b7f71d448283ed1cc3a7d0228b241463f35735a43d81b6d3
- Sigstore transparency entry: 1017889781
- Sigstore integration time: Mar 3, 2026
Source repository:
- Permalink: kossisoroyce/timber@449be29f3d93f42612e6e4171ca139dd3461f871
- Branch / Tag: refs/tags/v0.2.0
- Owner: https://github.com/kossisoroyce
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@449be29f3d93f42612e6e4171ca139dd3461f871
- Trigger Event: push

File details

Details for the file timber_compiler-0.2.0-py3-none-any.whl.

File metadata

Download URL: timber_compiler-0.2.0-py3-none-any.whl
Upload date: Mar 3, 2026
Size: 77.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for timber_compiler-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6cb82926f7a18e07649fe10de3dcbb33de101ed9ba2b9173c6bd14b6b09a01a9`
MD5	`f3cc9383a712ab43a8f2b530e31d18ce`
BLAKE2b-256	`97959c62be4e4169de156a99a6500c2c3188f426175d9bb19cd13301ecc8c6e9`

See more details on using hashes here.

Provenance

The following attestation bundles were made for timber_compiler-0.2.0-py3-none-any.whl:

Publisher: release.yml on kossisoroyce/timber

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: timber_compiler-0.2.0-py3-none-any.whl
- Subject digest: 6cb82926f7a18e07649fe10de3dcbb33de101ed9ba2b9173c6bd14b6b09a01a9
- Sigstore transparency entry: 1017889795
- Sigstore integration time: Mar 3, 2026
Source repository:
- Permalink: kossisoroyce/timber@449be29f3d93f42612e6e4171ca139dd3461f871
- Branch / Tag: refs/tags/v0.2.0
- Owner: https://github.com/kossisoroyce
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@449be29f3d93f42612e6e4171ca139dd3461f871
- Trigger Event: push

timber-compiler 0.2.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Timber

See it in action

Table of Contents

Who is this for?

How it works

Quick Start

Supported Formats

Performance

Reproduce these numbers

Runtime Comparison

API Reference

CLI Reference

Examples

Limitations

Roadmap

Development

Citation

Community & Governance

Support the Project

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance