Type-Safe Deep Learning Framework for Computer Vision

Maintainers

tomrussobuilds

Orchard ML

Type-safe deep learning framework for reproducible computer vision research

Overview
Hardware Requirements
Quick Start
Colab Notebooks
Experiment Management
Documentation
Citation
Roadmap
License

Overview

Orchard ML is a research-grade PyTorch training framework for reproducible, scalable computer vision experiments across domains. It was built on the MedMNIST v2 medical imaging datasets and later extended to astronomical imaging (Galaxy10 DECaLS). It provides a domain-agnostic platform with multi-resolution architectures (28×28 to 224×224+), automated hyperparameter optimization, and cluster-safe execution.

Key Differentiators:

Type-Safe Configuration Engine: Pydantic V2-based declarative manifests reject invalid settings when the config is loaded, before training starts
Zero-Conflict Execution: Kernel-level file locking (fcntl) prevents concurrent runs from corrupting shared resources
Intelligent Hyperparameter Search: Optuna integration with TPE sampling and Median Pruning
Hardware-Agnostic: Auto-detection and optimization for CPU/CUDA/MPS backends
Audit-Grade Traceability: BLAKE2b-hashed run directories with full YAML snapshots
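
To illustrate the file-locking idea in the list above, here is a minimal sketch of an exclusive advisory lock built on Python's standard `fcntl` module. The helper names `exclusive_lock` and `is_locked`, the lock-file path, and the choice of `fcntl.flock` are assumptions made for illustration; this README only states that `fcntl` locking is used, not which call or which file. The sketch runs on Unix-like systems only.

```python
import fcntl
import os
import tempfile
from contextlib import contextmanager


@contextmanager
def exclusive_lock(lock_path):
    """Hold an exclusive advisory lock on lock_path for the duration of the block."""
    fd = os.open(lock_path, os.O_CREAT | os.O_RDWR, 0o644)
    try:
        fcntl.flock(fd, fcntl.LOCK_EX)  # blocks until no other holder remains
        yield
    finally:
        fcntl.flock(fd, fcntl.LOCK_UN)
        os.close(fd)


def is_locked(lock_path):
    """Return True if another open file description currently holds the lock."""
    fd = os.open(lock_path, os.O_CREAT | os.O_RDWR, 0o644)
    try:
        # Non-blocking attempt: fails immediately if the lock is taken.
        fcntl.flock(fd, fcntl.LOCK_EX | fcntl.LOCK_NB)
    except BlockingIOError:
        return True
    else:
        fcntl.flock(fd, fcntl.LOCK_UN)
        return False
    finally:
        os.close(fd)


# Hypothetical lock file in a scratch directory.
lock_file = os.path.join(tempfile.mkdtemp(), "orchard.lock")
with exclusive_lock(lock_file):
    print(is_locked(lock_file))  # True while the lock is held
print(is_locked(lock_file))      # False after release
```

Because the lock is advisory, it only protects shared resources if every process that touches them acquires the same lock file first.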

Supported Architectures:

Resolution	Architecture	Parameters	Use Case
28×28 / 224×224	ResNet-18	~11M	Multi-resolution baseline, transfer learning
28×28	MiniCNN	~94K	Fast prototyping, ablation studies
224×224	EfficientNet-B0	~4.0M	Efficient compound scaling
224×224	ConvNeXt-Tiny	~27.8M	Modern ConvNet design
224×224	ViT-Tiny	~5.5M	Patch-based attention, multiple weight variants

Hardware Requirements

CPU Training (28×28 Only)

Supported Resolution: 28×28 only
Time: ~2.5 hours (ResNet-18, 60 epochs, 16 cores)
Time: ~5-10 minutes (MiniCNN, 60 epochs, 16 cores)
Architectures: ResNet-18, MiniCNN
Use Case: Development, testing, limited hardware environments

GPU Training (All Resolutions)

28×28 Resolution:
- MiniCNN: ~2-3 minutes (60 epochs)
- ResNet-18: ~10-15 minutes (60 epochs)
224×224 Resolution:
- EfficientNet-B0: ~30 minutes per trial (15 epochs)
- ViT-Tiny: ~25-35 minutes per trial (15 epochs)
VRAM: 8GB recommended for 224×224 resolution
Architectures: ResNet-18, EfficientNet-B0, ConvNeXt-Tiny, ViT-Tiny

[!WARNING] 224×224 training on CPU is not recommended - it would take 10+ hours per trial. High-resolution training requires GPU acceleration. Only 28×28 resolution has been tested and validated for CPU training.

[!NOTE] Apple Silicon (MPS): The codebase includes MPS backend support (device detection, seeding, memory management), but it has not been tested on real hardware. If you encounter issues, please open an issue.

Representative Benchmarks (RTX 5070 Laptop GPU):

Task	Architecture	Resolution	Device	Time	Notes
Smoke Test	MiniCNN	28×28	CPU/GPU	<30s	1-epoch sanity check
Quick Training	MiniCNN	28×28	GPU	~2-3 min	60 epochs
Quick Training	MiniCNN	28×28	CPU (16 cores)	~30 min	60 epochs, CPU-validated
Transfer Learning	ResNet-18	28×28	GPU	~5 min	60 epochs
Transfer Learning	ResNet-18	28×28	CPU (16 cores)	~2.5h	60 epochs, CPU-validated
High-Res Training	EfficientNet-B0	224×224	GPU	~30 min/trial	15 epochs per trial, GPU required
High-Res Training	ViT-Tiny	224×224	GPU	~25-35 min/trial	15 epochs per trial, GPU required
Optimization Study	EfficientNet-B0	224×224	GPU	~2h	4 trials (early stop at AUC≥0.9999)
Optimization Study	Various	224×224	GPU	~1.5-5h	20 trials, highly variable

[!NOTE] Timing Variance: Optimization times depend heavily on early stopping criteria, pruning configuration, and dataset complexity:

- Early Stopping: Studies may finish in 1-3 hours if performance thresholds are met quickly (e.g., AUC ≥ 0.9999 after 4 trials)
- Full Exploration: Without early stopping, 20 trials can extend to 5+ hours
- Pruning Impact: Median pruning can save 30-50% of total time by terminating underperforming trials
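
The pruning rule mentioned above can be sketched in a few lines. This is a simplified illustration of the median rule, not Optuna's implementation: the function name `should_prune` and the sample AUC values are made up, and Optuna's `MedianPruner` additionally supports options such as `n_startup_trials` and `n_warmup_steps` that are omitted here.

```python
from statistics import median


def should_prune(current_value, previous_values_at_step, maximize=True):
    """Return True if the running trial is worse than the median of earlier trials at the same step."""
    if not previous_values_at_step:
        return False  # nothing to compare against yet
    threshold = median(previous_values_at_step)
    if maximize:
        return current_value < threshold
    return current_value > threshold


# Validation AUC of three earlier trials at the same epoch (made-up numbers).
history = [0.91, 0.95, 0.93]
print(should_prune(0.90, history))  # True: below the median of 0.93
print(should_prune(0.94, history))  # False: above the median
```

A trial that is stopped after a few epochs frees the remaining epochs of its budget, which is where the time savings come from.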

Quick Start

Step 1: Environment Setup

# Option A: Install from PyPI
pip install orchard-ml

# Option B: Install from source
git clone https://github.com/tomrussobuilds/orchard-ml.git
cd orchard-ml
pip install -e .

# With development tools (linting, testing, type checking)
pip install -e ".[dev]"

Step 2: Verify Installation (Optional)

# Run 1-epoch sanity check (~30 seconds, CPU/GPU)
# Downloads BloodMNIST 28×28 by default
python -m tests.smoke_test

# Note: You can skip this step - forge.py will auto-download datasets as needed

Step 3: Training Workflow

Orchard ML uses forge.py as the single entry point for all workflows. The pipeline behavior is controlled entirely by the YAML configuration:

Training only: Use a config_*.yaml file (no optuna: section)
Optimization + Training: Use an optuna_*.yaml file (has optuna: section)
With Export: Add an export: section to your config
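
The section-driven dispatch described above can be pictured as follows. This is a hypothetical sketch, not the logic inside forge.py: the function `plan_stages`, the stage names, and every config key other than `optuna` and `export` are assumptions, and the config is shown as an already-parsed dictionary.

```python
def plan_stages(config):
    """Derive the pipeline stages from the top-level sections present in a parsed config."""
    stages = []
    if "optuna" in config:
        stages.append("optimize")  # hyperparameter search runs first
    stages.append("train")         # training always runs
    if "export" in config:
        stages.append("export")    # export follows training
    return stages


print(plan_stages({"model": {}}))                              # ['train']
print(plan_stages({"model": {}, "optuna": {}}))                # ['optimize', 'train']
print(plan_stages({"model": {}, "optuna": {}, "export": {}}))  # ['optimize', 'train', 'export']
```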

Training Only (Quick start)

# 28×28 resolution (CPU-compatible)
python forge.py --config recipes/config_mini_cnn.yaml              # ~2-3 min GPU, ~10 min CPU
python forge.py --config recipes/config_resnet_18.yaml             # ~10-15 min GPU, ~2.5h CPU

# 224×224 resolution (GPU required)
python forge.py --config recipes/config_efficientnet_b0.yaml       # ~30 min GPU
python forge.py --config recipes/config_vit_tiny.yaml              # ~25-35 min GPU

What happens:

Dataset auto-downloaded to ./dataset/
Training runs for 60 epochs with early stopping
Results saved to timestamped directory in outputs/

Hyperparameter Optimization + Training (Full pipeline)

# 28×28 resolution - fast iteration
python forge.py --config recipes/optuna_mini_cnn.yaml              # ~5 min GPU, ~10 min CPU
python forge.py --config recipes/optuna_resnet_18.yaml             # ~15-20 min GPU

# 224×224 resolution - requires GPU
python forge.py --config recipes/optuna_efficientnet_b0.yaml       # ~1.5-5h*, GPU
python forge.py --config recipes/optuna_vit_tiny.yaml              # ~3-5h*, GPU

# *Time varies due to early stopping (may finish in 1-3h if target AUC reached)

What happens:

Optimization: Explores hyperparameter combinations with Optuna
Training: Full 60-epoch training with best hyperparameters found
Artifacts: Interactive plots, best_config.yaml, model weights

[!TIP] Model Search: Enable optuna.enable_model_search: true in your YAML config to let Optuna automatically explore all registered architectures for the target resolution. The optimizer will select the best model alongside the best hyperparameters.

View optimization results:

firefox outputs/*/figures/param_importances.html       # Which hyperparameters matter most
firefox outputs/*/figures/optimization_history.html    # Trial progression

Model Export (Production deployment)

All training configs (config_*.yaml) include ONNX export by default:

python forge.py --config recipes/config_efficientnet_b0.yaml
# → Training + ONNX export to outputs/*/exports/model.onnx

See the Export Guide for configuration options (format, quantization, validation).

Colab Notebooks

Try Orchard ML directly in Google Colab — no local setup required:

Notebook	Description	Runtime	Time
Quick Start: BloodMNIST CPU	MiniCNN training on BloodMNIST 28×28 — end-to-end training, evaluation, and ONNX export	CPU	~15 min
Optuna Model Search: Galaxy10 GPU	Automatic architecture search (EfficientNet-B0, ViT-Tiny, ConvNeXt-Tiny, ResNet-18) on Galaxy10 224×224 with Optuna	T4 GPU	~30-45 min

Experiment Management

Every run generates a complete artifact suite for total traceability. Both training-only and optimization workflows share the same RunPath orchestrator, producing BLAKE2b-hashed timestamped directories.
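
As a rough illustration of a BLAKE2b-hashed, timestamped directory name, the sketch below derives a short digest from a configuration dictionary using the standard `hashlib` module. The function `run_dir_name`, the name layout, the digest length, and the choice to hash a canonical JSON dump of the config are all assumptions; the actual RunPath scheme may differ.

```python
import hashlib
import json
from datetime import datetime, timezone


def run_dir_name(config, now=None, digest_bytes=4):
    """Build '<timestamp>_<blake2b digest>' from a config dictionary."""
    now = now or datetime.now(timezone.utc)
    # Sort keys so that logically identical configs hash to the same value.
    canonical = json.dumps(config, sort_keys=True, separators=(",", ":")).encode("utf-8")
    digest = hashlib.blake2b(canonical, digest_size=digest_bytes).hexdigest()
    return f"{now:%Y%m%d_%H%M%S}_{digest}"


stamp = datetime(2026, 2, 17, 12, 0, 0, tzinfo=timezone.utc)
a = run_dir_name({"model": "mini_cnn", "lr": 0.001}, now=stamp)
b = run_dir_name({"lr": 0.001, "model": "mini_cnn"}, now=stamp)
print(a == b)  # True: key order does not change the digest
```

Hashing the configuration ties the directory name to the exact settings of the run, so two runs with different settings started in the same second still get distinct directories.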

Browse Sample Artifacts — Excel reports, YAML configs, and diagnostic plots from real training runs. See the full artifact tree for the complete directory layout — logs, model weights, and HTML plots are generated locally and not tracked in the repo.

Browse Recipe Configs — Ready-to-use YAML configurations for every architecture and workflow. Copy the closest recipe, tweak the parameters, and run:

cp recipes/config_efficientnet_b0.yaml my_run.yaml
# edit hyperparameters, swap dataset/model, add or remove sections (optuna, export, tracking)
python forge.py --config my_run.yaml

Documentation

Guide	Covers
Framework Guide	System architecture diagrams, design principles, component deep-dives
Architecture Guide	Supported model architectures, weight transfer, grayscale adaptation, MixUp
Configuration Guide	Full parameter reference, usage patterns, adding new datasets
Optimization Guide	Optuna integration, search space config, pruning strategies, visualization
Docker Guide	Container build instructions, GPU-accelerated execution, reproducibility mode
Export Guide	ONNX export pipeline, quantization options, validation and benchmarking
Tracking Guide	MLflow local setup, dashboard and run comparison, programmatic querying
Artifact Guide	Output directory structure, training vs optimization artifact differences
Testing Guide	Test suite with 1,000+ tests, quality automation scripts, CI/CD pipeline details
`orchard/` / `tests/`	Internal package structure, module responsibilities, extension points

Citation

@software{orchardml2026,
  author = {Tommaso Russo},
  title  = {Orchard ML: Type-Safe Deep Learning Framework},
  year   = {2026},
  url    = {https://github.com/tomrussobuilds/orchard-ml},
  note   = {PyTorch framework with Pydantic configuration and Optuna optimization}
}

Roadmap

Additional Architectures: EfficientNet-V2, DeiT
Expanded Dataset Domains: Climate, remote sensing, microscopy
Multi-modal Support: Detection, segmentation hooks
Distributed Training: DDP, FSDP support for multi-GPU

License

MIT License - See LICENSE for details.

Contributing

Contributions welcome! Please:

Fork the repository
Create a feature branch
Add tests for new functionality
Ensure all tests pass: pytest tests/ -v
Submit a pull request

For detailed guidelines, see CONTRIBUTING.md.

Contact

For questions or collaboration: GitHub Issues

Release history

Version	Released
0.2.4	Mar 25, 2026
0.2.3	Mar 22, 2026
0.2.2	Mar 18, 2026
0.2.1	Mar 9, 2026
0.2.0	Mar 4, 2026
0.1.9	Mar 3, 2026
0.1.8	Feb 28, 2026
0.1.7	Feb 25, 2026
0.1.6	Feb 23, 2026
0.1.5	Feb 20, 2026
0.1.4	Feb 19, 2026
0.1.3	Feb 19, 2026
0.1.2	Feb 19, 2026
0.1.1	Feb 18, 2026
0.1.0 (this version)	Feb 17, 2026

Download files

Source Distribution

orchard_ml-0.1.0.tar.gz (142.6 kB)

Uploaded Feb 17, 2026 Source

Built Distribution

orchard_ml-0.1.0-py3-none-any.whl (177.1 kB)

Uploaded Feb 17, 2026 Python 3

File details

Details for the file orchard_ml-0.1.0.tar.gz.

File metadata

Download URL: orchard_ml-0.1.0.tar.gz
Upload date: Feb 17, 2026
Size: 142.6 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for orchard_ml-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`e8d15c71e83dbf09403e69765f6c91e75c76a2ddefbb1d8c7fc521e8aa38ce03`
MD5	`eb652f0f406faf94a584bfb1a5f492c8`
BLAKE2b-256	`6de92c429c1343a32f9fca8dfff6e1c0ce2b3fe2db8d2990d3fc36176e3e136b`
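
A downloaded archive can be checked against the digests in this table with Python's standard `hashlib` module. The sketch below is one way to do it; the helper name `file_digests` is hypothetical, and it assumes that the BLAKE2b-256 row corresponds to BLAKE2b with a 32-byte digest.

```python
import hashlib


def file_digests(path, chunk_size=1 << 20):
    """Compute SHA256, MD5, and BLAKE2b-256 hex digests of a file in a single pass."""
    hashers = {
        "SHA256": hashlib.sha256(),
        "MD5": hashlib.md5(),
        "BLAKE2b-256": hashlib.blake2b(digest_size=32),
    }
    with open(path, "rb") as fh:
        # Read in chunks so large archives do not need to fit in memory.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            for hasher in hashers.values():
                hasher.update(chunk)
    return {name: hasher.hexdigest() for name, hasher in hashers.items()}
```

For example, `file_digests("orchard_ml-0.1.0.tar.gz")["SHA256"]` should equal the SHA256 value listed above if the download is intact.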

Provenance

The following attestation bundles were made for orchard_ml-0.1.0.tar.gz:

Publisher: publish.yml on tomrussobuilds/orchard-ml

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: orchard_ml-0.1.0.tar.gz
- Subject digest: e8d15c71e83dbf09403e69765f6c91e75c76a2ddefbb1d8c7fc521e8aa38ce03
- Sigstore transparency entry: 957591445
- Sigstore integration time: Feb 17, 2026
Source repository:
- Permalink: tomrussobuilds/orchard-ml@f3c639a790b28d1f06c1a347211417e394445686
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/tomrussobuilds
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@f3c639a790b28d1f06c1a347211417e394445686
- Trigger Event: push

File details

Details for the file orchard_ml-0.1.0-py3-none-any.whl.

File metadata

Download URL: orchard_ml-0.1.0-py3-none-any.whl
Upload date: Feb 17, 2026
Size: 177.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for orchard_ml-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`82813518b7aeee892daef99573a98920f4ac097551356cc43b1c4d22c2dff42e`
MD5	`a7a9c611941378a11f2d7e63a3b539ba`
BLAKE2b-256	`a30764b7b16cd784199a1bcb9d67339ee84c95a8398ab2ce60e1691562814cb8`

Provenance

The following attestation bundles were made for orchard_ml-0.1.0-py3-none-any.whl:

Publisher: publish.yml on tomrussobuilds/orchard-ml

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: orchard_ml-0.1.0-py3-none-any.whl
- Subject digest: 82813518b7aeee892daef99573a98920f4ac097551356cc43b1c4d22c2dff42e
- Sigstore transparency entry: 957591464
- Sigstore integration time: Feb 17, 2026
Source repository:
- Permalink: tomrussobuilds/orchard-ml@f3c639a790b28d1f06c1a347211417e394445686
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/tomrussobuilds
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@f3c639a790b28d1f06c1a347211417e394445686
- Trigger Event: push

orchard-ml 0.1.0
