ophelian

Declarative ML framework: write a Pipeline once, run it anywhere — local Docker, AWS, GCP, Azure, or routed to the cheapest cloud.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

falvaluis

These details have not been verified by PyPI

Project description

Ophelian

Write your ML pipeline once. Run it anywhere. Pay the lowest GPU price on the market.

Ophelian is a small, opinionated Python framework for taking ML / AI prototypes to production without rewriting them every time the runtime changes. You declare a Pipeline of nodes (Data, Train, Tune, Eval, Deploy), pick an env, and Ophelian compiles + runs it — locally inside Docker, on AWS, GCP, or Azure, or routed automatically to the cheapest cloud for the GPU you need.

Hello, pipeline
Why Ophelian
Install
Architecture
Envs at a glance
Three demos in three minutes
Use cases
Observability
Compatibility
Documentation
Roadmap
Status & versioning
Community & support
Governance & maintainers
Security
Contributing
Citation
Acknowledgments
License

Hello, pipeline

from ophelian import Pipeline, Train, Auto

pipe = Pipeline([
    Train(
        model="meta-llama/Llama-3.2-1B",
        data="s3://my-bucket/dataset.jsonl",
        epochs=3,
    ),
])

# Picks the cheapest A100 across AWS / GCP / Azure right now.
pipe.run(env=Auto(cheapest_gpu="A100"))

That's it. The same source runs on your laptop, on EC2 spot, on a GCE preemptible VM, or on an Azure Spot VM — Ophelian handles checkpointing, artifact persistence (S3 / GCS / Azure Blob), structured logs, and a rich summary at the end.

Why Ophelian

	Ophelian	SageMaker	Vertex AI	Azure ML	Bare cloud SDKs
Single API across AWS + GCP + Azure	✅	❌	❌	❌	❌
Write pipeline once, run anywhere	✅	❌	❌	❌	❌
Auto-router that picks the cheapest GPU	✅	❌	❌	❌	❌
Spot / preemptible / Azure Spot resume	✅	partial	partial	partial	DIY
Local-first dev (Docker, no cloud auth)	✅	❌	❌	❌	❌
Structured JSON logs + run_id	✅	partial	partial	partial	DIY
Apache-2.0, OSS, no vendor SDK lock-in	✅	❌	❌	❌	mixed
`pip install` and go	✅	❌	❌	❌	partial

Install

pip install ophelian            # core, runs locally
pip install 'ophelian[aws]'     # + EC2 / S3
pip install 'ophelian[gcp]'     # + GCE / GCS
pip install 'ophelian[azure]'   # + Azure VM / Blob
pip install 'ophelian[all]'     # every extra

Python 3.11+.

Architecture

Ophelian draws a hard line between what a pipeline does and where it runs. Nodes are declarative. Envs pick a provider. Providers pick a driver. Drivers persist artifacts to a store. The cost router is the only component that crosses providers.

                 +--------------------------------------------+
                 |              Pipeline (DAG)                |
                 |  Data → Train → Tune → Eval → Deploy       |
                 +--------------------------------------------+
                                     |
                                     v
                 +--------------------------------------------+
                 |  Env  (Standalone | AWS | GCP | Azure |    |
                 |        Auto)                               |
                 +--------------------------------------------+
                                     |
                  +------------------+--------------------+
                  |                  |                    |
                  v                  v                    v
          +---------------+  +---------------+   +-----------------+
          |  Provider     |  |  Provider     |   |  Provider       |
          |  (AWS, GCP,   |  |  (Standalone) |   |  (chosen by     |
          |   Azure)      |  |               |   |   Auto router)  |
          +---------------+  +---------------+   +-----------------+
                  |
       +----------+----------+
       |                     |
       v                     v
  +---------+         +-------------+         +-------------------+
  | Driver  |  ...    |   Driver    |  <----  |   Cost router     |
  | (EC2,   |         |  (LocalEC2, |         |  (live spot +     |
  |  EKS,   |         |   FakeGCE…) |         |   retail prices,  |
  |  GCE,   |         |             |         |   24h disk cache) |
  |  AzureVM|         +-------------+         +-------------------+
  +---------+
       |
       v
  +-----------------------------------------------------+
  |  ArtifactStore  (Local | S3 | GCS | Azure Blob)     |
  |   put/get/exists/delete/list  — same Protocol       |
  +-----------------------------------------------------+

Every cloud code path has a Local*Driver mirror, so the test suite (and any contributor without cloud creds) exercises the full framework offline.

Envs at a glance

from ophelian import Standalone, AWS, GCP, Azure, Auto

Standalone(local=True)                                     # local Docker / in-process
AWS(region="us-east-1", instance="g5.xlarge", spot=True)   # EC2 / S3
GCP(project="my-proj", region="us-central1",
    machine_type="n1-standard-4", gpu_type="nvidia-tesla-t4",
    preemptible=True)                                      # GCE / GCS
Azure(subscription_id=..., resource_group="ml",
      region="eastus", vm_size="Standard_NC6s_v3",
      spot=True)                                           # Azure VM / Blob
Auto(cheapest_gpu="A100",
     regions=["us-east-1", "us-central1", "eastus"])       # cost router

The same Pipeline(...) runs on every one of them.

Three demos in three minutes

All three live under examples/ and default to Auto(cheapest_gpu=...). Set OPHELIAN_DRY_RUN=1 to print the chosen provider/region/price without spinning anything up.

python examples/llama_finetune.py    # Llama-3 fine-tune on the cheapest A100
python examples/resnet_train.py      # ResNet-50 on ImageNet, preemptible-friendly
python examples/xgboost_tabular.py   # XGBoost tabular, CPU only, sub-$0.05/run

Use cases

Scenario	Why Ophelian fits	Snippet
Cost-driven LLM fine-tuning — you want an A100 right now and don't care which cloud sells it cheapest	`Auto(cheapest_gpu=...)` queries live AWS spot, Azure retail, and GCP billing prices, picks the winner, and resumes from checkpoint if the spot is reclaimed	`pipe.run(env=Auto(cheapest_gpu="A100"))`
Local → cloud without rewrites — prototype on your laptop, ship the same `pipe` to production	Every cloud code path mirrors a `Local*Driver`, so the same `Pipeline` object runs in-process, in Docker, on EC2, on GCE, or on an Azure VM	`pipe.run(env=Standalone(local=True))` then `pipe.run(env=AWS(...))`
Multi-cloud failover for batch inference — your primary region runs out of GPU capacity at 3am	List acceptable regions in `Auto(...)`; the router falls through to the next cheapest provider with capacity, transparently	`Auto(cheapest_gpu="L4", regions=["us-east-1","us-central1","eastus"])`
Reproducible academic benchmarks — you need to publish numbers another lab can re-run	`run_id`-tagged structured logs, deterministic artifact layout, pinned price table, Apache-2.0 license, citable	`configure_logging(json=True)` + cite the version you used

Observability

from ophelian.observability import configure_logging
configure_logging(json=True)

Every step emits structured JSON with run_id, step, env, instance, duration_s, cost_estimate_usd. At the end you get a rich table summary you can paste into a Slack thread.

Compatibility

Combined matrix (Python × OS × provider). Cell values: full — gated by CI on every push · community — works, exercised by contributors but not gated by CI · planned — on the roadmap, not shipped yet · n/a — does not apply.

Runtime	Standalone	AWS (EC2 / EKS)	GCP (GCE)	Azure (VM)	Auto router
Linux (Ubuntu) · Python 3.11	full	full	full	full	full
Linux (Ubuntu) · Python 3.12	full	full	full	full	full
Linux (Ubuntu) · Python 3.13	full	full	full	full	full
macOS · Python 3.12	full	full	full	full	full
macOS · Python 3.11 / 3.13	community	community	community	community	community
Windows · any Python 3.11+	community	community	community	community	community

GKE (gcp_backend='gke') and AKS (azure_backend='aks') are planned for the post-1.0 roadmap and currently raise a clear NotImplementedError.

Third-party clouds plug in through the ophelian.envs entry-point group — see CONTRIBUTING.md.

Documentation

Full docs at https://ophelianio.github.io/ophelian/:

Quickstart
Concepts
Envs: AWS · GCP · Azure · Standalone · Auto
Cookbook · Troubleshooting

Roadmap

Tracked in CHANGELOG.md under ## [Unreleased] and in the GitHub project board. Headline items currently on deck:

GKE driver — production-grade Kubernetes backend for GCP (gcp_backend='gke' is reserved and raises a clear NotImplementedError today).
AKS driver — same, for Azure (azure_backend='aks').
Real-AWS smoke run in nightly CI — opt-in integration tests promoted from local-only to a scheduled workflow.
Pre-launch cost preview — print estimated $/run before any AWS pipeline actually provisions infrastructure.
Live pricing for more GPU classes — extend the router beyond the current T4 / L4 / V100 / A10G / A100 / H100 set.

Anything that breaks the public API requires a major version bump and a migration note — see Status & versioning.

Status & versioning

v1.0.0 — public API is stable. Ophelian follows Semantic Versioning 2.0:

MAJOR — breaking changes to the public surface re-exported from ophelian.__init__ or to documented env / provider kwargs.
MINOR — backwards-compatible additions (new envs, new node fields with safe defaults, new optional extras).
PATCH — bug fixes, documentation, dependency bumps, perf.

Every breaking change gets a migration note in CHANGELOG.md. Deprecated symbols stay importable with a DeprecationWarning for at least one minor release before removal.

Community & support

Pick the right channel:

You want to...	Go to
Ask a usage question	GitHub Discussions
Report a reproducible bug	Issues → Bug report
Request a feature	Issues → Feature request
Report a security vulnerability	See `SECURITY.md` — please do not file a public issue
Show what you built with it	Discussions → Show and tell

We follow the Contributor Covenant 2.1 in every space we maintain.

Governance & maintainers

Ophelian is currently maintained by @LuisFalva under a lightweight BDFL model: the maintainer has final say on architecture and API decisions, contributors propose changes via PRs, and every public-API change goes through a CHANGELOG-gated review.

A formal GOVERNANCE.md and MAINTAINERS.md are planned for the 1.x series as the contributor base grows. The intended trajectory is the standard meritocratic open-source model: contributors who make sustained, high-quality contributions are invited to become committers, and committers vote on new committers. PRs that move the project in that direction are welcome.

Security

Please do not report security vulnerabilities via public GitHub issues. We take security seriously and want a chance to ship a fix before the bug is public.

The full disclosure policy — supported versions, in-scope components, response timeline, safe-harbour, and credit process — lives in SECURITY.md. GitHub auto-detects that file and surfaces a "Report a vulnerability" button on the repo's Security tab.

How to report (in order of preference):

GitHub Private Vulnerability Reporting — open https://github.com/ophelianio/ophelian/security/advisories/new ("Report a vulnerability" button on the repo's Security tab). This is the recommended channel: end-to-end private, no email round-trip, and triaged automatically into a GitHub Security Advisory if accepted.
Direct contact with the maintainer — only if you do not have a GitHub account. Reach the maintainer @LuisFalva privately via the email on their public GitHub profile, with [ophelian-security] in the subject line. PVR is preferred for everyone else.

You can expect an acknowledgement within 5 business days and an initial triage within 10 business days — see SECURITY.md for the full timeline. Patched releases ship with a GitHub Security Advisory.

CI guardrails (defense in depth, not a substitute for reports):

CodeQL — deep static analysis on every push to main and on every PR. Manual re-runs available via Run workflow in the Actions tab.
pip-audit + bandit + gitleaks — run on every push to any branch and on every PR. Manual re-runs available via Run workflow in the Actions tab to surface newly-disclosed CVEs on a quiet branch.
GitHub Actions pinned to commit SHAs with Dependabot watching for supply-chain regressions, and an action-pin-check CI job that fails the build if any workflow re-introduces a mutable @v4-style reference.

Contributing

PRs welcome. See CONTRIBUTING.md for dev setup (uv-based), the test slicing strategy, the third-party env plug-in contract, and the pricing-table refresh process. The bar for new public API is "this makes pipelines more portable, more honest, or more pleasant to use across every supported env."

Found a security issue? Please follow the disclosure policy in SECURITY.md instead of opening a public issue.

By participating you agree to abide by the Code of Conduct.

Citation

If Ophelian shows up in research output (papers, theses, technical reports), please cite the specific version you used:

@software{ophelian_2026,
  author    = {Falva, Luis and the Ophelian contributors},
  title     = {{Ophelian: a declarative, multi-cloud ML pipeline framework}},
  year      = {2026},
  version   = {1.0.0},
  license   = {Apache-2.0},
  url       = {https://github.com/ophelianio/ophelian},
  howpublished = {PyPI: \url{https://pypi.org/project/ophelian/}},
  publisher = {GitHub},
}

Both URLs matter: the GitHub repo is the canonical source and issue tracker, and the PyPI page is the immutable artifact archive that reproducibility tooling resolves against. For other versions, swap version and check the matching tag at https://github.com/ophelianio/ophelian/releases (and the matching release on https://pypi.org/project/ophelian/#history).

Acknowledgments

Ophelian stands on a lot of upstream work and would not exist without the ecosystems behind:

HuggingFace Transformers and PyTorch — model adapters and the training stack that the framework wraps.
scikit-learn and XGBoost — the tabular adapters that keep the framework honest for non-LLM workloads.
boto3 / google-cloud- / azure-sdk-for-python* — the cloud SDKs that make the multi-cloud surface possible.
pydantic v2, typer, rich, FastAPI, uv, hatchling, mypy, ruff — the small Python-tooling stack that the codebase is built on.

And to every contributor who has filed an issue, opened a PR, refreshed the price table, or kicked the tyres on a real cloud: thank you. Run git shortlog -sn --no-merges for the full list.

Changes under .github/ (CI workflows, dependabot.yml, CODEOWNERS) are owned by the release maintainers — see .github/CODEOWNERS. Those PRs auto-request a maintainer review and should not be self-merged. See CONTRIBUTING → Changes to .github/ for what to call out in the PR description.

Maintainer setup: branch protection on `main` (one-time, manual)

CODEOWNERS only requests reviews by default. To make them required — i.e. to actually block merges of workflow / Dependabot changes that haven't been approved by a maintainer — a repo admin needs to enable it in GitHub's UI; this can't be expressed in-repo.

In Settings → Branches → Branch protection rules for main:

Require a pull request before merging — on
Require approvals — at least 1
Require review from Code Owners — on (this is the key toggle; without it, the .github/CODEOWNERS entries are advisory only)
Dismiss stale pull request approvals when new commits are pushed — on (so a force-push to a workflow PR re-triggers maintainer review)
Do not allow bypassing the above settings — on, including for admins, so the policy can't be silently sidestepped

Once that's set, any PR touching .github/workflows/, .github/dependabot.yml, or .github/CODEOWNERS will be blocked from merging until a code owner approves it.

License

Licensed under the Apache License, Version 2.0 — see LICENSE for the full text. Attributions for the third-party components Ophelian declares as runtime / optional dependencies (PyTorch, HuggingFace Transformers, scikit-learn, XGBoost, boto3 / botocore, the google-cloud-* SDKs, the azure-* SDKs, pydantic, FastAPI, OpenTelemetry, etc.) are enumerated in NOTICE, which ships in both the sdist and the wheel on PyPI.

You may not use this project except in compliance with the License. Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

falvaluis

These details have not been verified by PyPI

Release history Release notifications | RSS feed

1.1.1

May 13, 2026

1.1.0

May 6, 2026

1.0.1

May 5, 2026

This version

1.0.0

May 5, 2026

0.1.4

Jul 23, 2024

0.1.3

Jul 23, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ophelian-1.0.0.tar.gz (526.2 kB view details)

Uploaded May 5, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ophelian-1.0.0-py3-none-any.whl (185.9 kB view details)

Uploaded May 5, 2026 Python 3

File details

Details for the file ophelian-1.0.0.tar.gz.

File metadata

Download URL: ophelian-1.0.0.tar.gz
Upload date: May 5, 2026
Size: 526.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for ophelian-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`e5ce3325ab1e8010975f03e79ea20c4e53d15993e0aaae13ea40140201c377c8`
MD5	`026fd6ef59fe00ae062b88cd5f5c4e9e`
BLAKE2b-256	`6df86c77e9f40f1abd7b58f6c64693462a9244c2639e9045e698f9a4653d3b8a`

See more details on using hashes here.

Provenance

The following attestation bundles were made for ophelian-1.0.0.tar.gz:

Publisher: release.yml on ophelianio/ophelian

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: ophelian-1.0.0.tar.gz
- Subject digest: e5ce3325ab1e8010975f03e79ea20c4e53d15993e0aaae13ea40140201c377c8
- Sigstore transparency entry: 1439116017
- Sigstore integration time: May 5, 2026
Source repository:
- Permalink: ophelianio/ophelian@e4b1a79bdc122b77753bd63dac33c8e3c6ab871a
- Branch / Tag: refs/tags/v1.0.0
- Owner: https://github.com/ophelianio
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@e4b1a79bdc122b77753bd63dac33c8e3c6ab871a
- Trigger Event: push

File details

Details for the file ophelian-1.0.0-py3-none-any.whl.

File metadata

Download URL: ophelian-1.0.0-py3-none-any.whl
Upload date: May 5, 2026
Size: 185.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for ophelian-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`78a107064d9678bfa4dbfdad5a50384a5572900e345096c70c40ea22c9ffcf32`
MD5	`d3a257fbcf59b16107de797fd453b3ca`
BLAKE2b-256	`3a1249821b8a8488e9b3e3a6c815cf041ab3ddd516f88508f2e0e3c568353db3`

See more details on using hashes here.

Provenance

The following attestation bundles were made for ophelian-1.0.0-py3-none-any.whl:

Publisher: release.yml on ophelianio/ophelian

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: ophelian-1.0.0-py3-none-any.whl
- Subject digest: 78a107064d9678bfa4dbfdad5a50384a5572900e345096c70c40ea22c9ffcf32
- Sigstore transparency entry: 1439116028
- Sigstore integration time: May 5, 2026
Source repository:
- Permalink: ophelianio/ophelian@e4b1a79bdc122b77753bd63dac33c8e3c6ab871a
- Branch / Tag: refs/tags/v1.0.0
- Owner: https://github.com/ophelianio
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@e4b1a79bdc122b77753bd63dac33c8e3c6ab871a
- Trigger Event: push

ophelian 1.0.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Ophelian

Table of contents

Hello, pipeline

Why Ophelian

Install

Architecture

Envs at a glance

Three demos in three minutes

Use cases

Observability

Compatibility

Documentation

Roadmap

Status & versioning

Community & support

Governance & maintainers

Security

Contributing

Citation

Acknowledgments

Maintainer setup: branch protection on main (one-time, manual)

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

Maintainer setup: branch protection on `main` (one-time, manual)