
Trainything

A visual graph editor for composing and running multi-stage ML training pipelines.


Trainything lets you compose training pipelines visually: drop a dataset, drop a model, wire them together, configure, hit Start. Watch it train. Drag the checkpoint into the next stage. The goal is not to beat a hand-tuned Python script -- it's to make the entire multi-stage process legible, explorable, and fast to iterate on.

The primary audience is people learning how modern ML training works: how vision encoders connect to language models, how vision-language-action (VLA) models consume demonstration data, and how RL rollouts feed back into training. If you can see the graph, you can build intuition.

Graph Model

The graph is bipartite (Petri-net style): artifact nodes (ovals) hold passive data at rest, while process nodes (rectangles) run active computation. Every data edge crosses the artifact/process boundary -- never artifact to artifact or process to process.

A second edge type -- module edges -- wires process nodes together without implying execution order. For example, a Score node wired to a Train node provides a scoring function that Train calls internally during its loop.
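
To make the invariant concrete, here is a minimal sketch of a validator for it -- illustrative only, not Trainything's actual validation code:

# Minimal sketch (not Trainything's actual validator) of the bipartite
# invariant: every data edge must connect an artifact to a process.
def check_data_edges(kinds, data_edges):
    """kinds maps node id -> "artifact" or "process"."""
    for src, dst in data_edges:
        if kinds[src] == kinds[dst]:
            raise ValueError(f"data edge {src} -> {dst} joins two {kinds[src]} nodes")

# Module edges are exempt: they deliberately join process to process,
# e.g. a Score node plugged into a Train node's loop.
check_data_edges({"cifar10": "artifact", "train": "process"},
                 [("cifar10", "train")])  # passes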

[Graph screenshot]

Node types

Artifacts
  • Dataset
  • Model
  • Environment
  • Metrics

Processes
  • Train (supervised, SFT, behavior cloning, PPO, GRPO, GRPO-text, DQN, diffusion)
  • Evaluate (classify, generate text, generate images)
  • Transform (filter, split, map, tokenize, augment)
  • Score (episode reward, task success, format/correctness reward)

What you can build

Supervised learning — Image classification, regression, fine-tuning pretrained vision models. CIFAR-10 → ResNet-18 → Train (supervised) → Evaluate (see the graph-file sketch after this list)

LLM pre-training — Train a GPT-2 from scratch on text data with next-token prediction. Tiny Shakespeare → Tokenize → GPT-2 (from config) → Train (SFT) → Generate

LLM fine-tuning — SFT or LoRA on instruction/chat datasets. Dataset → Model (LoRA) → Train (SFT) → Evaluate (generate)

R1-style reasoning — Teach a language model to reason step-by-step using GRPO with correctness and format rewards. GSM8K → GPT-2 → Train (GRPO-text) → Generate Answers

Image diffusion — Train a DDPM denoising model from scratch, then generate samples. CIFAR-10 → UNet2D → Train (diffusion) → Evaluate (generate_images)

Image generation — Run inference with pretrained diffusion pipelines (Stable Diffusion, etc.). Model (diffusion) → Evaluate (generate_images) → Generated Images

RL control — DQN, PPO, or GRPO on gymnasium environments (CartPole, LunarLander, Atari, MuJoCo). Environment → Score → Train (PPO/DQN) ← Model (MLP/CNN)

Multi-stage RL — Behavior cloning → GRPO → PPO, with checkpoint warmstarting between stages. Demos → BC → GRPO warmstart → PPO polish

Imitation learning — Clone expert demonstrations into a policy network. Dataset (demos) → Model (MLP) → Train (behavior_cloning) → Evaluate

Robot learning — Evaluate vision-language-action models (LeRobot/SmolVLA) on manipulation tasks. Model (lerobot) → Evaluate → Metrics
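
To ground these recipes, here is what the supervised-learning pipeline above could look like as a graph file, written as a Python dict mirroring the JSON that trainything run consumes. The field names here are assumptions for illustration; the real schema is whatever the shipped examples/*.json files use.

# Hypothetical graph spec for the supervised-learning recipe above.
# Field names are illustrative; see examples/*.json for the real format.
graph = {
    "nodes": [
        {"id": "data",  "kind": "artifact", "type": "Dataset",  "config": {"name": "cifar10"}},
        {"id": "model", "kind": "artifact", "type": "Model",    "config": {"arch": "resnet18"}},
        {"id": "train", "kind": "process",  "type": "Train",    "config": {"mode": "supervised"}},
        {"id": "ckpt",  "kind": "artifact", "type": "Model"},
        {"id": "eval",  "kind": "process",  "type": "Evaluate", "config": {"mode": "classify"}},
    ],
    "data_edges": [               # every edge crosses artifact <-> process
        ["data", "train"], ["model", "train"],   # inputs into Train
        ["train", "ckpt"],                       # Train emits a checkpoint
        ["ckpt", "eval"], ["data", "eval"],      # Evaluate consumes both
    ],
}

The multi-stage recipes are the same shape repeated: each Train stage emits a Model artifact that the next stage consumes as its warmstart input.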

Getting Started

Requires Docker or Podman.

make build   # build the container image (CUDA PyTorch by default)
make run     # auto-detects GPU (NVIDIA, AMD, or CPU-only)

Open http://localhost:8000 in your browser.

For AMD ROCm or CPU-only PyTorch, rebuild with a different torch index:

make build TORCH_INDEX=https://download.pytorch.org/whl/rocm6.2   # AMD
make build TORCH_INDEX=https://download.pytorch.org/whl/cpu        # CPU

Install from PyPI

pip install trainything
trainything serve

To install with all optional environments (Atari, retro, diffusers, LeRobot, etc.):

pip install "trainything[all]"

For NVIDIA GPU support, install PyTorch with CUDA first:

pip install torch torchvision --index-url https://download.pytorch.org/whl/cu124
pip install trainything

On macOS (Apple Silicon), PyTorch automatically uses the MPS backend -- no extra steps needed.
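
To confirm which accelerator your PyTorch install actually picked up, a quick check from a Python shell:

import torch

# Report the accelerator this PyTorch build can see.
if torch.cuda.is_available():
    print("CUDA:", torch.cuda.get_device_name(0))   # NVIDIA (or ROCm builds)
elif torch.backends.mps.is_available():
    print("MPS (Apple Silicon)")
else:
    print("CPU only")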

Development setup

pip install -e ".[dev]"
cd frontend && npm ci && npm run build && cd ..
trainything serve --dev

Headless execution

trainything run examples/mnist_train.json
trainything run --dry-run examples/lunar_lander_ppo.json
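
Since --dry-run validates a graph without training it (as the name suggests), one handy pattern is to smoke-test every shipped example, e.g. in CI -- a sketch:

import pathlib
import subprocess

# Dry-run every example graph; fails loudly on the first invalid one.
for graph in sorted(pathlib.Path("examples").glob("*.json")):
    subprocess.run(["trainything", "run", "--dry-run", str(graph)], check=True)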

Architecture

  • Frontend: React + React Flow for the node editor. Side panel for node configuration, bottom panel for live logs and metrics.
  • Backend: Python + FastAPI. REST for graph CRUD, WebSocket for streaming progress events.
  • Execution: PyTorch, run in-process via asyncio.to_thread so blocking training code never stalls the event loop.
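
To make the execution model concrete, here is an illustrative sketch (the class and method names are assumptions, not Trainything's internals) of how a blocking execute() gets dispatched off the event loop:

import asyncio

# Illustrative process node: execute() is ordinary blocking PyTorch-style
# code, the kind a user would write by hand in a script.
class TrainNode:
    def execute(self) -> str:
        # ... run the training loop, save a checkpoint ...
        return "checkpoints/run-001.pt"  # hypothetical output path

async def run_node(node: TrainNode) -> str:
    # asyncio.to_thread moves the blocking call onto a worker thread,
    # leaving the FastAPI event loop free to stream progress events.
    return await asyncio.to_thread(node.execute)

# e.g. inside a request handler: checkpoint = await run_node(TrainNode())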

Why a Python backend

The entire ecosystem this tool orchestrates is Python: PyTorch, HuggingFace transformers, datasets, tokenizers, schedulers. A Rust backend would mean FFI or subprocess management for every node, and would break the key educational feature: "View Code." Each node's execute() method should be the same code a user would write by hand.

A Rust inference node (e.g. via Meganeura) can exist as a node type that calls out to a Rust binary -- the backend doesn't prevent this.

Acknowledgments

Inspired by ComfyUI's node-based approach to ML pipelines, Kedro's data-centric pipeline model, and the general frustration of managing multi-stage training with shell scripts and notebooks.
