Ato: A Tiny Orchestrator
A frameworkless integration layer for Python and ML pipelines
Ato doesn't compete with frameworks — it restores freedom between them.
Every framework tries to own your workflow. Ato doesn't. It just gives you the handles to connect them.
Unlike Hydra, MLflow, or W&B — which build ecosystems — Ato provides structural boundaries that let you compose, track, and optimize without surrendering control to any single system.
You can:
- Chain configs from multiple sources (Hydra YAML, OpenMMLab, raw Python)
- Merge them with explicit priority control — and debug the exact merge order
- Track experiments in SQLite with zero setup — or sync to MLflow/W&B when you need dashboards
- Optimize hyperparameters with built-in Hyperband — or use Optuna/Ray Tune alongside
Ato keeps everything transparent and Pythonic. No magic. No vendor lock-in. Just composable pieces you control.
Design Philosophy
Ato was designed from a simple realization: frameworks solve composition, but they also create coupling.
When I built Ato, I didn't know Hydra existed. But I did know I wanted something that wouldn't own my workflow — something that could compose with whatever came next.
That constraint led to three design principles:
- Structural neutrality — Ato has no opinion on your stack. It's a layer, not a platform.
- Explicit boundaries — Each module (ADict, Scope, SQLTracker, HyperOpt) is independent. Use one, use all, or mix with other tools. No forced dependencies.
- Debuggable composition — When configs merge from 5 sources, you should see why a value was set. Ato's `manual` command shows the exact merge order — a feature no other tool has.
This isn't minimalism for its own sake. It's structural restraint — interfering only where necessary, and staying out of the way everywhere else.
Why Ato?
Ato solves a problem that frameworks create: workflow ownership.
| Framework Approach | Ato's Approach |
|---|---|
| Hydra owns config composition | Ato composes Hydra configs + raw Python + CLI args |
| MLflow owns experiment tracking | Ato tracks locally in SQLite, or syncs to MLflow |
| W&B owns hyperparameter search | Ato provides Hyperband, or you use Optuna/Ray |
| Each framework wants to be the system | Ato is a layer you control |
Ato is for teams who want:
- Config flexibility without framework lock-in
- Experiment tracking without mandatory cloud platforms
- Hyperparameter optimization without opaque black boxes
- The ability to change tools without rewriting pipelines
What Makes Ato Structurally Different
These aren't features — they're architectural decisions that frameworks can't replicate without breaking their own abstractions:
| Capability | Why Frameworks Can't Do This | What It Enables |
|---|---|---|
| MultiScope (namespace isolation) | Frameworks use global config namespaces | Multiple teams can own separate config scopes without key collisions. No `model_lr` vs `data_lr` prefixing needed. |
| `manual` command (merge order debugging) | Frameworks show final configs, not merge logic | See why a value was set — trace exact merge order across defaults, named configs, CLI args, and lazy evaluation. |
| Structural hashing | Frameworks track values, not structure | Detect when experiment architecture changes (not just hyperparameters). Critical for reproducibility. |
| Offline-first tracking | Frameworks assume centralized platforms | Zero-setup SQLite tracking. No servers, no auth, no vendor lock-in. Sync to MLflow/W&B only when needed. |
Developer Experience
- Zero boilerplate — Auto-nested configs (`cfg.model.backbone.depth = 50` just works), lazy evaluation, attribute access
- CLI-first — Override any config from the command line without touching code: `python train.py model.backbone=%resnet101%`
- Framework agnostic — Works with PyTorch, TensorFlow, JAX, or pure Python. No framework-specific decorators or magic.
Quick Start
```bash
pip install ato
```
30-Second Example
```python
from ato.scope import Scope

scope = Scope()

@scope.observe(default=True)
def config(cfg):
    cfg.lr = 0.001
    cfg.batch_size = 32
    cfg.model = 'resnet50'

@scope
def train(cfg):
    print(f"Training {cfg.model} with lr={cfg.lr}")
    # Your training code here

if __name__ == '__main__':
    train()  # python train.py
    # Override from CLI: python train.py lr=0.01 model=%resnet101%
```
Table of Contents
- ADict: Enhanced Dictionary
- Scope: Configuration Management
- MultiScope: Namespace Isolation ⭐ Unique to Ato
- Config Documentation & Debugging ⭐ Unique to Ato
- SQL Tracker: Experiment Tracking
- Hyperparameter Optimization
- Best Practices
- Roadmap
- Working with Existing Tools
ADict: Enhanced Dictionary
ADict is an enhanced dictionary designed for managing experiment configurations. It combines the simplicity of Python dictionaries with powerful features for ML workflows.
Core Features
These are the fundamental capabilities that make ADict powerful for experiment management:
| Feature | Description | Why It Matters |
|---|---|---|
| Structural Hashing | Hash based on keys + types, not values | Track when experiment structure changes |
| Nested Access | Dot notation for nested configs | config.model.lr instead of config['model']['lr'] |
| Format Agnostic | Load/save JSON, YAML, TOML, XYZ | Work with any config format |
| Safe Updates | `update_if_absent()` method | Prevent accidental overwrites |
Developer Convenience Features
These utilities maximize developer productivity and reduce boilerplate:
| Feature | Description | Benefit |
|---|---|---|
| Auto-nested (`ADict.auto()`) | Infinite-depth lazy creation | `config.a.b.c = 1` just works - no KeyError |
| Attribute-style Assignment | `config.lr = 0.1` | Cleaner, more readable code |
| Conditional Updates | Only update missing keys | Merge configs safely |
Quick Examples
```python
from ato.adict import ADict

# Structural hashing - track config structure changes
config1 = ADict(lr=0.1, epochs=100, model='resnet50')
config2 = ADict(lr=0.01, epochs=200, model='resnet101')
print(config1.get_structural_hash() == config2.get_structural_hash())  # True

config3 = ADict(lr=0.1, epochs='100', model='resnet50')  # epochs is str!
print(config1.get_structural_hash() == config3.get_structural_hash())  # False

# Load/save any format
config = ADict.from_file('config.json')
config.dump('config.yaml')

# Safe updates
config.update_if_absent(lr=0.01, scheduler='cosine')  # Only adds scheduler
```
Convenience Features in Detail
Auto-nested: Zero Boilerplate Config Building
The most loved feature - no more manual nesting:
```python
# ❌ Traditional way
config = ADict()
config.model = ADict()
config.model.backbone = ADict()
config.model.backbone.layers = [64, 128, 256]

# ✅ With ADict.auto()
config = ADict.auto()
config.model.backbone.layers = [64, 128, 256]  # Just works!
config.data.augmentation.brightness = 0.2
```
Perfect for Scope integration:
```python
from ato.scope import Scope

scope = Scope()

@scope.observe(default=True)
def config(cfg):
    # No pre-definition needed!
    cfg.training.optimizer.name = 'AdamW'
    cfg.training.optimizer.lr = 0.001
    cfg.model.encoder.num_layers = 12
```
Works with CLI:
```bash
python train.py model.backbone.resnet.depth=50 data.batch_size=32
```
More Convenience Utilities
```python
# Attribute-style access
config.lr = 0.1
print(config.lr)  # Instead of config['lr']

# Nested access
print(config.model.backbone.type)  # Clean and readable

# Conditional updates - merge configs safely
base_config.update_if_absent(**experiment_config)
```
Scope: Configuration Management
Scope solves configuration complexity through priority-based merging and CLI integration. No more scattered config files or hard-coded parameters.
Key Concepts
```
Default Configs (priority=0)
        ↓
Named Configs (priority=0+)
        ↓
CLI Arguments (highest priority)
        ↓
Lazy Configs (computed after CLI)
```
Basic Usage
Simple Configuration
```python
from ato.scope import Scope

scope = Scope()

@scope.observe()
def my_config(config):
    config.dataset = 'cifar10'
    config.lr = 0.001
    config.batch_size = 32

@scope
def train(config):
    print(f"Training on {config.dataset}")
    # Your code here

if __name__ == '__main__':
    train()
```
Priority-based Merging
```python
@scope.observe(default=True)  # Always applied
def defaults(cfg):
    cfg.lr = 0.001
    cfg.epochs = 100

@scope.observe(priority=1)  # Applied after defaults
def high_lr(cfg):
    cfg.lr = 0.01

@scope.observe(priority=2)  # Applied last
def long_training(cfg):
    cfg.epochs = 300
```

```bash
python train.py                         # lr=0.001, epochs=100
python train.py high_lr                 # lr=0.01,  epochs=100
python train.py high_lr long_training   # lr=0.01,  epochs=300
```
CLI Configuration
Override any parameter from command line:
```bash
# Simple values
python train.py lr=0.01 batch_size=64

# Nested configs
python train.py model.backbone=%resnet101% model.depth=101

# Lists and complex types
python train.py layers=[64,128,256,512] dropout=0.5

# Combine with named configs
python train.py my_config lr=0.001 batch_size=128
```
Note: Wrap strings with `%` (e.g., `%resnet101%`) instead of quotes.
Advanced Features
1. Lazy Evaluation - Dynamic Configuration
Sometimes you need configs that depend on other values set via CLI:
```python
@scope.observe()
def base_config(cfg):
    cfg.model = 'resnet50'
    cfg.dataset = 'imagenet'

@scope.observe(lazy=True)  # Evaluated AFTER CLI args
def computed_config(cfg):
    # Adjust based on dataset
    if cfg.dataset == 'imagenet':
        cfg.num_classes = 1000
        cfg.image_size = 224
    elif cfg.dataset == 'cifar10':
        cfg.num_classes = 10
        cfg.image_size = 32
```

```bash
python train.py dataset=%cifar10% computed_config
# Results in: num_classes=10, image_size=32
```
Python 3.11+ Context Manager:
```python
@scope.observe()
def my_config(cfg):
    cfg.model = 'resnet50'
    cfg.num_layers = 50
    with Scope.lazy():  # Evaluated after CLI
        if cfg.model == 'resnet101':
            cfg.num_layers = 101
```
2. MultiScope - Multiple Configuration Contexts
Unique to Ato: Manage completely separate configuration namespaces. Unlike Hydra's config groups, MultiScope provides true namespace isolation with independent priority systems.
Why MultiScope?
| Challenge | Hydra's Approach | Ato's MultiScope |
|---|---|---|
| Separate model/data configs | Config groups in one namespace | Independent scopes with own priorities |
| Avoid key collisions | Manual prefixing (`model.lr`, `train.lr`) | Automatic namespace isolation |
| Different teams/modules | Single config file | Each scope can be owned separately |
| Priority conflicts | Global priority system | Per-scope priority system |
Basic Usage
```python
from ato.scope import Scope, MultiScope

model_scope = Scope(name='model')
data_scope = Scope(name='data')
scope = MultiScope(model_scope, data_scope)

@model_scope.observe(default=True)
def model_config(model):
    model.backbone = 'resnet50'
    model.pretrained = True

@data_scope.observe(default=True)
def data_config(data):
    data.dataset = 'cifar10'
    data.batch_size = 32

@scope
def train(model, data):  # Named parameters match scope names
    print(f"Training {model.backbone} on {data.dataset}")
```
Real-world: Team Collaboration
Different team members can own different scopes without conflicts:
```python
# team_model.py - ML team owns this
from ato.scope import Scope

model_scope = Scope(name='model')

@model_scope.observe(default=True)
def resnet_default(model):
    model.backbone = 'resnet50'
    model.lr = 0.1  # Model-specific learning rate

@model_scope.observe(priority=1)
def resnet101(model):
    model.backbone = 'resnet101'
    model.lr = 0.05  # Different lr for bigger model
```

```python
# team_data.py - Data team owns this
from ato.scope import Scope

data_scope = Scope(name='data')

@data_scope.observe(default=True)
def cifar_default(data):
    data.dataset = 'cifar10'
    data.lr = 0.001  # Data augmentation learning rate (no conflict!)

@data_scope.observe(priority=1)
def imagenet(data):
    data.dataset = 'imagenet'
    data.workers = 16
```

```python
# train.py - Integration point
from ato.scope import MultiScope
from team_model import model_scope
from team_data import data_scope

scope = MultiScope(model_scope, data_scope)

@scope
def train(model, data):
    # Both have 'lr' but in separate namespaces!
    print(f"Model LR: {model.lr}, Data LR: {data.lr}")
```
Key advantage: model.lr and data.lr are completely independent. No need for naming conventions like model_lr vs data_lr.
CLI with MultiScope
Override each scope independently:
```bash
# Override model scope only
python train.py model.backbone=%resnet101%

# Override data scope only
python train.py data.dataset=%imagenet%

# Override both
python train.py model.backbone=%resnet101% data.dataset=%imagenet%

# Call named configs per scope
python train.py resnet101 imagenet
```
3. Import/Export Configs
Ato supports importing configs from multiple frameworks:
```python
@scope.observe()
def load_external(config):
    # Load from any format
    config.load('experiments/baseline.json')
    config.load('models/resnet.yaml')

    # Export to any format
    config.dump('output/final_config.toml')

    # Import OpenMMLab configs - handles _base_ inheritance automatically
    config.load_mm_config('mmdet_configs/faster_rcnn.py')
```
OpenMMLab compatibility is built-in:
- Automatically resolves `_base_` inheritance chains
- Supports `_delete_` keys for config overriding
- Makes migration from MMDetection/MMSegmentation/etc. seamless
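For context, here is a minimal sketch of the OpenMMLab `_base_` convention that `load_mm_config` resolves. The file names and values are hypothetical, purely to illustrate the inheritance chain:

```python
# mmdet_configs/_base_/faster_rcnn_base.py (hypothetical base file)
model = dict(type='FasterRCNN', backbone=dict(type='ResNet', depth=50))

# mmdet_configs/faster_rcnn.py (hypothetical child file)
_base_ = ['./_base_/faster_rcnn_base.py']  # inheritance chain, resolved on load
model = dict(backbone=dict(depth=101))     # deep-merged: only depth is overridden
```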
Hydra-style config composition is also built-in via `compose_hierarchy`:
```python
from ato.adict import ADict

# Hydra-style directory structure:
# configs/
# ├── config.yaml          # base config
# ├── model/
# │   ├── resnet50.yaml
# │   └── resnet101.yaml
# └── data/
#     ├── cifar10.yaml
#     └── imagenet.yaml

config = ADict.compose_hierarchy(
    root='configs',
    config_filename='config',
    select={
        'model': 'resnet50',  # or ['resnet50', 'resnet101'] for multiple
        'data': 'imagenet'
    },
    overrides={
        'model.lr': 0.01,
        'data.batch_size': 64
    },
    required=['model.backbone', 'data.dataset'],  # Validation
    on_missing='warn'  # or 'error'
)
```
Key features:
- Config groups (model/, data/, optimizer/, etc.)
- Automatic file discovery (tries .yaml, .json, .toml, .xyz)
- Dotted overrides (`model.lr=0.01`)
- Required key validation
- Flexible error handling
4. Argparse Integration
Mix Ato with existing argparse code:
```python
from ato.scope import Scope
import argparse

scope = Scope(use_external_parser=True)
parser = argparse.ArgumentParser()
parser.add_argument('--gpu', type=int, default=0)
parser.add_argument('--seed', type=int, default=42)

@scope.observe(default=True)
def config(cfg):
    cfg.lr = 0.001
    cfg.batch_size = 32

@scope
def train(cfg):
    print(f"GPU: {cfg.gpu}, LR: {cfg.lr}")

if __name__ == '__main__':
    parser.parse_args()  # Merges argparse with scope
    train()
```
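A plausible invocation mixing both styles, assuming (as the merge above suggests) that argparse flags and Ato-style overrides can be passed together; the values are hypothetical:

```bash
python train.py --gpu 1 --seed 7 lr=0.01 batch_size=64
```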
5. Configuration Documentation & Inspection
One of Ato's most powerful features: Auto-generate documentation AND visualize the exact order of configuration application.
Basic Documentation
```python
@scope.manual
def config_docs(cfg):
    cfg.lr = 'Learning rate for optimizer'
    cfg.batch_size = 'Number of samples per batch'
    cfg.model = 'Model architecture (resnet50, resnet101, etc.)'
```

```bash
python train.py manual
```
Output:
```
--------------------------------------------------
[Scope "config"]
(The Applying Order of Views)
defaults → (CLI Inputs) → lazy_config → main

(User Manuals)
config.lr: Learning rate for optimizer
config.batch_size: Number of samples per batch
config.model: Model architecture (resnet50, resnet101, etc.)
--------------------------------------------------
```
Why This Matters
The applying order visualization shows you exactly how your configs are merged:
- Which config functions are applied (in order)
- When CLI inputs override values
- Where lazy configs are evaluated
- The final function that uses the config
This prevents configuration bugs by making the merge order explicit and debuggable.
MultiScope Documentation
For complex projects with multiple scopes, manual shows each scope separately:
```python
from ato.scope import Scope, MultiScope

model_scope = Scope(name='model')
train_scope = Scope(name='train')
scope = MultiScope(model_scope, train_scope)

@model_scope.observe(default=True)
def model_defaults(model):
    model.backbone = 'resnet50'
    model.num_layers = 50

@model_scope.observe(priority=1)
def model_advanced(model):
    model.pretrained = True

@model_scope.observe(lazy=True)
def model_lazy(model):
    if model.backbone == 'resnet101':
        model.num_layers = 101

@train_scope.observe(default=True)
def train_defaults(train):
    train.lr = 0.001
    train.epochs = 100

@model_scope.manual
def model_docs(model):
    model.backbone = 'Model backbone architecture'
    model.num_layers = 'Number of layers in the model'

@train_scope.manual
def train_docs(train):
    train.lr = 'Learning rate for optimizer'
    train.epochs = 'Total training epochs'

@scope
def main(model, train):
    print(f"Training {model.backbone} with lr={train.lr}")

if __name__ == '__main__':
    main()
```

```bash
python train.py manual
```
Output:
```
--------------------------------------------------
[Scope "model"]
(The Applying Order of Views)
model_defaults → model_advanced → (CLI Inputs) → model_lazy → main

(User Manuals)
model.backbone: Model backbone architecture
model.num_layers: Number of layers in the model
--------------------------------------------------
[Scope "train"]
(The Applying Order of Views)
train_defaults → (CLI Inputs) → main

(User Manuals)
train.lr: Learning rate for optimizer
train.epochs: Total training epochs
--------------------------------------------------
```
Real-world Example
This is especially valuable when debugging why a config value isn't what you expect:
```python
@scope.observe(default=True)
def defaults(cfg):
    cfg.lr = 0.001

@scope.observe(priority=1)
def experiment_config(cfg):
    cfg.lr = 0.01

@scope.observe(priority=2)
def another_config(cfg):
    cfg.lr = 0.1

@scope.observe(lazy=True)
def adaptive_lr(cfg):
    if cfg.batch_size > 64:
        cfg.lr = cfg.lr * 2
```
When you run `python train.py manual`, you see:
```
(The Applying Order of Views)
defaults → experiment_config → another_config → (CLI Inputs) → adaptive_lr → main
```
Now it's crystal clear why lr=0.1 (from another_config) and not 0.01!
SQL Tracker: Experiment Tracking
Lightweight experiment tracking using SQLite - no external services, no setup complexity.
Why SQL Tracker?
- Zero Setup: Just a SQLite file, no servers
- Full History: Track all runs, metrics, and artifacts
- Smart Search: Find similar experiments by config structure
- Code Versioning: Track code changes via fingerprints
Database Schema
```
Project (my_ml_project)
├── Experiment (run_1)
│   ├── config: {...}
│   ├── structural_hash: "abc123..."
│   ├── Metrics: [loss, accuracy, ...]
│   ├── Artifacts: [model.pt, plots/*, ...]
│   └── Fingerprints: [model_forward, train_step, ...]
├── Experiment (run_2)
└── ...
```
Quick Start
Logging Experiments
```python
from ato.db_routers.sql.manager import SQLLogger
from ato.adict import ADict

# Setup config
config = ADict(
    experiment=ADict(
        project_name='image_classification',
        sql=ADict(db_path='sqlite:///experiments.db')
    ),
    # Your hyperparameters
    lr=0.001,
    batch_size=32,
    model='resnet50'
)

# Create logger
logger = SQLLogger(config)

# Start experiment run
run_id = logger.run(tags=['baseline', 'resnet50', 'cifar10'])

# Training loop
for epoch in range(100):
    # Your training code
    train_loss = train_one_epoch()
    val_acc = validate()

    # Log metrics
    logger.log_metric('train_loss', train_loss, step=epoch)
    logger.log_metric('val_accuracy', val_acc, step=epoch)

# Log artifacts
logger.log_artifact(run_id, 'checkpoints/model_best.pt',
                    data_type='model',
                    metadata={'epoch': best_epoch})

# Finish run
logger.finish(status='completed')
```
Querying Experiments
```python
from ato.db_routers.sql.manager import SQLFinder

finder = SQLFinder(config)

# Get all runs in project
runs = finder.get_runs_in_project('image_classification')
for run in runs:
    print(f"Run {run.id}: {run.config.model} - {run.status}")

# Find best performing run
best_run = finder.find_best_run(
    project_name='image_classification',
    metric_key='val_accuracy',
    mode='max'  # or 'min' for loss
)
print(f"Best config: {best_run.config}")

# Find similar experiments (same config structure)
similar = finder.find_similar_runs(run_id=123)
print(f"Found {len(similar)} runs with similar config structure")

# Trace statistics (code fingerprints)
stats = finder.get_trace_statistics('image_classification', trace_id='model_forward')
print(f"Model forward pass has {stats['static_trace_versions']} versions")
```
Real-world Example: Experiment Comparison
```python
# Compare hyperparameter impact
finder = SQLFinder(config)
runs = finder.get_runs_in_project('my_project')

for run in runs:
    # Get final accuracy
    final_metrics = [m for m in run.metrics if m.key == 'val_accuracy']
    best_acc = max(m.value for m in final_metrics) if final_metrics else 0
    print(f"LR: {run.config.lr}, Batch: {run.config.batch_size} → Acc: {best_acc:.2%}")
```
Features Summary
| Feature | Description |
|---|---|
| Structural Hash | Auto-track config structure changes |
| Metric Logging | Time-series metrics with step tracking |
| Artifact Management | Track model checkpoints, plots, data files |
| Fingerprint Tracking | Version control for code (static & runtime) |
| Smart Search | Find similar configs, best runs, statistics |
Hyperparameter Optimization
Built-in Hyperband algorithm for efficient hyperparameter search with early stopping.
Extensible Design
Ato's hyperopt module is built for extensibility and reusability:
| Component | Purpose | Benefit |
|---|---|---|
| `GridSpaceMixIn` | Parameter sampling logic | Reusable across different algorithms |
| `HyperOpt` | Base optimization class | Easy to implement custom strategies |
| `DistributedMixIn` | Distributed training support | Optional, composable |
This design makes it trivial to implement custom search algorithms:
```python
import random

from ato.hyperopt.base import GridSpaceMixIn, HyperOpt

class RandomSearch(GridSpaceMixIn, HyperOpt):
    def main(self, func):
        # Reuse GridSpaceMixIn.prepare_distributions()
        configs = self.prepare_distributions(self.config, self.search_spaces)

        # Implement random sampling
        random.shuffle(configs)
        results = []
        for config in configs[:10]:  # Sample 10 random configs
            metric = func(config)
            results.append((config, metric))
        return max(results, key=lambda x: x[1])
```
How Hyperband Works
Hyperband uses successive halving:
- Start with many configs, train briefly
- Keep top performers, discard poor ones
- Train survivors longer
- Repeat until one winner remains
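To make the schedule concrete, here is a small sketch of the halving arithmetic in plain Python (an illustration, not Ato's implementation), using the `halving_rate=0.3` and `num_min_samples=3` values from the example below and a hypothetical starting population of 100 configs:

```python
# Successive-halving arithmetic (illustration only, not Ato's internals).
num_configs = 100      # hypothetical initial population
halving_rate = 0.3     # keep the top 30% after each round
num_min_samples = 3    # stop halving once the population is this small

round_id = 0
while num_configs > num_min_samples:
    print(f"round {round_id}: train {num_configs} configs briefly, keep top 30%")
    num_configs = max(int(num_configs * halving_rate), num_min_samples)
    round_id += 1
print(f"final round: train {num_configs} surviving configs to completion")
# 100 → 30 → 9 → 3: survivors get progressively more training budget
```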
Basic Usage
```python
from ato.adict import ADict
from ato.hyperopt.hyperband import HyperBand
from ato.scope import Scope

scope = Scope()

# Define search space
search_spaces = ADict(
    lr=ADict(
        param_type='FLOAT',
        param_range=(1e-5, 1e-1),
        num_samples=20,
        space_type='LOG'  # Logarithmic spacing
    ),
    batch_size=ADict(
        param_type='INTEGER',
        param_range=(16, 128),
        num_samples=5,
        space_type='LOG'
    ),
    model=ADict(
        param_type='CATEGORY',
        categories=['resnet50', 'resnet101', 'efficientnet_b0']
    )
)

# Create Hyperband optimizer
hyperband = HyperBand(
    scope,
    search_spaces,
    halving_rate=0.3,    # Keep top 30% each round
    num_min_samples=3,   # Stop when <= 3 configs remain
    mode='max'           # Maximize metric (use 'min' for loss)
)

@hyperband.main
def train(config):
    # Your training code
    model = create_model(config.model)
    optimizer = Adam(lr=config.lr)

    # Use __num_halved__ for early stopping
    num_epochs = compute_epochs(config.__num_halved__)

    # Train and return metric
    val_acc = train_and_evaluate(model, optimizer, num_epochs)
    return val_acc

if __name__ == '__main__':
    # Run hyperparameter search
    best_result = train()
    print(f"Best config: {best_result.config}")
    print(f"Best metric: {best_result.metric}")
```
Automatic Step Calculation
Let Hyperband compute optimal training steps:
```python
hyperband = HyperBand(scope, search_spaces, halving_rate=0.3, num_min_samples=4)

max_steps = 100000
steps_per_generation = hyperband.compute_optimized_initial_training_steps(max_steps)
# Example output: [27, 88, 292, 972, 3240, 10800, 36000, 120000]

# Use in training
@hyperband.main
def train(config):
    generation = config.__num_halved__
    num_steps = steps_per_generation[generation]
    metric = train_for_n_steps(num_steps)
    return metric
```
Parameter Types
| Type | Description | Example |
|---|---|---|
| `FLOAT` | Continuous values | Learning rate, dropout |
| `INTEGER` | Discrete integers | Batch size, num layers |
| `CATEGORY` | Categorical choices | Model type, optimizer |
Space types:
- `LOG`: Logarithmic spacing (good for learning rates)
- `LINEAR`: Linear spacing (default)
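To see what the two spacings mean in practice, a quick NumPy comparison over the learning-rate range used above (an illustration only, not Ato's sampler; 5 samples for brevity):

```python
import numpy as np

lin = np.linspace(1e-5, 1e-1, num=5)
log = np.logspace(np.log10(1e-5), np.log10(1e-1), num=5)

print(lin)  # [1.0e-05 2.5e-02 5.0e-02 7.5e-02 1.0e-01] - crowds the high end
print(log)  # [1.0e-05 1.0e-04 1.0e-03 1.0e-02 1.0e-01] - even coverage per decade
```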
Distributed Hyperparameter Search
Ato supports distributed hyperparameter optimization out of the box:
```python
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

from ato.hyperopt.hyperband import DistributedHyperBand

# Initialize distributed training
dist.init_process_group(backend='nccl')
rank = dist.get_rank()
world_size = dist.get_world_size()

# Create distributed hyperband
hyperband = DistributedHyperBand(
    scope,
    search_spaces,
    halving_rate=0.3,
    num_min_samples=3,
    mode='max',
    rank=rank,
    world_size=world_size,
    backend='pytorch'
)

@hyperband.main
def train(config):
    # Your distributed training code
    model = create_model(config)
    model = DDP(model, device_ids=[rank])
    metric = train_and_evaluate(model)
    return metric

if __name__ == '__main__':
    result = train()
    if rank == 0:
        print(f"Best config: {result.config}")
```
Key features:
- Automatic work distribution across GPUs
- Synchronized config selection via `broadcast_object_from_root`
- Results aggregation with `all_gather_object`
- Compatible with PyTorch DDP, FSDP, DeepSpeed
Best Practices
1. Project Structure
```
my_project/
├── configs/
│   ├── default.py       # Default config with @scope.observe(default=True)
│   ├── models.py        # Model-specific configs
│   └── datasets.py      # Dataset configs
├── train.py             # Main training script
├── experiments.db       # SQLite experiment tracking
└── experiments/
    ├── run_001/
    │   ├── checkpoints/
    │   └── logs/
    └── run_002/
```
2. Config Organization
```python
# configs/default.py
from ato.adict import ADict
from ato.scope import Scope

scope = Scope()

@scope.observe(default=True)
def defaults(cfg):
    # Data
    cfg.data = ADict(
        dataset='cifar10',
        batch_size=32,
        num_workers=4
    )
    # Model
    cfg.model = ADict(
        backbone='resnet50',
        pretrained=True
    )
    # Training
    cfg.train = ADict(
        lr=0.001,
        epochs=100,
        optimizer='adam'
    )
    # Experiment tracking
    cfg.experiment = ADict(
        project_name='my_project',
        sql=ADict(db_path='sqlite:///experiments.db')
    )
```
3. Combined Workflow
```python
from ato.db_routers.sql.manager import SQLLogger
from configs.default import scope

@scope
def train(cfg):
    # Setup experiment tracking
    logger = SQLLogger(cfg)
    run_id = logger.run(tags=[cfg.model.backbone, cfg.data.dataset])

    try:
        # Training loop
        for epoch in range(cfg.train.epochs):
            loss = train_epoch()
            acc = validate()
            logger.log_metric('loss', loss, epoch)
            logger.log_metric('accuracy', acc, epoch)
        logger.finish(status='completed')
    except Exception:
        logger.finish(status='failed')
        raise

if __name__ == '__main__':
    train()
```
4. Reproducibility Checklist
- ✅ Use structural hashing to track config changes
- ✅ Log all hyperparameters to SQLLogger
- ✅ Tag experiments with meaningful labels
- ✅ Track artifacts (checkpoints, plots)
- ✅ Use lazy configs for derived parameters
- ✅ Document configs with `@scope.manual`
Requirements
- Python >= 3.7
- SQLAlchemy (for SQL Tracker)
- PyYAML, toml (for config serialization)
See pyproject.toml for full dependencies.
License
MIT License
Roadmap: Expanding Boundaries Without Breaking Neutrality
Ato's design constraint is structural neutrality — adding capabilities without creating dependencies.
Planned: Local Dashboard (Optional Module)
A lightweight HTML dashboard for teams that want visual exploration without committing to MLflow/W&B:
What it adds:
- Metric comparison & trends (read-only view of SQLite data)
- Run history & artifact browsing
- Config diff visualization (including structural hash changes)
- Interactive hyperparameter analysis
Design constraints:
- No hard dependency — Ato core works 100% without the dashboard
- Separate process — Dashboard reads from SQLite; doesn't block or modify runs
- Zero lock-in — Remove the dashboard, and your training code doesn't change
- Composable — Use it alongside MLflow/W&B, or replace either one
Why This Fits Ato's Philosophy
The dashboard is not a platform — it's a view into data you already own (SQLite).
| What It Doesn't Do | Why That Matters |
|---|---|
| Doesn't store data | You can delete it without losing experiments |
| Doesn't require auth | No accounts, no vendors, no network calls |
| Doesn't modify configs | Pure read-only visualization |
| Doesn't couple to Ato's core | Works with any SQLite database |
This preserves Ato's design principle: provide handles, not ownership.
Modular Adoption Path
| What You Need | What You Use |
|---|---|
| Just configs | ADict + Scope — no DB, no UI |
| Headless tracking | Add SQLTracker — still no UI |
| Local visualization | Add dashboard daemon — run/stop anytime |
| Team collaboration | Sync to MLflow/W&B dashboards |
Guiding principle: Ato remains a set of independent, composable tools — not a platform you commit to.
Contributing
Contributions are welcome! Please feel free to submit issues or pull requests.
Development Setup
```bash
git clone https://github.com/yourusername/ato.git
cd ato
pip install -e .
```
Working with Existing Tools
Ato is an integration layer, not a replacement.
It's designed to compose with Hydra, MLflow, W&B, and whatever tools you already use. The goal isn't to compete — it's to give you handles for connecting systems without coupling your workflow to any single platform.
Composition Strategy
Ato provides three structural capabilities that frameworks don't:
| What Ato Adds | Why It Matters | How It Composes |
|---|---|---|
| MultiScope | True namespace isolation | Multiple config sources (Hydra, raw Python, CLI) coexist without key collisions |
| `manual` command | Config merge order visualization | Debug why a value was set — see exact merge order across all sources |
| Offline-first tracking | Zero-setup SQLite tracking | Track locally, then sync to MLflow/W&B only when you need dashboards |
These aren't "features" — they're structural boundaries that let you compose tools freely.
Ato + Hydra: Designed to Compose
Ato has native Hydra composition via `compose_hierarchy()`:
```python
from ato.adict import ADict

# Load Hydra-style configs directly
config = ADict.compose_hierarchy(
    root='configs',
    config_filename='config',
    select={'model': 'resnet50', 'data': 'imagenet'},
    overrides={'model.lr': 0.01}
)

# Now add Ato's structural boundaries:
# - MultiScope for independent namespaces
# - `manual` command to debug merge order
# - SQLite tracking without MLflow overhead
```
You're not replacing Hydra — you're extending it with namespace isolation and debuggable composition.
Integration Matrix
Ato is designed to work between frameworks:
| Tool | What It Owns | What Ato Adds |
|---|---|---|
| Hydra | Config composition from YAML | MultiScope (namespace isolation) + merge debugging |
| MLflow | Centralized experiment platform | Local-first SQLite tracking + structural hashing |
| W&B | Cloud-based tracking + dashboards | Offline tracking + sync when ready |
| OpenMMLab | Config inheritance (`_base_`) | Direct import via `load_mm_config()` |
| Optuna/Ray Tune | Advanced hyperparameter search | Built-in Hyperband + composable with their optimizers |
Composition Patterns
Pattern 1: Ato as the integration layer
```
Hydra (config source) → Ato (composition + tracking) → MLflow (dashboards)
```
Pattern 2: Ato for local development
```
Local experiments: Ato (full stack)
Production:        Ato → MLflow (centralized tracking)
```
Pattern 3: Gradual adoption
```
Start:       Ato alone (zero dependencies)
Scale:       Add Hydra for complex configs
Collaborate: Sync to W&B for team dashboards
```
When to Use What
Use Ato alone when:
- You want zero external dependencies
- You need namespace isolation (MultiScope)
- You want to debug config merge order
Compose with Hydra when:
- Your team already has Hydra YAML configs
- You need deep config hierarchies + namespace isolation
- You want to see why a Hydra config set a value (`manual` command)
Compose with MLflow/W&B when:
- You want local-first tracking with optional cloud sync
- You need structural hashing + offline SQLite
- You're migrating between tracking platforms
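The README doesn't prescribe a sync mechanism, so here is a minimal sketch of the local-first pattern: replaying SQLite-tracked runs into MLflow once dashboards are needed. It uses the `SQLFinder` API shown earlier plus the standard MLflow client; treating each metric record as having `key`, `value`, and `step` attributes is our assumption, and the replay itself is an illustration rather than a built-in Ato feature:

```python
import mlflow

from ato.adict import ADict
from ato.db_routers.sql.manager import SQLFinder

config = ADict(experiment=ADict(project_name='image_classification',
                                sql=ADict(db_path='sqlite:///experiments.db')))
finder = SQLFinder(config)

# Replay locally tracked runs into MLflow when you want dashboards.
mlflow.set_experiment('image_classification')
for run in finder.get_runs_in_project('image_classification'):
    with mlflow.start_run(run_name=f'ato-run-{run.id}'):
        mlflow.log_params(dict(run.config))  # config keys become MLflow params
        for m in run.metrics:
            mlflow.log_metric(m.key, m.value, step=m.step)  # step attr assumed
```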
You don't need Ato if:
- You're fully committed to a single framework ecosystem
- You don't need debuggable config composition
- You never switch between tools
What Ato Doesn't Do
Ato intentionally doesn't build an ecosystem:
- No web dashboards → Use MLflow/W&B
- No model registry → Use MLflow
- No dataset versioning → Use DVC/W&B
- No plugin marketplace → Use Hydra
Ato's goal: Stay out of your way. Provide handles. Let you change tools without rewriting code.