blowtorch

Intuitive, high-level training framework for research and development. It abstracts away the boilerplate normally associated with training and evaluating PyTorch models, without limiting your flexibility. Blowtorch provides the following:

  • A way to specify training runs at a high level, while not giving up on fine-grained control over individual training parts
  • Automated checkpointing, logging and resuming of runs
  • Sacred-inspired configuration management
  • Reproducibility by keeping track of configuration, code and random state of each run

Installation

Make sure you have numpy and torch installed, then install with pip:

pip install --upgrade blowtorch

Minimal working example

import torch.nn.functional as F
from torch.optim import Adam
from torch.utils.data import DataLoader
from torchvision.models import vgg16
from torchvision.datasets import ImageNet
from blowtorch import Run

run = Run(random_seed=123)

@run.train_step
@run.validate_step
def step(batch, model):
    x, y = batch
    y_hat = model(x)
    # cross-entropy expects raw logits and integer class labels, and returns a scalar
    loss = F.cross_entropy(y_hat, y)
    return loss

# will be called when model has been moved to the desired device 
@run.configure_optimizers
def configure_optimizers(model):
    return Adam(model.parameters())

train_loader = DataLoader(ImageNet('.', split='train'), batch_size=4)
val_loader = DataLoader(ImageNet('.', split='val'), batch_size=4)

run(vgg16(), train_loader, val_loader)

Configuration

You can pass multiple configuration files in YAML format to your Run, e.g.

run = Run(config_files=['config/default.yaml'])

Configuration values can then be accessed via e.g. run['model']['num_layers']. Dotted notation is also supported, e.g. run['model.num_layers']. When executing your training script, individual configuration values can be overridden as follows:

python train.py with model.num_layers=4 model.use_dropout=True
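
For example, given a config/default.yaml along these lines (the keys here are illustrative, not prescribed by blowtorch):

# config/default.yaml (illustrative keys)
model:
  num_layers: 2
  use_dropout: false

With the command line above, run['model.num_layers'] then yields 4 instead of the file's default of 2.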

Run options

Run.run() takes the following options (a usage sketch follows the list):

  • model: torch.nn.Module
  • train_loader: torch.utils.data.DataLoader
  • val_loader: torch.utils.data.DataLoader
  • loggers: Optional[List[blowtorch.loggers.BaseLogger]] (List of loggers that subscribe to various logging events, see logging section)
  • max_epochs: int (default 1)
  • use_gpu: bool (default True)
  • gpu_id: int (default 0)
  • resume_checkpoint: Optional[Union[str, pathlib.Path]] (Path to checkpoint directory to resume training from, default None)
  • save_path: Union[str, pathlib.Path] (Path to directory that blowtorch will save logs and checkpoints to, default 'train_logs')
  • run_name: Optional[str] (Name associated with that run, will be randomly created if None, default None)
  • optimize_metric: Optional[str] (train metric that will be used for optimization, will pick the first returned one if None, default None)
  • checkpoint_metric: Optional[str] (validation metric that will be used for checkpointing, will pick the first returned one if None, default None)
  • smaller_is_better: bool (default True)
  • optimize_first: bool (whether optimization should occur during the first epoch, default False)
  • detect_anomalies: bool (enable autograd anomaly detection, default False)
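
For example, a run that checkpoints on a validation accuracy metric might be invoked like this. The values are illustrative, and the metric name 'accuracy' assumes the validate step returns a correspondingly named metric:

run(
    vgg16(),
    train_loader,
    val_loader,
    max_epochs=50,
    gpu_id=0,
    checkpoint_metric='accuracy',  # illustrative; must match a metric returned by the validate step
    smaller_is_better=False,       # higher accuracy is better
    run_name='vgg16-baseline',
)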

Logging

Blowtorch will create a folder named "[timestamp]-[name]-[sequential integer]" for each run inside the run.save_path directory. There it saves the run's configuration, metrics, a model summary, checkpoints, as well as source code. Additional loggers can be added through Run's loggers parameter:

  • blowtorch.loggers.WandbLogger: Logs to Weights & Biases
  • blowtorch.loggers.TensorBoardLogger: Logs to TensorBoard

Custom loggers can be created by subclassing blowtorch.loggers.BaseLogger.
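
For example, to mirror metrics to both Weights & Biases and TensorBoard (constructor arguments are omitted here as an assumption; check the respective logger classes for their exact signatures):

from blowtorch.loggers import WandbLogger, TensorBoardLogger

# constructor arguments omitted; the classes may accept project/log-dir settings
run(vgg16(), train_loader, val_loader, loggers=[WandbLogger(), TensorBoardLogger()])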

Decorators

Blowtorch uses decorator syntax to specify parts of the training pipeline:

  • @run.train_step, @run.validate_step: Specify the train/validation steps, either with a single function for both (stack the decorators, as in the example above) or with two separate functions. Arguments: batch, model, is_validate, device, epoch
  • @run.train_epoch, @run.validate_epoch: Specify a whole train/validation epoch, in case more flexibility for iteration/optimization is required (see the sketch after this list). Arguments: data_loader, model, is_validate, optimizers
  • @run.configure_optimizers: Return optimizers and learning rate schedulers. Can either return a single optimizer object or a dictionary with multiple optimizers/schedulers. Arguments: model
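
As an illustration, here is a minimal custom epoch sketch. It assumes the single-optimizer form returned by configure_optimizers above, that a custom epoch performs its own backward/step, and that batches arrive on the right device; none of these details are spelled out in the list above:

import torch.nn.functional as F

@run.train_epoch
def train_epoch(data_loader, model, is_validate, optimizers):
    # assumption: configure_optimizers returned a single optimizer object
    optimizer = optimizers
    total_loss = 0.0
    for x, y in data_loader:
        loss = F.cross_entropy(model(x), y)
        if not is_validate:
            # assumption: custom epochs handle the optimization step themselves
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()
        total_loss += loss.item()
    # return an aggregate metric for the epoch
    return total_loss / len(data_loader)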

TODO hooks
