PyTorch Training Manager v1.4.3

These details have not been verified by PyPI

Project links

Repository

Project description

torchmanager

A generic deep learning training/testing framework for PyTorch

To use this framework, simply initialize a Manager object. The Manager class provides a generic training/testing loop for PyTorch models. It also provides some useful callbacks to use during training/testing.

Pre-request

Python 3.10+
PyTorch
Packaging
tqdm
PyYAML (Optional for yaml configs)
scipy (Optional for FID metric)
tensorboard (Optional for tensorboard recording)

Installation

PyPi: pip install torchmanager
Conda: conda install torchmanager -c conda-forge

Start from Configurations

The Configs class is designed to be inherited to define necessary configurations. It also provides a method to get configurations from terminal arguments.

from torchmanager.configs import Configs as _Configs

# define necessary configurations
class Configs(_Configs):
    epochs: int
    lr: float
    ...

    @staticmethod
    def get_arguments(parser: Union[argparse.ArgumentParser, argparse._ArgumentGroup] = argparse.ArgumentParser()) -> Union[argparse.ArgumentParser, argparse._ArgumentGroup]:
        '''Add arguments to argument parser'''
        ...

    def show_settings(self) -> None:
        '''Display current configuerations'''
        ...

# get configs from terminal arguments
configs = Configs.from_arguments()

Torchmanager Dataset

The data.Dataset class is designed to be inherited to define a dataset. It is a combination of torch.utils.data.Dataset and torch.utils.data.DataLoader with easier usage.

from torchmanager.data import Dataset

# define dataset
class CustomDataset(Dataset):
    def __init__(self, ...):
        ...

    @property
    def unbatched_len(self) -> int:
        '''The total length of data without batch'''
        ...

    def __getitem__(self, index: int) -> tuple[torch.Tensor, torch.Tensor]:
        '''Returns a single pair of unbatched data, iterator will batch the data automatically with `torch.util.data.DataLoader`'''
        ...

# initialize datasets
training_dataset = CustomDataset(...)
val_dataset = CustomDataset(...)
testing_dataset = CustomDataset(...)

The Manager

The Manager class is the core of the framework. It provides a generic training/testing pipeline for PyTorch models. The Manager class is designed to be inherited to manage the training/testing algorithm. There are also some useful callbacks to use during training/testing.

Initialize the manager with target model, optimizer, loss function, and metrics:

import torch, torchmanager

# define model
class PytorchModel(torch.nn.Module):
    ...

# initialize model, optimizer, loss function, and metrics
model = PytorchModel(...)
optimizer = torch.optim.SGD(model.parameters(), lr=configs.lr)
loss_fn = torchmanager.losses.CrossEntropy()
metrics = {'accuracy': torchmanager.metrics.SparseCategoricalAccuracy()}

# initialize manager
manager = torchmanager.Manager(model, optimizer, loss_fn=loss_fn, metrics=metrics)

Multiple losses can be used by passing a dictionary to loss_fn:

loss_fn = {
    'loss1': torchmanager.losses.CrossEntropy(),
    'loss2': torchmanager.losses.Dice(),
    ...
}  # total_loss = loss1 + loss2

Use weight for constant weight coefficients to control the balance between multiple losses:

# define weights
w1: float = ...
w2: float = ...

loss_fn = {
    'loss1': torchmanager.losses.CrossEntropy(weight=w1),
    'loss2': torchmanager.losses.Dice(),
    ...
}  # total_loss = w1 * loss1 + w2 * loss2

Use target for output targets between different losses:

class ModelOutputDict(TypedDict):
    output1: torch.Tensor
    output2: torch.Tensor

LabelDict = ModelOutputDict  # optional, label can also be a direct `torch.Tensor` to compare with target

loss_fn = {
    'loss1': torchmanager.losses.CrossEntropy(target="output1"),
    'loss2': torchmanager.losses.Dice(target="output2"),
    ...
}  # total_loss = loss1(y['output1'], label['output1']) + loss2(y['output2'], label['output2]) if type(label) is LabelDict else loss1(y['output1'], label) + loss2(y['output2'], label)

Train the model with fit method:

show_verbose: bool = ... # show progress bar information during training/testing
manager.fit(training_dataset, epochs=configs.epochs, val_dataset=val_dataset, show_verbose=show_verbose)

There are also some other callbacks to use:

tensorboard_callback = torchmanager.callbacks.TensorBoard('logs') # tensorboard dependency required
last_ckpt_callback = torchmanager.callbacks.LastCheckpoint(manager, 'last.model')
model = manager.fit(..., callbacks_list=[tensorboard_callback, last_ckpt_callback])

Test the model with test method:

manager.test(testing_dataset, show_verbose=show_verbose)

Save the final trained PyTorch model:

torch.save(model, "model.pth") # The saved PyTorch model can be loaded individually without using torchmanager

Device selection during training/testing

Torchmanager automatically identifies available devices for training and testing. If CUDA or MPS is available, it will be used first. To use multiple GPUs, set the use_multi_gpus flag to True. To specify a different device for training or testing, pass the device to the fit or test method, respectively. When use_multi_gpus is set to False, the first available or specified device will be used.

Multi-GPU (CUDA) training/testing:

# train on multiple GPUs
model = manager.fit(..., use_multi_gpus=True)

# test on multiple GPUs
manager.test(..., use_multi_gpus=True)

Use only specified GPUs for training/testing:

# specify devices to use
gpus: list[torch.device] | torch.device = ... # Notice: device id must be specified

# train on specified multiple GPUs
model = manager.fit(..., use_multi_gpus=True, devices=gpus) # Notice: `use_multi_gpus` must set to `True` to use all specified GPUs, otherwise only the first will be used.

# test on specified multiple GPUs
manager.test(..., use_multi_gpus=True, devices=gpus)

Customize training/testing algorithm

Inherited the Manager (TrainingManager) class to manage the training/testing algorithm if default training/testing algorithm is necessary. To customize the training/testing algorithm, simply override the train_step and/or test_step methods.

class CustomManager(Manager):
    ...

    def train_step(x_train: Any, y_train: Any) -> dict[str, float]:
        ...  # code before default training step
        summary = super().train_step(x_train, y_train)
        ...  # code after default training step
        return summary

    def test_step(x_test: Any, y_test: Any) -> dict[str, float]:
        ...  # code before default testing step
        summary = super().test_step(x_test, y_test)
        ...  # code after default testing step
        return summary

Inherited the TestingManager class to manage the testing algorithm without training algorithm if default testing algorithm is necessary. To customize the testing algorithm, simply override the test_step methods.

class CustomManager(TestingManager):
    ...

    def test_step(x_test: Any, y_test: Any) -> dict[str, float]:
        ...  # code before default testing step
        summary = super().test_step(x_test, y_test)
        ...  # code after default testing step
        return summary

Inherited the BasicTrainingManager class to implement the training algorithm with train_step method and testing algorithm with test_step.

class CustomManager(BasicTrainingManager):
    ...

    def train_step(x_train: Any, y_train: Any) -> dict[str, float]:
        ...  # code for one iteration training
        summary: dict[str, float] = ...  # set training summary
        return summary

    def test_step(x_test: Any, y_test: Any) -> dict[str, float]:
        ...  # code for one iteration testing
        summary = ...  # set testing summary
        return summary

Inherited the BasicTestingManager class to implement the testing algorithm with test_step method without training algorithm.

class CustomManager(BasicTestingManager):
    ...

    def test_step(x_test: Any, y_test: Any) -> dict[str, float]:
        ...  # code for one iteration testing
        summary = ...  # set testing summary
        return summary

The saved experiment information

The Experiment class is designed to be used as a single callback to save experiment information. It is a combination of torchmanager.callbacks.TensorBoard, torchmanager.callbacks.LastCheckpoint, and torchmanager.callbacks.BestCheckpoint with easier usage.

...

exp_callback = torchmanager.callbacks.Experiment('test.exp', manager) # tensorboard dependency required
model = manager.fit(..., callbacks_list=[exp_callback])

The information, including full training logs and checkpoints, will be saved in the following structure:

experiments
└── <experiment name>.exp
    ├── checkpoints
    │   ├── best-<metric name>.model
    │   └── last.model
    └── data
    │   └── <TensorBoard data file>
    ├── <experiment name>.cfg
    └── <experiment name>.log

Please cite this work if you find it useful

@software{he_2023_10381715,
  author       = {He, Qisheng and
                  Dong, Ming},
  title        = {{TorchManager: A generic deep learning 
                   training/testing framework for PyTorch}},
  month        = dec,
  year         = 2023,
  publisher    = {Zenodo},
  version      = 1,
  doi          = {10.5281/zenodo.10381715},
  url          = {https://doi.org/10.5281/zenodo.10381715}
}

Also checkout our projects implemented with torchmanager

A-Bridge (SDE-BBDM) - Score-Based Image-to-Image Brownian Bridge
MAG-MS/MAGNET - Modality-Agnostic Learning for Medical Image Segmentation Using Multi-modality Self-distillation
tlt - Transferring Lottery Tickets in Computer Vision Models: a Dynamic Pruning Approach

Project details

These details have not been verified by PyPI

Project links

Repository

Release history Release notifications | RSS feed

1.5b3 pre-release

Jun 16, 2026

This version

1.4.3

Jan 5, 2026

1.4.2

Sep 30, 2025

1.4.1

Jul 25, 2025

1.4

May 5, 2025

1.4rc2 pre-release yanked

Apr 21, 2025

1.4rc1 pre-release yanked

Apr 8, 2025

1.3.4

Mar 3, 2025

1.3.3

Jan 7, 2025

1.3.2

Oct 25, 2024

1.3.1

Sep 25, 2024

1.3

Aug 26, 2024

1.2.6

Jul 31, 2024

1.2.5

Jul 8, 2024

1.2.4

Apr 22, 2024

1.2.3

Mar 6, 2024

1.2.2

Jan 17, 2024

1.2.1

Dec 7, 2023

1.2

Nov 10, 2023

1.1.6

Oct 6, 2023

1.1.5

Jul 24, 2023

1.1.4

Jun 29, 2023

1.1.3

May 26, 2023

1.1.2

Apr 21, 2023

1.1.1

Apr 4, 2023

1.1

Mar 10, 2023

1.0.7

Jan 9, 2023

1.0.6

Oct 24, 2022

1.0.5

Aug 9, 2022

1.0.4

Jul 21, 2022

1.0.3

May 2, 2022

1.0.2

Apr 4, 2022

1.0.1

Mar 7, 2022

1.0.0

Feb 22, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

torchmanager-1.4.3.tar.gz (47.8 kB view details)

Uploaded Jan 5, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

torchmanager-1.4.3-py3-none-any.whl (71.8 kB view details)

Uploaded Jan 5, 2026 Python 3

File details

Details for the file torchmanager-1.4.3.tar.gz.

File metadata

Download URL: torchmanager-1.4.3.tar.gz
Upload date: Jan 5, 2026
Size: 47.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.18

File hashes

Hashes for torchmanager-1.4.3.tar.gz
Algorithm	Hash digest
SHA256	`66cbdf960a41914c16df3ae26ed0d43bccc28ce7d606ee811e7de066075fd3c7`
MD5	`040f20dfbfc86abc04e9a1d21aa917df`
BLAKE2b-256	`99491673c8bbc358af4f908847b07fc4f82e2e25a272d4700e41c6bc03778f99`

See more details on using hashes here.

File details

Details for the file torchmanager-1.4.3-py3-none-any.whl.

File metadata

Download URL: torchmanager-1.4.3-py3-none-any.whl
Upload date: Jan 5, 2026
Size: 71.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.18

File hashes

Hashes for torchmanager-1.4.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9cddcb5eff876f57e0dc915d2f585a43f4a1ad551feb22816df46deb45d79801`
MD5	`93ecdb631051bb07e37abc6ab575a5a8`
BLAKE2b-256	`24b8cdabd4ae6f15675cf3079057e04f2e31720eecf92504c03d2333f5dac6aa`

See more details on using hashes here.

torchmanager 1.4.3

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

torchmanager

A generic deep learning training/testing framework for PyTorch

Pre-request

Installation

Start from Configurations

Torchmanager Dataset

The Manager

Device selection during training/testing

Customize training/testing algorithm

The saved experiment information

Please cite this work if you find it useful

Also checkout our projects implemented with torchmanager

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes