
Easy Neural Network Experiments with PyTorch


A very lightweight framework on top of PyTorch that retains PyTorch's full functionality.



  • Introduces two extra data-handling hooks on top of PyTorch's data transforms:

    • Pooled runs, which combine multiple datasets without moving them from their original locations.
    • Data specifications (dataspecs), which describe each dataset and its augmentations.
  • Introduces two extra multi-processing features for fast training by extending the easytorch.ETDataset class:

    • Multi-threaded data pre-loading.
    • Disk caching for faster access.
from easytorch import ETDataset

class MyDataset(ETDataset):
    def load_index(self, dataset_name, file):
        """(Optional) Load/process something and add it to the disk cache as
        self.diskcache.add(file, value). This method runs in multiple
        processes by default."""
        self.indices.append([dataset_name, file])

    def __getitem__(self, index):
        dataset_name, file = self.indices[index]
        dataspec = self.dataspecs[dataset_name]
        # (Optional) Retrieve a cached value with self.diskcache.get(file).

        image = ...  # TODO: load the file/image.
        label = ...  # TODO: load the corresponding label.

        # Extra preprocessing, if needed.
        # Apply transforms, if needed.

        return image, label
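
For concreteness, here is a minimal sketch of what the two TODOs might look like for an image/label folder dataset. It assumes the dataspec carries the data_dir, label_dir, and label_getter fields described in the dataspec section below, that file is a filename relative to data_dir, and that PIL and NumPy are used for loading; these are illustrative choices, not requirements of easytorch:

import os

import numpy as np
from PIL import Image

from easytorch import ETDataset


class MyImageDataset(ETDataset):
    def load_index(self, dataset_name, file):
        # One index entry per image file; heavy preprocessing could be cached here.
        self.indices.append([dataset_name, file])

    def __getitem__(self, index):
        dataset_name, file = self.indices[index]
        dataspec = self.dataspecs[dataset_name]

        # Load the image and its label using the paths/getter from the dataspec.
        image = np.asarray(Image.open(os.path.join(dataspec['data_dir'], file)))
        label_file = dataspec['label_getter'](file)
        label = np.asarray(Image.open(os.path.join(dataspec['label_dir'], label_file)))

        return image, label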

Installation

  1. pip install --upgrade pip
  2. Install the latest pytorch and torchvision from the PyTorch website
  3. pip install easytorch

Let's start with something simple like MNIST digit classification:

from easytorch import EasyTorch, ETTrainer, ConfusionMatrix, ETMeter
from torchvision import datasets, transforms
import torch.nn.functional as F
import torch
from examples.models import MNISTNet

transform = transforms.Compose([
    transforms.ToTensor(),
    transforms.Normalize((0.1307,), (0.3081,))
])


class MNISTTrainer(ETTrainer):
    def _init_nn_model(self):
        self.nn['model'] = MNISTNet()

    def iteration(self, batch):
        inputs, labels = batch[0].to(self.device['gpu']).float(), batch[1].to(self.device['gpu']).long()

        out = self.nn['model'](inputs)
        loss = F.nll_loss(out, labels)
        _, pred = torch.max(out, 1)
        
        meter = self.new_meter()
        meter.averages.add(loss.item(), len(inputs))
        meter.metrics['cfm'].add(pred, labels.float())

        return {'loss': loss, 'meter': meter, 'predictions': pred}

    def init_experiment_cache(self):
        self.cache['log_header'] = 'Loss|Accuracy,F1,Precision,Recall'
        self.cache.update(monitor_metric='f1', metric_direction='maximize')

    def new_meter(self):
        return ETMeter(
            cfm=ConfusionMatrix(num_classes=10)
        )


if __name__ == "__main__":
    train_dataset = datasets.MNIST('../data', train=True, download=True, transform=transform)
    val_dataset = datasets.MNIST('../data', train=False, transform=transform)

    dataloader_args = {'train': {'dataset': train_dataset}, 'validation': {'dataset': val_dataset}}
    runner = EasyTorch(phase='train', batch_size=512,
                       epochs=10, gpus=[0], dataloader_args=dataloader_args)
    runner.run(MNISTTrainer)
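
After training, the same trainer can run only the test step by switching the phase. The following is a minimal sketch that reuses the transform and MNISTTrainer defined above and assumes dataloader_args accepts a 'test' key analogous to 'train' and 'validation':

from torchvision import datasets

from easytorch import EasyTorch

if __name__ == "__main__":
    test_dataset = datasets.MNIST('../data', train=False, transform=transform)

    # Assumption: a 'test' key mirrors the 'train'/'validation' keys used above.
    dataloader_args = {'test': {'dataset': test_dataset}}
    runner = EasyTorch(phase='test', batch_size=512, gpus=[0],
                       dataloader_args=dataloader_args)
    runner.run(MNISTTrainer)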

General use case:

1. Define your trainer

from easytorch import ETTrainer, Prf1a, ETMeter, AUCROCMetrics


class MyTrainer(ETTrainer):

    def _init_nn_model(self):
        self.nn['model'] = NeuralNetModel(out_size=self.args['num_class'])

    def iteration(self, batch):
        """Handle a single batch. Must return at least 'loss' and 'meter'."""
        return {'loss': ..., 'meter': ..., 'predictions': ...}

    def new_meter(self):
        return ETMeter(
            num_averages=1,
            prf1a=Prf1a(),
            auc=AUCROCMetrics()
        )

    def init_experiment_cache(self):
        """Will plot Loss in one plot, and Accuracy,F1_score in another."""
        self.cache['log_header'] = 'Loss|Accuracy,F1_score'
        
        """Model selection using validation set if present"""
        self.cache.update(monitor_metric='f1', metric_direction='maximize')

  • The method new_meter() returns an ETMeter that takes any implementation of easytorch.meter.ETMetrics. Provided implementations:
    • easytorch.metrics.Prf1a() for binary classification; computes accuracy, F1, precision, recall, and overlap/IOU.
    • easytorch.metrics.ConfusionMatrix(num_classes=...) for multiclass classification; also computes global accuracy, F1, precision, and recall.
    • easytorch.metrics.AUCROCMetrics for the binary ROC-AUC score.
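
A minimal sketch of how such a meter might be filled inside iteration(), following the same averages.add / metrics[...].add pattern as the MNIST example above. The 'prf1a' key assumes the keyword passed to ETMeter in new_meter() becomes the key in meter.metrics, as 'cfm' does in the MNIST example; the loss and prediction rule are illustrative:

import torch
import torch.nn.functional as F

# A possible body for MyTrainer.iteration above.
def iteration(self, batch):
    inputs = batch[0].to(self.device['gpu']).float()
    labels = batch[1].to(self.device['gpu']).long()

    out = self.nn['model'](inputs)
    loss = F.cross_entropy(out, labels)
    _, pred = torch.max(out, 1)

    meter = self.new_meter()
    meter.averages.add(loss.item(), len(inputs))  # running loss (num_averages=1)
    meter.metrics['prf1a'].add(pred, labels)      # binary predictions vs. targets
    # meter.metrics['auc'] could be fed similarly, e.g. with class scores.

    return {'loss': loss, 'meter': meter, 'predictions': pred}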

2. Define specifications for your datasets:

import os

def get_label(x):
    return x.split('_')[0] + '_label.png'

sep = os.sep
MYDATA = {
    'name': 'mydata',
    'data_dir': 'MYDATA' + sep + 'images',
    'label_dir': 'MYDATA' + sep + 'labels',
    'label_getter': get_label
}

MyOTHERDATA = {
    'name': 'otherdata',
    'data_dir': 'OTHERDATA' + sep + 'images',
    'label_dir': 'OTHERDATA' + sep + 'labels',
    'label_getter': get_label
}

  • EasyTorch automatically splits the training data in 'data_dir' as specified (split_ratio, or num_folds in the EasyTorch module as below).
  • One can also provide custom splits (JSON files with train, validation, and test file lists) in the directory specified by split_dir in the dataspec.
  • One can give a path to a .txt file with a list of image paths for the test (inference) phase in the split_dir field of the dataspec.
  • Additional options in dataspecs (see the sketch after this list):
    • Load from sub-folders, "sub_folders": ["class0", "class1", ... "class_K"]
    • Load recursively, "recursive": True
    • Filter by an extension, "extension": "png"
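
Putting these options together, a dataspec might look like the following sketch. The directory names and the particular combination of options are illustrative; only the keys themselves come from the documentation above:

import os

sep = os.sep

CLASSIFICATION_DATA = {
    'name': 'myclasses',
    'data_dir': 'MYCLASSES' + sep + 'images',
    # Load images grouped into one sub-folder per class.
    'sub_folders': ['class0', 'class1', 'class2'],
    # Walk data_dir recursively and keep only .png files.
    'recursive': True,
    'extension': 'png',
    # Optional: directory with custom train/validation/test split files.
    'split_dir': 'MYCLASSES' + sep + 'splits'
}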

3. Entry point (say main.py)

from easytorch import EasyTorch

data_specifications = [MYDATA, MyOTHERDATA]
runner = EasyTorch(data_specifications,
                   phase="train", batch_size=4, epochs=21,
                   num_channel=1, num_class=2,
                   split_ratio=[0.6, 0.2, 0.2])  # or num_folds=5 (exclusive with split_ratio)

if __name__ == "__main__":
    runner.run(MyTrainer, MyDataset)  # Train an individual model for each dataset.
    runner.run_pooled(MyTrainer, MyDataset)  # Train a single model on the pooled datasets.

Run from the command line:

python main.py -ph train -b 4 -e 21 -spl 0.6 0.2 0.2

Note: arguments given directly to the EasyTorch constructor take precedence over command-line arguments. See below for the list of default arguments.


All the best! Cheers! 🎉

Cite the following papers if you use this library:

@article{deepdyn_10.3389/fcomp.2020.00035,
	title   = {Dynamic Deep Networks for Retinal Vessel Segmentation},
	author  = {Khanal, Aashis and Estrada, Rolando},
	year    = {2020},
	journal = {Frontiers in Computer Science},
	volume  = {2},
	pages   = {35},
	doi     = {10.3389/fcomp.2020.00035},
	issn    = {2624-9898}
}

@misc{2202.02382,
	title   = {Fully Automated Tree Topology Estimation and Artery-Vein Classification},
	author  = {Aashis Khanal and Saeid Motevali and Rolando Estrada},
	year    = {2022},
	eprint  = {arXiv:2202.02382}
}

Feature Highlights:

  • Minimal configuration to set up any simple or complex experiment (single-GPU, DP, and DDP usage).
  • A DataHandle that is always available and decoupled from other modules, enabling easy customization (ETDataHandle).
    • Use custom and complex data-handling mechanisms.
    • Load folder datasets.
    • Load large datasets recursively with multiple threads.
  • Full support for splitting images into patches and rejoining/merging them to recover the complete prediction image, as in U-Net (usually needed when input images are large and of different shapes), thanks to sparse data loaders.
  • Limit data loading: restrict the amount of data loaded to debug the pipeline without moving data from its original place (thanks to load_limit).
  • Heterogeneous dataset handling: use many dataset folders in a single experiment just by defining dataspecs (thanks to pooled runs).
  • Automatic k-fold cross-validation/automatic dataset split (example: num_folds=10, or split_ratio=[0.6, 0.2, 0.2]).
  • Simple, lightweight logger/plotter.
    • Plot: set log_header = 'Loss,F1,Accuracy' to plot all three in the same plot, or log_header = 'Loss|F1,Accuracy' to plot Loss in one plot and F1, Accuracy in another.
    • Logs: all arguments and generated data are saved in a logs.json file after the experiment finishes.
  • Gradient accumulation, automatic logging/plotting, and model checkpointing.
  • Multiple metric implementations in easytorch.metrics: precision, recall, accuracy, overlap, F1, ROC-AUC, confusion matrix, and more.
  • Support for advanced training with multiple networks and complex training steps.
  • Custom metrics can also be implemented.

Default arguments [default value]. Custom arguments can be added easily (see the sketch after this list).

  • -ph/--phase [Required]
    • Which phase to run: 'train' (runs the train, validation, and test steps) or 'test' (runs only the test step).
  • -b/--batch_size [4]
  • -ep/--epochs [11]
  • -lr/--learning_rate [0.001]
  • -gpus/--gpus [0]
    • List of GPUs to be used, e.g. [0], [1], [0, 1].
  • -nw/--num_workers [0]
    • Number of workers for data loading, so the CPU can keep up with the GPU when loading mini-batches.
  • -lim/--load-limit [None]
    • Limit on the number of images/files to load, for pipeline debugging.
  • -nf/--num_folds [None]
    • Number of folds in k-fold cross-validation (an integer such as 5 or 10).
  • -spl/--split_ratio [None]
    • Split ratio for train, validation, and test sets if three items are given; train and test if two items are given; train only if one item is given.
  • ...see more (DDP args)
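
A minimal sketch of adding a custom argument, assuming (as with num_class in the examples above) that extra keyword arguments passed to the EasyTorch constructor become available to the trainer through self.args; dropout_rate and NeuralNetModel are illustrative placeholders, not part of easytorch:

from easytorch import EasyTorch, ETTrainer

# dropout_rate is a hypothetical custom argument, not a built-in easytorch flag.
runner = EasyTorch(data_specifications, phase='train', batch_size=8, epochs=31,
                   num_class=2, dropout_rate=0.3)


class MyTrainer(ETTrainer):
    def _init_nn_model(self):
        # Custom arguments are read back the same way as built-in ones.
        self.nn['model'] = NeuralNetModel(out_size=self.args['num_class'],
                                          dropout=self.args['dropout_rate'])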
