QuNet
Makes working with deep learning models easy.
- Trainer class for training the model.
- Various tools for visualizing the training process and the state of the model.
- Training large models: float16, mini-batch splitting, etc.
- Large set of custom modules for neural networks (MLP, CNN, Transformer, etc.)
Install
pip install qunet
Usage
To use the library, it is enough to add a training_step(batch, batch_id) method
to the model, in which you compute the loss and, if necessary, some quality metrics.
For example, for 1D linear regression $y=f(x)$ with MSE loss and |y_pred - y_true| as the metric, the model looks like this:
```python
import torch
import torch.nn as nn

class Model(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc = nn.Linear(1, 1)

    def forward(self, x):                                   # (B,1)
        return self.fc(x)                                   # (B,1)

    def training_step(self, batch, batch_id):
        x, y_true = batch                                   # the model knows the minibatch format
        y_pred = self(x)                                    # (B,1) forward call
        loss = (y_pred - y_true).pow(2).mean()              # ()  scalar loss for optimization
        error = torch.abs(y_pred.detach() - y_true).mean()  # ()  mean absolute error as a score
        return {'loss': loss, 'score': error}               # if there is no score, you can return just the loss

model = Model()
```
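To make the training_step contract concrete, here is a minimal sketch of how a trainer can consume it. This is plain PyTorch for illustration, not QuNet's actual implementation; the `train_epoch` helper is hypothetical.

```python
import torch
import torch.nn as nn

class Model(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc = nn.Linear(1, 1)

    def forward(self, x):
        return self.fc(x)

    def training_step(self, batch, batch_id):
        x, y_true = batch
        y_pred = self(x)
        loss = (y_pred - y_true).pow(2).mean()
        error = torch.abs(y_pred.detach() - y_true).mean()
        return {'loss': loss, 'score': error}

def train_epoch(model, batches, optimizer):
    """One epoch: call training_step on each batch and optimize its 'loss'."""
    losses = []
    for batch_id, batch in enumerate(batches):
        out = model.training_step(batch, batch_id)
        loss = out['loss'] if isinstance(out, dict) else out
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
        losses.append(loss.item())
    return sum(losses) / len(losses)

torch.manual_seed(0)
X = torch.rand(64, 1)
Y = 2 * X
batches = [(X[i:i + 16], Y[i:i + 16]) for i in range(0, 64, 16)]
model = Model()
opt = torch.optim.SGD(model.parameters(), lr=1e-2)
loss_first = train_epoch(model, batches, opt)
for _ in range(50):
    loss_last = train_epoch(model, batches, opt)
print(loss_last < loss_first)  # loss decreases over epochs
```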
Training and validation datasets can be standard PyTorch DataLoader objects. For small datasets, you can also use the faster Data loader from the library:
```python
from qunet import Data, Trainer

num, val = 1000, 900
X = torch.rand(num, 1)            # (num,1) to match nn.Linear(1, 1)
Y = 2 * X + torch.randn(X.shape)
data_trn = Data((X[:val], Y[:val]))
data_val = Data((X[val:], Y[val:]))
```
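Equivalently, the same data can be wrapped in a standard PyTorch DataLoader; this sketch uses only stock PyTorch, not QuNet's API:

```python
import torch
from torch.utils.data import TensorDataset, DataLoader

num, val = 1000, 900
X = torch.rand(num, 1)            # (num,1) features
Y = 2 * X + torch.randn(X.shape)  # noisy targets

data_trn = DataLoader(TensorDataset(X[:val], Y[:val]), batch_size=64, shuffle=True)
data_val = DataLoader(TensorDataset(X[val:], Y[val:]), batch_size=64)

x, y = next(iter(data_trn))
print(x.shape, y.shape)  # torch.Size([64, 1]) torch.Size([64, 1])
```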
After that, we create a trainer instance and pass the model and data to it. We then set the optimizer and start training:
```python
trainer = Trainer(model, data_trn, data_val)
trainer.set_optimizer(torch.optim.SGD(model.parameters(), lr=1e-2))
trainer.fit(epochs=10, period_plot=5, monitor=['loss'])
```
That's all!
Let's take a brief tour of the library. A more detailed introduction can be found in the Quick start document, in the documents describing the various modules of the library, and in the notebooks dedicated to various deep learning tasks.
Trainer
The trainer is a key object of the QuNet library. It solves the following tasks:
- Model training and validation.
- Visualization of the learning process, with ample opportunities for customization.
- Calculation of optimal breakpoints based on the best local and smoothed metrics.
- Saving the best models by loss or score, as well as saving checkpoints.
- Combining different training schedulers.
- For large models, switching to half precision and using a gradient accumulation buffer.
- Using multiple callback objects that can be embedded in different parts of the pipeline.
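The "mini-batch splitting" mentioned above is gradient accumulation. As a minimal sketch in plain PyTorch (not QuNet's internal code), accumulating scaled micro-batch losses reproduces the full-batch gradient in one optimizer step:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
model = nn.Linear(1, 1)
X = torch.rand(32, 1)
Y = 2 * X

# Accumulate gradients over 4 micro-batches of 8 samples each.
accum = 4
model.zero_grad()
for x, y in zip(X.split(8), Y.split(8)):
    loss = (model(x) - y).pow(2).mean() / accum  # scale so grads match the full batch
    loss.backward()                              # gradients sum up in .grad
grad_accum = model.weight.grad.clone()

# Reference: a single full-batch backward pass.
model.zero_grad()
(model(X) - Y).pow(2).mean().backward()
grad_full = model.weight.grad.clone()

print(torch.allclose(grad_accum, grad_full, atol=1e-6))  # True
```

Scaling each micro-batch loss by 1/accum is what makes the accumulated gradient identical to the full-batch one, since the mean over the whole batch is the average of the micro-batch means.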
Below is an example of visualization:
```
val_loss:  best = 0.190465[293], smooth21 = 0.199713[296], last21 = 0.210965 ± 0.019436
trn_loss:  best = 0.209042[234], smooth21 = 0.244457[299], last21 = 0.293281 ± 0.043728
val_score: best = 0.942300[291], smooth21 = 0.938188[295], last21 = 0.934581 ± 0.000000
trn_score: best = 0.929560[234], smooth21 = 0.916017[299], last21 = 0.898531 ± 0.005823

epochs=300, samples=15000000, steps=30000
times=(trn:214.34, val:11.69)m, 42.87 s/epoch, 428.68 s/10^3 steps, 857.35 s/10^6 samples
```
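Statistics like best and last21 in the report above can be reproduced from a history of per-epoch metric values. A minimal sketch (the fabricated loss curve is an assumption for illustration; last21 is the mean ± std over the last 21 epochs, and the number in brackets is the epoch of the best value):

```python
import torch

torch.manual_seed(0)
# Fake val_loss history: a decaying curve plus noise, one value per epoch.
history = 1.0 / torch.arange(1, 301) + 0.01 * torch.rand(300)

best_val, best_epoch = history.min(dim=0)  # best value and the epoch it occurred
last21 = history[-21:]                     # window of the most recent 21 epochs
print(f"best = {best_val.item():.6f}[{best_epoch.item()}], "
      f"last21 = {last21.mean().item():.6f} ± {last21.std().item():.6f}")
```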
Example of learning curves of various schedulers:
ModelState
The standalone ModelState class is a powerful replacement for libraries such as torchinfo.
It displays information about submodules and their parameters:
```
Transformer                                       params            data
├─ ModuleList                                                                               < blocks
│  └─ TransformerBlock                     (1, 10, 64) -> (1, 10, 64)      < blocks[0]
│     └─ Residual                          (1, 10, 64) -> (1, 10, 64)      < blocks[0].fft
│        └─ FFT                            (1, 10, 64) -> (1, 10, 64)      < blocks[0].fft.module
│           └─ Dropout(0)                  (1, 10, 64) -> (1, 10, 64)      < blocks[0].fft.module.drop
│        └─ LayerNorm               128 |  (1, 10, 64) -> (1, 10, 64)      < blocks[0].fft.norm
│     └─ Residual                          (1, 10, 64) -> (1, 10, 64)      < blocks[0].att
│        └─ Attention                      (1, 10, 64) -> (1, 10, 64)      < blocks[0].att.module
│           └─ Linear(64->192)  12,480 ~ 25% | (1, 10, 64) -> (1, 10, 192)    < blocks[0].att.module.c_attn
│           └─ Linear(64->64)    4,160 ~  8% | (1, 10, 64) -> (1, 10, 64)     < blocks[0].att.module.c_proj
│           └─ Dropout(0)                  (1, 4, 10, 10) -> (1, 4, 10, 10)   < blocks[0].att.module.att_dropout
│           └─ Dropout(0)                  (1, 10, 64) -> (1, 10, 64)      < blocks[0].att.module.res_dropout
│        └─ LayerNorm               128 |  (1, 10, 64) -> (1, 10, 64)      < blocks[0].att.norm
│     └─ Residual                          (1, 10, 64) -> (1, 10, 64)      < blocks[0].mlp
│        └─ MLP                            (1, 10, 64) -> (1, 10, 64)      < blocks[0].mlp.module
│           └─ Sequential                  (1, 10, 64) -> (1, 10, 64)      < blocks[0].mlp.module.layers
│              └─ Linear(64->256) 16,640 ~ 33% | (1, 10, 64) -> (1, 10, 256)  < blocks[0].mlp.module.layers[0]
│              └─ GELU                     (1, 10, 256) -> (1, 10, 256)    < blocks[0].mlp.module.layers[1]
│              └─ Dropout(0)               (1, 10, 256) -> (1, 10, 256)    < blocks[0].mlp.module.layers[2]
│              └─ Linear(256->64) 16,448 ~ 33% | (1, 10, 256) -> (1, 10, 64)  < blocks[0].mlp.module.layers[3]
│        └─ LayerNorm               128 |  (1, 10, 64) -> (1, 10, 64)      < blocks[0].mlp.norm
=============================================
trainable: 50,115
```
During training, ModelState keeps track of gradients and smooths their values:
```
#  params                          |mean|  [    min,    max]   |grad|   shape
-------------------------------------------------------------------------------------
0: blocks.0.fft.gamma          1    0.200  [  0.200,  0.200]  1.3e+02   ()
1: blocks.0.fft.norm.weight   64    1.000  [  1.000,  1.000]  4.7e-01   (64,)
2: blocks.0.fft.norm.bias     64    0.000  [  0.000,  0.000]  2.2e-01   (64,)
...
```
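A table like this can be approximated in plain PyTorch (a sketch, not ModelState's actual code) by iterating over named_parameters() after a backward pass:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
model = nn.Linear(4, 2)
model(torch.rand(8, 4)).pow(2).mean().backward()  # populate .grad

rows = []
for i, (name, p) in enumerate(model.named_parameters()):
    rows.append((name, p.numel(), tuple(p.shape)))
    print(f"{i}: {name:12s} {p.numel():6d} {p.abs().mean().item():8.3f} "
          f"[{p.min().item():8.3f}, {p.max().item():8.3f}] "
          f"{p.grad.norm().item():8.1e} {tuple(p.shape)}")
```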
Modules
The library has many ready-made modules for building various neural network architectures:
- MLP
- Transformer
- CNN
- ResCNN
- ProjViT
- ResCNN3D
- GNN
Most modules have debugging and visualization tools. For example, this is how the visualization of the learning process of a transformer consisting of 10 blocks looks.
Such diagrams let you analyze the problem areas of the network and change them during training.
Docs
Examples
- Interpolation_F(x) - interpolation of a function of one variable (example of setting up a training plot; working with the list of schedulers; adding a custom plot)
- MNIST - recognition of handwritten digits 0-9 (example using pytorch DataLoader, model predict, show errors, confusion matrix)
- CIFAR10 (truncated EfficientNet, pre-trained parameters, backbone freezing, augmentation)
- Vanishing gradient
- Regression_1D - visualization of changes in model parameters