A hackable Python package of code snippets for faster PyTorch AI development

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

Codebook

Installation

pip install codejournal

Note:

This project was developed to make coding and training models faster. The code is clean and hackable. It is not a production-ready project!

Features:

Easy loading and saving of models
WandB integration
Slack integration
Checkpoint tracking
Resuming training from checkpoints
Debug mode

TODO:

[] Adding schedulers support
[] Refactoring
[] Multi-GPU support

Example:

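# The star imports are assumed to provide os, torch, nn, F, ConfigBase, ModelBase, Trainer, and TrainerArgs
from codejournal.imports import *
from codejournal.modeling import *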

import torchvision.models as models
from torchvision import datasets, transforms

os.environ["HUGGINGFACE_TOKEN"] = "" # Put your HuggingFace token here
os.environ["SLACK_WEBHOOK_URL"] = "" # Put your Slack webhook URL here

class Config(ConfigBase):
    resnet: int = 18
    pretrained: bool = True
    num_classes: int = 10

class ResNet(ModelBase):
    def __init__(self, config):
        super().__init__(config)
        self.model = getattr(models, f'resnet{self.config.resnet}')(pretrained=self.config.pretrained)
        self.model.fc = nn.Linear(self.model.fc.in_features, self.config.num_classes)
        
    def forward(self, x):
        x = x.repeat(1, 3, 1, 1)  # MNIST images are single-channel; tile to the 3 channels ResNet expects
        return self.model(x)

    def training_step(self, batch, batch_idx):
        x,y = batch
        y_hat = self(x)
        loss = F.cross_entropy(y_hat, y)
        preds = torch.argmax(y_hat, dim=1)
        acc = torch.sum(preds == y).item() / y.size(0)
        return {'loss': loss, 'acc': acc}

    def validation_step(self, batch, batch_idx):
        x,y = batch
        y_hat = self(x)
        loss = F.cross_entropy(y_hat, y)
        preds = torch.argmax(y_hat, dim=1)
        acc = torch.sum(preds == y).item() / y.size(0)
        return {'loss': loss, 'acc': acc}
    
    def get_optimizer(self, trainer):
        # Override the default optimizer if needed
        return super().get_optimizer(trainer)
    
    def get_scheduler(self, optimizer, trainer):
        args = trainer.args  # For configuring the scheduler
        return super().get_scheduler(optimizer, trainer)


# Convert images to tensors and normalize with the commonly used MNIST mean and std
transform = transforms.Compose([
    transforms.ToTensor(),
    transforms.Normalize((0.1307,), (0.3081,)),
])

train_dataset = datasets.MNIST('./data', train=True, download=True,
                    transform=transform)
val_dataset = datasets.MNIST('./data', train=False, download=True,
                    transform=transform)

training_args = TrainerArgs(
    # Core Training Configuration
    batch_size=32,
    max_epochs=5,
    train_steps_per_epoch=800,
    val_steps_per_epoch=400,
    grad_accumulation_steps=1,
    lr=1e-5,
    optimizer="AdamW",
    optimizer_kwargs={},
    scheduler=None,
    scheduler_kwargs={},

    # Logging and Checkpointing
    log_every_n_steps=32,
    save_every_n_steps=100,
    n_best_checkpoints=3,  # Negative value for saving all checkpoints
    n_latest_checkpoints=2,  # Negative value for saving all checkpoints
    checkpoint_metric="loss",
    checkpoint_metric_type="val",
    checkpoint_metric_minimize=True,

    # Hardware and Precision
    device="mps",  # Auto inferred
    mixed_precision=False,
    grad_clip_norm=1.0,  # False for no clipping
    num_workers=0,

    # WandB Integration
    wandb_project="Demo",  # Requires a prior `wandb login`
    wandb_run_name="MNIST",
    wandb_run_id=None,
    wandb_resume="allow",
    wandb_kwargs={},
    disable_wandb=False,

    # Resume and Debugging
    resume_from_checkpoint=True,  # None for no resuming, True for resuming latest checkpoint, or path to checkpoint
    debug_mode=False,

    # Miscellaneous
    safe_dataloader=True,
    log_grad_norm=True,
    slack_notify=bool(os.environ.get("SLACK_WEBHOOK_URL")),
    results_dir="results",
    val_data_shuffle=False,
)

config = Config()
model = ResNet(config)

trainer = Trainer(training_args)
trainer.train(model, train_dataset, val_dataset)
if os.environ.get("HUGGINGFACE_TOKEN"):
    model.push_to_hub("demo-resnet")
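
The `n_best_checkpoints`, `n_latest_checkpoints`, and `checkpoint_metric_minimize` arguments in the example suggest a retention policy that keeps both the best-scoring and the most recent checkpoints. The sketch below shows one way such a policy could work. It is not codejournal's actual implementation: `Checkpoint` and `select_checkpoints_to_keep` are hypothetical names introduced here, and the metric values are made up for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Checkpoint:
    """Hypothetical record of a saved checkpoint (not a codejournal class)."""
    step: int
    metric: float  # e.g. validation loss


def select_checkpoints_to_keep(checkpoints, n_best=3, n_latest=2, minimize=True):
    """Return the checkpoints to keep, ordered by step.

    A negative n_best or n_latest keeps every checkpoint, mirroring the
    "negative value for saving all checkpoints" comments in the example.
    """
    if n_best < 0 or n_latest < 0:
        return sorted(checkpoints, key=lambda c: c.step)

    # Best checkpoints: lowest metric first when minimizing, highest otherwise.
    by_metric = sorted(checkpoints, key=lambda c: c.metric, reverse=not minimize)
    # Latest checkpoints: highest step first.
    by_step = sorted(checkpoints, key=lambda c: c.step, reverse=True)

    keep = set(by_metric[:n_best]) | set(by_step[:n_latest])
    return sorted(keep, key=lambda c: c.step)


# Illustrative history: one checkpoint every 100 steps with made-up losses.
history = [
    Checkpoint(step=100, metric=0.90),
    Checkpoint(step=200, metric=0.40),
    Checkpoint(step=300, metric=0.55),
    Checkpoint(step=400, metric=0.35),
    Checkpoint(step=500, metric=0.60),
    Checkpoint(step=600, metric=0.70),
]

kept = select_checkpoints_to_keep(history, n_best=3, n_latest=2, minimize=True)
print([c.step for c in kept])  # [200, 300, 400, 500, 600]
```

Only the checkpoint at step 100 is dropped here: it is neither among the three lowest losses nor among the two most recent saves.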

Release history

Version	Release date
0.2.0.1	Jan 9, 2025
0.2.0.0 (this version)	Jan 9, 2025
0.1.9.4	Jan 7, 2025
0.1.9.3	Jan 5, 2025
0.1.9.2	Jan 4, 2025
0.1.9.1	Jan 4, 2025
0.1.9	Jan 4, 2025
0.1.8	Jan 3, 2025
0.1.7	Jan 2, 2025
0.1.6	Dec 31, 2024
0.1.5	Dec 31, 2024
0.1.4	Dec 30, 2024
0.1.3	Dec 30, 2024
0.1.2	Dec 30, 2024
0.1.1	Dec 26, 2024
0.1.0	Dec 26, 2024

Download files

Source Distribution

codejournal-0.2.0.0.tar.gz (23.5 kB)

Uploaded Jan 9, 2025 Source

Built Distribution

codejournal-0.2.0.0-py3-none-any.whl (24.9 kB)

Uploaded Jan 9, 2025 Python 3

File details

Details for the file codejournal-0.2.0.0.tar.gz.

File metadata

Download URL: codejournal-0.2.0.0.tar.gz
Upload date: Jan 9, 2025
Size: 23.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.0.1 CPython/3.11.11

File hashes

Hashes for codejournal-0.2.0.0.tar.gz
Algorithm	Hash digest
SHA256	`162fac7087cbe1b284b0192cfca179ccb5e3dc8d5862449ff5dc50cde6218391`
MD5	`5465a6e3d22dd5cc674e7e95012aa572`
BLAKE2b-256	`2ed0a0efc04f42697a14e2d4c31248a405010b973e36ed48e94d7f3dff49c3fd`
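
The SHA256 digest listed above can be used to check a downloaded copy of the source distribution. The following is a minimal sketch using only the Python standard library. `sha256_of_file` and `matches_digest` are hypothetical helper names introduced here, and the local file path is an assumption about where the archive was saved.

```python
import hashlib
import hmac
from pathlib import Path

# SHA256 digest published for codejournal-0.2.0.0.tar.gz (copied from the table above)
EXPECTED_SHA256 = "162fac7087cbe1b284b0192cfca179ccb5e3dc8d5862449ff5dc50cde6218391"


def sha256_of_file(path, chunk_size=65536):
    """Hash the file in chunks so large downloads need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_digest(path, expected_hex):
    """Return True if the file's SHA256 equals the expected hex digest."""
    return hmac.compare_digest(sha256_of_file(path), expected_hex.lower())


if __name__ == "__main__":
    archive = Path("codejournal-0.2.0.0.tar.gz")  # assumed download location
    if archive.exists():
        print("digest OK" if matches_digest(archive, EXPECTED_SHA256) else "digest MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

The same check applies to the wheel by substituting its file name and its own SHA256 digest.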


File details

Details for the file codejournal-0.2.0.0-py3-none-any.whl.

File metadata

Download URL: codejournal-0.2.0.0-py3-none-any.whl
Upload date: Jan 9, 2025
Size: 24.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.0.1 CPython/3.11.11

File hashes

Hashes for codejournal-0.2.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1d1106d4fccf8cab9d4387055eb727e0c9589b4db5d61ee162825b77550b685f`
MD5	`46631f6e55d408b20a5c89b2021ba82f`
BLAKE2b-256	`36465920d8ee14fb285d892fb3564a0e9d0d4e97fe36e71b252d8dddf4e9849c`

