Add your description here

Project description

sarasa

A minimum LLM training framework built on pure PyTorch with simplicity and extensibility.

[! CAUTION] sarasa is developed by an error-prone human and thus may contain many bugs. Use it at your own risk.

Installation

uv sync [--extra cpu|cu128|cu130] [--extra flash_attn]

uv add sarasa[cpu|cu128|cu130]

Features

Pure PyTorch implementation
Flexible configuration system with command-line overrides
Support from a single GPU to multiple GPUs (simple DDP and FSDP for now)
Selective activation checkpointing (SAC) for memory efficiency
Async distributed checkpoint saving / loading
Profiling
FP8 training
Post-training

Usage

It's (almost) ready to use. First, set up tokenizer, e.g.,

mkdir tokenizer
cd tokenizer
uvx hf download --local-dir . --include "tokenizer*" "meta-llama/Llama-3.1-8B"

Then, the following command starts training of a GPT model on FineWeb-edu with a single or multiple GPUs.

uv run torchrun --nproc_per_node="gpu" main.py \
--config-file configs/example.py \
[--train.local-batch-size 8 ...] # override config options as needed

For details, run

uv run torchrun --nproc_per_node="gpu" main.py --help

Extending `sarasa` with Custom Components

Extending sarasa is as simple as defining your own configuration dataclasses with create methods. Users can define custom configurations for models, optimizers, learning-rate schedulers, and datasets. Here's an example of using a custom optimizer:

from sarasa import Trainer, Config
from custom_optim import CustomOptimizer, CustomOptimizer2

@dataclass
class CustomOptim:
    lr: float = ...

    def create(self,
               model: torch.nn.Module
    ) -> torch.optim.Optimizer:
        return CustomOptimizer(model.parameters(), lr=self.lr, ...)

@dataclass
class CustomOptim2:
    lr: float = ...

    def create(self,
               model: torch.nn.Module
    ) -> torch.optim.Optimizer:
        return CustomOptimizer2(model.parameters(), lr=self.lr, ...)


if __name__ == "__main__":
    config = Config.from_cli(optim_type=CustomOptim | CustomOptim2)
    trainer = Trainer(config)
    trainer.train()

Thanks to tyro's type support, Sarasa can automatically recognize multiple custom optimizer types. From the command line, you can specify which custom optimizer to use:

python script.py optim:custom_optim --optim.lr 0.001 ...
# or
python script.py optim:custom_optim2 --optim.lr 0.002 ...

(As tyro automatically converts config class names from CamelCase to snake_case, config class names are recommended not to include Config suffixes.)

Config File Example

It's very simple. IDE autocompletion will help you.

from sarasa import Config, Data, LRScheduler, Model, Train, LRScheduler
from custom_optim import CustomOptim

# only one Config instance should be defined in each config file
config = Config.create(
    model=Model(num_layers=12),
    train=Train(
        local_batch_size=16,
        global_batch_size=256,
        dtype="bfloat16",
    ),
    optim=CustomOptim(lr=0.001),
    lr_scheduler=LRScheduler(
        decay_type="linear",
        warmup_steps=1000,
        total_steps=100000,
    ),
    data=Data(tokenizer_path="./tokenizer"),
    seed=12,
)

Acknowledgements

This project is heavily inspired by and borrows code from torchtitan.

Project details

Release history Release notifications | RSS feed

0.0.12

Mar 7, 2026

0.0.11

Feb 18, 2026

0.0.10

Feb 18, 2026

0.0.9

Feb 17, 2026

0.0.8

Feb 17, 2026

This version

0.0.6

Feb 14, 2026

0.0.5

Feb 4, 2026

0.0.4

Feb 4, 2026

0.0.3

Feb 1, 2026

0.0.2

Jan 31, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sarasa-0.0.6.tar.gz (32.9 kB view details)

Uploaded Feb 14, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

sarasa-0.0.6-py3-none-any.whl (36.9 kB view details)

Uploaded Feb 14, 2026 Python 3

File details

Details for the file sarasa-0.0.6.tar.gz.

File metadata

Download URL: sarasa-0.0.6.tar.gz
Upload date: Feb 14, 2026
Size: 32.9 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.10.2 {"installer":{"name":"uv","version":"0.10.2","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for sarasa-0.0.6.tar.gz
Algorithm	Hash digest
SHA256	`2cbe955e45277a1c87aaab13636f6a9392c29a465d8aef2cb80f5abad48386eb`
MD5	`651bf5509bc659b8885960dd254b9cca`
BLAKE2b-256	`b8d07b97ac9243646b7ddf20714d7fa5b9ee433bdaca7f989ef8167896211e6c`

See more details on using hashes here.

File details

Details for the file sarasa-0.0.6-py3-none-any.whl.

File metadata

Download URL: sarasa-0.0.6-py3-none-any.whl
Upload date: Feb 14, 2026
Size: 36.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.10.2 {"installer":{"name":"uv","version":"0.10.2","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for sarasa-0.0.6-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2d72dce59a3d565211659b659aa39bfd6c95aad54ed166e908504fefcb2494a9`
MD5	`1c32b12563fc59941a98f6d6eb08fc38`
BLAKE2b-256	`9b694d13848e8ee6222a4c61fa97ca681b8dbf89d04f067b53f98cf0a46ffb14`

See more details on using hashes here.

sarasa 0.0.6

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

sarasa

Installation

Features

Usage

Extending `sarasa` with Custom Components

Config File Example

Acknowledgements

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

sarasa 0.0.6

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

sarasa

Installation

Features

Usage

Extending sarasa with Custom Components

Config File Example

Acknowledgements

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Extending `sarasa` with Custom Components