
Architecture experimentation and training infrastructure.


theseus

Have you ever wanted to train a language model from scratch but hate writing boilerplate? Previously, the solution was to work at a frontier lab with Research Engineers™.

Now the solution is to make Jack™ (and a cast of frontier coding models) do your research engineering. It will probably break a lot, but what the heck, at least I tried.

Download

It depends on who gave you computors to make warm:

  • cuda13: uv sync --group all --group cuda13
  • cuda12: uv sync --group all --group cuda12
  • you love Google: uv sync --group all --group tpu
  • you bought your own computors: uv sync --group all --group cpu

Quick Start

Use the CLI.

# List available jobs
theseus jobs

# Generate a config for data tokenization
theseus configure data/tokenize_variable_dataset tokenize.yaml \
    data.name=fineweb data.max_samples=1000000

# Run the tokenization locally
theseus run tokenize-fineweb tokenize.yaml ./output

# Generate a config for pretraining
theseus configure gpt/train/pretrain train.yaml \
    --chip h100 -n 8

# Run training locally
theseus run my-gpt-run train.yaml ./output

Quick Start, but You Have Infra

Set up ~/.theseus.yaml (see examples/dispatch.yaml, plus the sketch below), then submit jobs to remote clusters:

theseus submit my-run train.yaml --chip h100 -n 8
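
A rough sketch of the shape of ~/.theseus.yaml, using only the fields the JuiceFS section below also uses (root and work); the actual dispatch settings (SSH hosts, SLURM options, and so on) follow examples/dispatch.yaml and aren't guessed at here:

clusters:
  hpc:
    root: /mnt/shared/theseus   # shared root directory jobs read from and write to
    work: /scratch/theseus      # fast node-local scratch space

The JuiceFS section below adds mount and cache fields to the same cluster entry.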

Quickish Start

For programmatic configuration and rapid prototyping:

from theseus.quick import quick
from theseus.registry import JOBS

with quick("gpt/train/pretrain", "/path/to/output", "my-run") as j:
    j.config.training.per_device_batch_size = 16
    j.config.logging.checkpoint_interval = 4096
    j()  # run locally

# Or save config for later submission:
with quick("gpt/train/pretrain", "/path/to/output", "my-run") as j:
    j.config.training.per_device_batch_size = 16
    j.save("config.yaml", chip="h100", n_chips=8)

Not Quick Start at All

When you (or Claude) manage to find some time to chill, you can actually extend this package. The package is organized around theseus.job.BasicJobs, which can be extended with checkpointing and recovery tools.

The main workflow to start hacking:

  1. Take a look at how a model is composed in theseus.model.models.base.
  2. Bodge together whatever you want to change and add a new model in the models folder (be sure to add it to theseus.model.models.__init__).
  3. Write an experiment, which is a RestoreableJob. A very basic one can just inherit the normal trainer and that's about it; see theseus.experiments.gpt to get started (be sure to add it to theseus.experiments.__init__). A minimal example:
# theseus/experiments/my_model.py
from theseus.training.base import BaseTrainer, BaseTrainerConfig
from theseus.model.models import MyModel

class PretrainMyModel(BaseTrainer[BaseTrainerConfig, MyModel]):
    # Reuse the stock trainer loop; just point it at your model and config classes.
    MODEL = MyModel
    CONFIG = BaseTrainerConfig

    @classmethod
    def schedule(cls):
        # Learning-rate schedule for this experiment ("wsd" = warmup-stable-decay).
        return "wsd"

JuiceFS Integration

When you are on many remote computors but bursty, you may go "aw shucks, I need to copy like 50TB of pretraining data around, that's so lame!"

Don't worry, we gotchu. If you use the submit API, there is a way to ship your root directory around using JuiceFS, a distributed filesystem.

In your ~/.theseus.yaml, add the mount field (the JuiceFS metadata engine URL, a Redis instance in this example) to your cluster config:

clusters:
  hpc:
    root: /mnt/juicefs/theseus
    work: /scratch/theseus
    mount: redis://:password@redis.example.com:6379/0
    cache_size: 100G
    cache_dir: /scratch/juicefs-cache

(an incomplete list of) Features

  • CLI & Programmatic API: Configure and run jobs via theseus CLI or the quick() Python API
  • Remote Dispatch: Submit jobs to SLURM clusters or plain SSH hosts via ~/.theseus.yaml
  • Checkpointing & Recovery: Jobs are RestoreableJobs with built-in checkpoint/restore support
  • Data Pipelines: Tokenize datasets (blockwise or streaming) with data/tokenize_* jobs
  • JuiceFS Integration: Distributed filesystem support for sharing data across clusters
  • Multi-backend: CUDA 12/13, TPU, and CPU via uv sync --group
  • Extensible: Add models in theseus.model.models, experiments in theseus.experiments, and datasets in theseus.data.datasets
  • Dataclass Configs: Type-safe configuration via dataclasses with OmegaConf, easy configuration with the theseus.config.field dataclass extension, and Hydra-style cheeky CLI overrides (model.hidden_size=1024; see the sketch below)
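
The exact signature of theseus.config.field isn't shown here, so as a minimal sketch of the underlying pattern, this is stock dataclasses plus OmegaConf building a structured config and applying a Hydra-style dotlist override:

# Illustrative sketch only: plain dataclasses + OmegaConf, standing in for theseus's config layer.
from dataclasses import dataclass, field

from omegaconf import OmegaConf

@dataclass
class ModelConfig:
    hidden_size: int = 768
    n_layers: int = 12

@dataclass
class TrainConfig:
    model: ModelConfig = field(default_factory=ModelConfig)
    per_device_batch_size: int = 8

# Structured configs are type-checked; dotlist overrides mirror the CLI syntax.
cfg = OmegaConf.merge(
    OmegaConf.structured(TrainConfig),
    OmegaConf.from_dotlist(["model.hidden_size=1024"]),
)
print(OmegaConf.to_yaml(cfg))  # model.hidden_size is now 1024, still typed as int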
