d9d - d[istribute]d - distributed training framework based on PyTorch that tries to be efficient yet hackable

Maintainers

mrapplexz

Project description

The d9d Project

d9d is a distributed training framework built on top of PyTorch 2.0. It aims to be hackable, modular, and efficient, and is designed to scale from single-GPU debugging to massive clusters running 6D parallelism.

LET'S START TRAINING 🚀

Installation

Just use your favourite package manager:

pip install d9d
poetry add d9d
uv add d9d

Extras

d9d[aim]: Aim experiment tracker integration.
d9d[visualization]: Plotting libraries required for some advanced visualization functionality.
d9d[moe]: Efficient Mixture of Experts GPU kernels. You should build and install some dependencies manually before installation: DeepEP, grouped-gemm.
d9d[cce]: Efficient Fused Cross-Entropy kernels. You should build and install some dependencies manually before installation: Cut Cross Entropy.

Why another framework?

Distributed training frameworks such as Megatron-LM are monolithic: you run a script from the command line to train one of a set of predefined models under predefined regimes. While powerful, these systems can be difficult to hack and to integrate into novel research workflows. Their focus is often on providing a complete, end-to-end solution, which can limit flexibility for experiment-driven research.

Conversely, creating your own distributed training solution from scratch is tricky. You have to implement many low-level components (like distributed checkpoints and synchronization) that are identical across setups, and manually tackle common performance bottlenecks.

d9d was designed to fill the gap between monolithic frameworks and homebrew setups, providing a modular yet effective solution for distributed training.

What d9d is and isn't

In terms of core concept:

IS a pluggable framework for implementing distributed training regimes for your deep learning models.
IS built on clear interfaces and building blocks that may be composed and implemented in your own way.
IS NOT an all-in-one CLI platform for setting up pre-training and post-training like torchtitan, Megatron-LM, or torchforge.
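
To make "clear interfaces and building blocks that may be composed" a bit more concrete, here is a minimal sketch of the general pattern in plain Python. It is not d9d's actual API: `ParallelizeStrategy`, `Replicate`, `ShardExperts`, and `build_plan` are hypothetical names invented for illustration, and the strategies only return descriptive strings instead of touching real modules.

```python
from typing import Protocol, runtime_checkable


@runtime_checkable
class ParallelizeStrategy(Protocol):
    """Hypothetical interface: describes how one named layer is placed."""

    def apply(self, layer_name: str) -> str: ...


class Replicate:
    """Hypothetical building block: keep a full copy on every rank."""

    def apply(self, layer_name: str) -> str:
        return f"{layer_name}: replicated on every rank"


class ShardExperts:
    """Hypothetical building block: split experts across ranks."""

    def __init__(self, num_ranks: int) -> None:
        self.num_ranks = num_ranks

    def apply(self, layer_name: str) -> str:
        return f"{layer_name}: experts sharded across {self.num_ranks} ranks"


def build_plan(assignments: dict[str, ParallelizeStrategy]) -> list[str]:
    """Compose independent per-layer strategies into one plan."""
    return [strategy.apply(name) for name, strategy in assignments.items()]


# Any object with a matching `apply` method can be plugged in; no base class needed.
plan = build_plan({
    "embed": Replicate(),
    "moe_block": ShardExperts(num_ranks=8),
})
```

The point of the sketch is that the caller, not the framework, decides which strategy goes with which layer, and a custom strategy only has to satisfy the small interface.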

In terms of codebase & engineering:

IS built on a strong engineering foundation: We enforce strict type-checking and rigorous linting to catch errors before execution.
IS reliable: The framework is backed by a suite of over 450 tests, covering unit logic, integration flows, and end-to-end distributed scenarios.
IS eager to use performance hacks (like DeepEP or custom kernels) if they improve MFU, even if they aren't PyTorch-native.
IS NOT for legacy setups: We do not maintain backward compatibility with older PyTorch versions or hardware. We prioritize simplicity and modern APIs (like DTensor).

Key Philosophies

To achieve the balance between hackability and performance, d9d adheres to specific design principles:

Composition over Monoliths: We avoid "God Classes" like DistributedDataParallel or ParallelDims that assume ownership of the entire execution loop. Instead, we provide composable and extensible APIs, for instance horizontal parallelism strategies that target specific layers (parallelize_replicate, parallelize_expert_parallel, ...).
White-Box Modelling: We encourage standard PyTorch code. Models are not wrapped in obscure metadata specifications; they are standard nn.Modules that implement lightweight protocols.
Pragmatic Efficiency: While we prefer native PyTorch, we are eager to integrate non-native solutions if they improve MFU. For example, we implement MoE using DeepEP communications, reindexing kernels from Megatron-LM, and efficient grouped-GEMM implementations.
Graph-Based State Management: Our IO system treats model checkpoints as directed acyclic graphs. This allows you to transform architectures (e.g., merging q, k, v into qkv) on-the-fly while streaming from disk, without massive memory overhead.
DTensors: We mandate that distributed parameters be represented as torch.distributed.tensor.DTensor. This simplifies checkpointing by making them topology-aware automatically. We leverage modern PyTorch 2.0 APIs (DeviceMesh) as much as possible.
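
The graph-based state management idea can be sketched roughly as follows. This is an illustration of the concept only, not d9d's IO implementation: the names `SOURCE`, `GRAPH`, and `stream_transformed` are hypothetical, plain Python lists stand in for tensors, and the graph is simplified to a single layer of target nodes that depend directly on source nodes.

```python
from typing import Callable, Iterator

Tensor = list[float]  # stand-in for a real tensor so the sketch needs no torch

# Simulated on-disk checkpoint: nothing is materialised until a loader is called.
SOURCE: dict[str, Callable[[], Tensor]] = {
    "attn.q": lambda: [1.0, 2.0],
    "attn.k": lambda: [3.0, 4.0],
    "attn.v": lambda: [5.0, 6.0],
    "mlp.w": lambda: [7.0],
}


def concat(*parts: Tensor) -> Tensor:
    """Join the inputs in order, like merging q, k, v into one qkv weight."""
    out: Tensor = []
    for part in parts:
        out.extend(part)
    return out


def identity(part: Tensor) -> Tensor:
    """Pass a weight through unchanged."""
    return part


# Each target node lists the source nodes it depends on and how to combine them.
GRAPH: dict[str, tuple[list[str], Callable[..., Tensor]]] = {
    "attn.qkv": (["attn.q", "attn.k", "attn.v"], concat),
    "mlp.w": (["mlp.w"], identity),
}


def stream_transformed(
    graph: dict[str, tuple[list[str], Callable[..., Tensor]]],
    source: dict[str, Callable[[], Tensor]],
) -> Iterator[tuple[str, Tensor]]:
    """Yield one transformed target at a time, loading only its dependencies."""
    for target, (deps, transform) in graph.items():
        inputs = [source[name]() for name in deps]
        # Rebinding `inputs` on the next iteration releases this node's sources.
        yield target, transform(*inputs)


result = dict(stream_transformed(GRAPH, SOURCE))
```

Because each target pulls in only the sources it depends on, peak memory in this sketch is bounded by the largest single node rather than by the whole checkpoint.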

Examples

Qwen3-MoE Pretraining

An example showing causal LM pretraining for the Qwen3-MoE model.

WIP: MoE load balancing is currently a work in progress.

Link.

Release history

0.14.0: May 21, 2026
0.13.1: Apr 23, 2026
0.13.0: Apr 20, 2026
0.12.0: Apr 14, 2026
0.11.0: Apr 13, 2026
0.10.0: Apr 13, 2026
0.9.0: Apr 8, 2026
0.8.0: Mar 22, 2026
0.7.0: Mar 18, 2026
0.6.0: Mar 11, 2026
0.5.4: Mar 10, 2026
0.5.3: Mar 1, 2026
0.5.2: Feb 24, 2026
0.5.1: Feb 12, 2026
0.5.0: Feb 12, 2026
0.4.0: Feb 10, 2026
0.3.0: Feb 9, 2026
0.2.4: Feb 9, 2026
0.2.3: Feb 5, 2026
0.2.2: Feb 5, 2026
0.2.1: Feb 5, 2026
0.2.0 (this version): Feb 4, 2026
0.1.1: Feb 4, 2026
0.1.0: Feb 4, 2026

Download files

Source Distribution

d9d-0.2.0.tar.gz (151.6 kB)

Uploaded Feb 4, 2026 Source

Built Distribution

d9d-0.2.0-py3-none-any.whl (240.1 kB)

Uploaded Feb 4, 2026 Python 3

File details

Details for the file d9d-0.2.0.tar.gz.

File metadata

Download URL: d9d-0.2.0.tar.gz
Upload date: Feb 4, 2026
Size: 151.6 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for d9d-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`44133a249068dd7f0e642d101a0ed0e906427fae65a312e568a573fdc769bbad`
MD5	`53d4b4b37a87dbd36bcd6bea218634ed`
BLAKE2b-256	`19e2eb422da903ceb93370f2f899c9e3d86f5f5f5320f837208f8da5be4ac677`
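
If you download the archive manually, you can check it against the SHA256 digest listed above. The following is a minimal sketch using only the Python standard library; the helper names `sha256_of` and `verify` are our own, and the script assumes the file was saved as d9d-0.2.0.tar.gz in the current directory.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash the file in chunks so large archives are never fully loaded."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path: Path, expected_hex: str) -> bool:
    """Compare the computed digest with the published one, ignoring case."""
    return sha256_of(path) == expected_hex.lower()


# SHA256 digest published for d9d-0.2.0.tar.gz in the table above.
EXPECTED = "44133a249068dd7f0e642d101a0ed0e906427fae65a312e568a573fdc769bbad"

archive = Path("d9d-0.2.0.tar.gz")  # assumed local download location
if archive.exists():
    print("OK" if verify(archive, EXPECTED) else "MISMATCH")
```

The same check works for the wheel by substituting its file name and its own SHA256 digest.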

Provenance

The following attestation bundles were made for d9d-0.2.0.tar.gz:

Publisher: release.yml on d9d-project/d9d

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: d9d-0.2.0.tar.gz
- Subject digest: 44133a249068dd7f0e642d101a0ed0e906427fae65a312e568a573fdc769bbad
- Sigstore transparency entry: 914787472
- Sigstore integration time: Feb 4, 2026
Source repository:
- Permalink: d9d-project/d9d@6bd6e72fc456087ad0d2c762095724ee5310cb9d
- Branch / Tag: refs/heads/main
- Owner: https://github.com/d9d-project
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@6bd6e72fc456087ad0d2c762095724ee5310cb9d
- Trigger Event: push

File details

Details for the file d9d-0.2.0-py3-none-any.whl.

File metadata

Download URL: d9d-0.2.0-py3-none-any.whl
Upload date: Feb 4, 2026
Size: 240.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for d9d-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c8df0c2963a15f193ac824be7c93a41a25ea2904b46fced27788f1c0f1dadf6f`
MD5	`ec3b75003561a90a2517c2e5ad6d0d90`
BLAKE2b-256	`ff1225d87575a7e2df647cdcb2ed7e0de1c39beb11df9ae926aa993c42242d40`

Provenance

The following attestation bundles were made for d9d-0.2.0-py3-none-any.whl:

Publisher: release.yml on d9d-project/d9d

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: d9d-0.2.0-py3-none-any.whl
- Subject digest: c8df0c2963a15f193ac824be7c93a41a25ea2904b46fced27788f1c0f1dadf6f
- Sigstore transparency entry: 914787600
- Sigstore integration time: Feb 4, 2026
Source repository:
- Permalink: d9d-project/d9d@6bd6e72fc456087ad0d2c762095724ee5310cb9d
- Branch / Tag: refs/heads/main
- Owner: https://github.com/d9d-project
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@6bd6e72fc456087ad0d2c762095724ee5310cb9d
- Trigger Event: push
