AXIOM - High-performance quantized optimizer with 73% memory savings

QuarterBit AXIOM

High-Performance Quantized Optimizer for PyTorch

Drop-in replacement for AdamW with significant memory savings.
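
To put the memory claim in concrete terms, the sketch below estimates the optimizer-state memory of standard AdamW, which keeps two FP32 tensors (first and second moment) per parameter, and then applies the 73% figure from the package summary. It assumes the savings apply to optimizer state only; the source does not state the baseline the figure is measured against. The function names and the 1B-parameter model are hypothetical.

```python
def adamw_state_bytes(num_params: int, bytes_per_value: int = 4) -> int:
    """Optimizer-state bytes for standard AdamW: two moment tensors per parameter."""
    return 2 * num_params * bytes_per_value


def state_bytes_after_savings(baseline_bytes: int, savings_pct: float) -> float:
    """Bytes left after a percentage saving (assumed to apply to optimizer state only)."""
    return baseline_bytes * (1.0 - savings_pct / 100.0)


num_params = 1_000_000_000  # hypothetical 1B-parameter model
baseline = adamw_state_bytes(num_params)
remaining = state_bytes_after_savings(baseline, 73.0)
print(f"AdamW state: {baseline / 1e9:.2f} GB, after 73% savings: {remaining / 1e9:.2f} GB")
```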

Installation

pip install quarterbit

Requirements:

Python 3.8+
PyTorch 1.8+
NVIDIA GPU with CUDA

Supported GPUs:

Consumer: GTX 1650+, RTX 20/30/40 series
Data Center: T4, V100, A10, A100, L4, L40, H100, H200

Quick Start

from quarterbit import Axiom

# `model` and `dataloader` are assumed to be defined; `model(batch)` must return a scalar loss
optimizer = Axiom(model.parameters())

for batch in dataloader:
    loss = model(batch)
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()

Optimizer Parameters

AXIOM ships with tuned defaults. Advanced users can override the following arguments:

from quarterbit import Axiom

optimizer = Axiom(
    params,                    # Model parameters
    lr=5e-4,                   # Learning rate
    betas=(0.9, 0.999),        # Momentum coefficients
    eps=1e-8,                  # Numerical stability
    weight_decay=0.1,          # Decoupled weight decay
    total_steps=10000,         # Training steps (for scheduling)
    warmup_ratio=0.1,          # Warmup fraction
)
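
The `total_steps` and `warmup_ratio` arguments together determine how long warmup lasts. The sketch below works through the arithmetic for the values above and applies a linear ramp. The linear shape and the helper name `warmup_lr` are assumptions for illustration; the source does not document AXIOM's internal schedule.

```python
def warmup_lr(step: int, base_lr: float, total_steps: int, warmup_ratio: float) -> float:
    """Learning rate at `step` under an assumed linear warmup, constant afterwards."""
    warmup_steps = round(total_steps * warmup_ratio)
    if warmup_steps > 0 and step < warmup_steps:
        # Ramp from base_lr / warmup_steps up to base_lr over the warmup window.
        return base_lr * (step + 1) / warmup_steps
    return base_lr


# With total_steps=10000 and warmup_ratio=0.1, warmup covers the first 1000 steps.
steps = round(10000 * 0.1)
print(steps, warmup_lr(0, 5e-4, 10000, 0.1), warmup_lr(5000, 5e-4, 10000, 0.1))
```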

Per-Layer Learning Rates

optimizer = Axiom([
    {'params': model.backbone.parameters(), 'lr': 1e-4},
    {'params': model.head.parameters(), 'lr': 5e-4},
], weight_decay=0.1)

Activation Checkpointing (Pro)

Pro-tier activation checkpointing that compresses each activation and keeps it in a numbered slot.

from quarterbit.torch.utils import ActivationCheckpoint

# Create checkpoint storage
actcp = ActivationCheckpoint(
    max_slots=24,              # Number of layers/checkpoints
    max_elements=2**20,        # Max elements per activation
)

# During forward pass - store activations
actcp.store(activation, slot=layer_idx)

# During backward pass - restore activations
restored = actcp.restore(slot=layer_idx)

# Check memory savings
info = actcp.memory_info()
print(f"Memory saved: {info['savings_pct']:.1f}%")

# Clear when done
actcp.clear()

Parameters:

Parameter	Description
`max_slots`	Number of checkpoint slots (typically num_layers)
`max_elements`	Maximum tensor size per slot

Methods:

Method	Description
`store(tensor, slot)`	Compress and store activation
`restore(slot)`	Decompress and return activation
`clear(slot=None)`	Clear one slot or all
`memory_info()`	Get memory usage statistics
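
To illustrate the slot-based idea behind the methods above, here is a minimal pure-Python sketch. It is not QuarterBit's implementation: the real class operates on GPU tensors and its compression scheme is not documented in the source. The class name `ToySlotStore` and the 8-bit linear quantization are assumptions chosen only to make the store/restore round trip concrete.

```python
class ToySlotStore:
    """Toy slot store: keeps each activation as 8-bit codes plus an offset and scale."""

    def __init__(self, max_slots: int):
        self.slots = [None] * max_slots

    def store(self, values, slot: int) -> None:
        lo, hi = min(values), max(values)
        scale = (hi - lo) / 255 or 1.0  # fall back to 1.0 for constant input
        codes = bytes(min(255, max(0, round((v - lo) / scale))) for v in values)
        self.slots[slot] = (codes, lo, scale)

    def restore(self, slot: int):
        codes, lo, scale = self.slots[slot]
        return [lo + c * scale for c in codes]

    def clear(self, slot=None) -> None:
        if slot is None:
            self.slots = [None] * len(self.slots)
        else:
            self.slots[slot] = None

    def memory_info(self) -> dict:
        stored = sum(len(s[0]) for s in self.slots if s is not None)
        original = stored * 4  # each 1-byte code stands in for one 4-byte FP32 value
        pct = 100.0 * (1 - stored / original) if original else 0.0
        return {"stored_bytes": stored, "original_bytes": original, "savings_pct": pct}


store = ToySlotStore(max_slots=2)
store.store([0.0, 0.5, 1.0, -1.0], slot=0)
print(store.restore(0), store.memory_info()["savings_pct"])
```

In this toy, restored values differ from the originals by at most half a quantization step, and the reported saving ignores the per-slot offset and scale.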

Gradient Compression (Pro)

Drift-free compression with error feedback for distributed training.

from quarterbit.torch.utils import GradientCompressor

# Create compressor for each parameter
compressor = GradientCompressor(num_elements=param.numel())

# Compress before all-reduce
compressed = compressor.compress(param.grad)

# ... communicate compressed gradients ...

# Decompress after communication
param.grad = compressor.decompress(compressed)

# Reset error feedback (optional, between epochs)
compressor.reset()

Parameters:

Parameter	Description
`num_elements`	Number of gradient elements

Methods:

Method	Description
`compress(grads)`	Compress with error feedback
`decompress(compressed)`	Decompress to FP32
`reset()`	Clear error feedback accumulator
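
Error feedback can be sketched in a few lines of plain Python. The example below uses 1-bit sign compression with a mean-magnitude scale, a common textbook choice that is assumed here and not taken from QuarterBit, whose compression format is not documented in the source. The class name `ToyErrorFeedbackCompressor` is hypothetical.

```python
class ToyErrorFeedbackCompressor:
    """Sign compression with an error-feedback accumulator (illustrative only)."""

    def __init__(self, num_elements: int):
        self.residual = [0.0] * num_elements

    def compress(self, grads):
        # Add back whatever earlier compression steps failed to transmit.
        corrected = [g + r for g, r in zip(grads, self.residual)]
        scale = sum(abs(c) for c in corrected) / len(corrected)
        signs = [1 if c >= 0 else -1 for c in corrected]
        # Store the part of the corrected gradient the compressed form cannot represent.
        self.residual = [c - s * scale for c, s in zip(corrected, signs)]
        return signs, scale

    def decompress(self, compressed):
        signs, scale = compressed
        return [s * scale for s in signs]

    def reset(self) -> None:
        self.residual = [0.0] * len(self.residual)


comp = ToyErrorFeedbackCompressor(num_elements=3)
for grads in ([0.5, -0.1, 0.02], [0.3, 0.2, -0.4]):
    print(comp.decompress(comp.compress(grads)), comp.residual)
```

In this sketch, the sum of all decompressed gradients plus the current residual always equals the sum of the true gradients, so compression error is carried forward instead of being lost.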

Multi-GPU Training

DataParallel

model = torch.nn.DataParallel(model)
optimizer = Axiom(model.parameters())

DistributedDataParallel

from torch.nn.parallel import DistributedDataParallel as DDP

model = DDP(model, device_ids=[local_rank])
optimizer = Axiom(model.parameters())

DeepSpeed

import deepspeed

model, optimizer, _, _ = deepspeed.initialize(
    model=model,
    optimizer=Axiom(model.parameters()),
    config=ds_config
)

Checkpointing

# Save
torch.save({
    'model': model.state_dict(),
    'optimizer': optimizer.state_dict(),
    'step': step,
}, 'checkpoint.pt')

# Load
ckpt = torch.load('checkpoint.pt')
model.load_state_dict(ckpt['model'])
optimizer.load_state_dict(ckpt['optimizer'])
step = ckpt['step']  # resume the step counter saved above

Licensing

quarterbit activate <LICENSE_KEY>

Tier	GPU Hours	Features
Free	10/month	Optimizer only
Pro	1,000/month	+ Checkpointing, Compression
Enterprise	Unlimited	+ On-premise, Custom SLA

Get your key: quarterbit.dev/pricing

Release history

Version	Released	Status	Yank reason
50.3.0	Mar 16, 2026	-	-
50.2.0	Mar 16, 2026	-	-
50.1.0	Mar 15, 2026	-	-
50.0.0	Mar 15, 2026	-	-
8.4.1	Feb 11, 2026	yanked	-
8.4.0	Feb 11, 2026	yanked	-
8.3.4	Feb 11, 2026	yanked	-
8.3.3	Feb 11, 2026	yanked	-
8.3.2	Feb 11, 2026	yanked	-
8.3.1	Feb 11, 2026	yanked	-
8.3.0	Feb 11, 2026	yanked	-
8.2.0	Feb 11, 2026	yanked	-
8.1.1	Feb 10, 2026	yanked	-
8.1.0	Feb 10, 2026	yanked	-
8.0.9	Feb 10, 2026	yanked	-
8.0.8	Feb 10, 2026	yanked	-
8.0.6	Feb 10, 2026	yanked	-
8.0.5	Feb 10, 2026	yanked	-
8.0.4	Feb 10, 2026	yanked	-
8.0.3	Feb 10, 2026	yanked	-
8.0.2	Feb 10, 2026	yanked	-
8.0.1	Feb 10, 2026	yanked	-
8.0.0	Feb 10, 2026	yanked	-
7.1.0	Feb 6, 2026	yanked	-
7.0.0	Feb 6, 2026	yanked	-
4.0.3	Feb 5, 2026	yanked	-
4.0.2	Feb 5, 2026	yanked	-
4.0.1	Feb 5, 2026	yanked	-
4.0.0	Feb 5, 2026	yanked	-
2.0.4	Feb 4, 2026	yanked	-
2.0.3	Feb 4, 2026	yanked	-
2.0.2	Feb 4, 2026	yanked	-
2.0.1	Feb 4, 2026	yanked	-
2.0.0	Feb 4, 2026	yanked	yank
1.0.6	Feb 9, 2026	yanked	-
1.0.5	Feb 6, 2026	yanked	-
1.0.3 (this version)	Feb 6, 2026	yanked	-
1.0.2	Feb 6, 2026	yanked	-
1.0.1	Feb 6, 2026	yanked	-
1.0.0	Feb 6, 2026	yanked	cgg
0.1.7	Feb 4, 2026	yanked	leak
0.1.6	Feb 4, 2026	yanked	IP Leak
0.1.5	Feb 4, 2026	yanked	IP exposure - use 0.1.7+
0.1.4	Feb 4, 2026	yanked	IP exposure - use 0.1.7+
0.1.3	Feb 4, 2026	yanked	IP exposure - use 0.1.7+
0.1.2	Feb 4, 2026	yanked	IP exposure - use 0.1.7+
0.1.1	Feb 4, 2026	yanked	IP exposure - use 0.1.7+
0.1.0	Feb 3, 2026	yanked	IP exposure - use 0.1.7+

Download files

Source Distributions

No source distribution files available for this release.

Built Distribution

quarterbit-1.0.3-cp311-cp311-win_amd64.whl (1.3 MB)

Uploaded Feb 6, 2026. CPython 3.11, Windows x86-64.

File details

Details for the file quarterbit-1.0.3-cp311-cp311-win_amd64.whl.

File metadata

Download URL: quarterbit-1.0.3-cp311-cp311-win_amd64.whl
Upload date: Feb 6, 2026
Size: 1.3 MB
Tags: CPython 3.11, Windows x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.0

File hashes

Hashes for quarterbit-1.0.3-cp311-cp311-win_amd64.whl
Algorithm	Hash digest
SHA256	`b4388f74d5c520f9d3680fd2f2a712e372e2d5194a9f4d2329083573fde3c4f8`
MD5	`8aa1ca09f9d9dfbef945f0375be96012`
BLAKE2b-256	`f8d98ac2105900848255c91c10cdd17b1aab17d03eda81c903dda0b06aed29c7`
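
A downloaded wheel can be checked against the digests above with Python's standard `hashlib` module. The helper names `file_digest` and `matches` below are illustrative and not part of any QuarterBit tooling; the `"blake2b-256"` label is a convention of this sketch that maps to `hashlib.blake2b(digest_size=32)`.

```python
import hashlib


def file_digest(path: str, algorithm: str = "sha256") -> str:
    """Return the hex digest of a file, reading in chunks to bound memory use."""
    if algorithm == "blake2b-256":
        h = hashlib.blake2b(digest_size=32)  # 32 bytes = 256-bit digest
    else:
        h = hashlib.new(algorithm)
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest()


def matches(path: str, expected_hex: str, algorithm: str = "sha256") -> bool:
    """True if the file's digest equals the expected hex string (case-insensitive)."""
    return file_digest(path, algorithm) == expected_hex.strip().lower()
```

For example, `matches("quarterbit-1.0.3-cp311-cp311-win_amd64.whl", "b4388f74d5c520f9d3680fd2f2a712e372e2d5194a9f4d2329083573fde3c4f8")` should return `True` for an intact copy of the file listed above, assuming it sits in the current directory.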
