Skip to main content

AXIOM - High-performance quantized optimizer with 73% memory savings

Reason this release was yanked:

cgg

Project description

QuarterBit AXIOM

High-Performance Quantized Optimizer

73% memory savings. Drift-free precision. Beats AdamW convergence.

The Problem

Training large AI models requires massive GPU memory for optimizer states. AdamW stores 8 bytes per parameter just for momentum and variance — that's 8GB for a 1B parameter model.

Additionally, long training runs suffer from floating-point drift — tiny gradient updates accumulate rounding errors over millions of steps, causing:

  • Stalled convergence in late training
  • Numerical instability
  • Suboptimal final models

The Solution

QuarterBit AXIOM solves both problems:

Metric AdamW AXIOM Improvement
Memory per param 8.0 bytes 2.14 bytes 73% savings
Precision drift Accumulates errors Drift-free Eliminated
Convergence Baseline 3-4% better Faster training

Installation

pip install quarterbit

Supported GPUs:

  • NVIDIA T4, V100, A100, L4, L40
  • NVIDIA RTX 30 series (3060-3090)
  • NVIDIA RTX 40 series (4060-4090)
  • NVIDIA H100, H200

Quick Start

from quarterbit import Axiom

# Drop-in replacement for AdamW
optimizer = Axiom(
    model.parameters(),
    lr=5e-4,
    weight_decay=0.1,
    total_steps=10000
)

# Train as usual
for batch in dataloader:
    loss = model(batch)
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()

Why AXIOM?

1. 73% Memory Savings

Train larger models on the same GPU. AXIOM uses FP4 quantization with per-parameter Log8 variance to reduce optimizer state from 8 bytes to 2.14 bytes per parameter.

2. Drift-Free Training

Error-Free Transformations eliminate floating-point accumulation errors. Your model trains with perfect numerical stability from step 1 to step 1,000,000+.

3. Better Convergence

Two-Step Nesterov momentum, temporal coherence, and soft cautious masking deliver 3-4% better validation loss than AdamW.

4. Drop-In Replacement

Same API as PyTorch optimizers. Change one line of code.

Benchmarks

Validated on GPT-2 Small (124M parameters), WikiText-2, 500 steps:

Optimizer Val Loss Memory/Param Savings
AdamW 4.89 8.0 B baseline
AXIOM 4.72 2.14 B 73%

AXIOM consistently beats AdamW by 3-4% on validation loss across architectures.

See our benchmark results for full results including multi-model comparisons.

Requirements

  • Python 3.8+
  • PyTorch 1.8+
  • NVIDIA GPU with CUDA support
  • Linux or Windows

Pricing

Tier Price Use Case
Free $0 Personal, research, evaluation
Pro $299/mo Commercial use, up to 10 GPUs
Team $2,499/mo Up to 100 GPUs, priority support
Enterprise Custom Unlimited, custom SLA

See quarterbit.dev/pricing for details.

License

Proprietary - see LICENSE for details. Free tier available.

Links


Copyright (c) 2026 Clouthier Simulation Labs. All rights reserved.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

quarterbit-1.0.0-cp311-cp311-win_amd64.whl (1.3 MB view details)

Uploaded CPython 3.11Windows x86-64

File details

Details for the file quarterbit-1.0.0-cp311-cp311-win_amd64.whl.

File metadata

  • Download URL: quarterbit-1.0.0-cp311-cp311-win_amd64.whl
  • Upload date:
  • Size: 1.3 MB
  • Tags: CPython 3.11, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.0

File hashes

Hashes for quarterbit-1.0.0-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 353eb8c5f999ed1a374b1a1120c580d39735fcfd387092c5066066a0556370be
MD5 7a89fe80a49b285322df88891c8c3a0c
BLAKE2b-256 eae023808da9667a66ab188f1ce3a48ce2124971c9f7f8e418f0d4a122bbe451

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page