AXIOM - High-performance quantized optimizer with 73% memory savings
Reason this release was yanked:
cgg
Project description
QuarterBit AXIOM
High-Performance Quantized Optimizer
73% memory savings. Drift-free precision. Beats AdamW convergence.
The Problem
Training large AI models requires massive GPU memory for optimizer states. AdamW stores 8 bytes per parameter just for momentum and variance — that's 8GB for a 1B parameter model.
Additionally, long training runs suffer from floating-point drift — tiny gradient updates accumulate rounding errors over millions of steps, causing:
- Stalled convergence in late training
- Numerical instability
- Suboptimal final models
The Solution
QuarterBit AXIOM solves both problems:
| Metric | AdamW | AXIOM | Improvement |
|---|---|---|---|
| Memory per param | 8.0 bytes | 2.14 bytes | 73% savings |
| Precision drift | Accumulates errors | Drift-free | Eliminated |
| Convergence | Baseline | 3-4% better | Faster training |
Installation
pip install quarterbit
Supported GPUs:
- NVIDIA T4, V100, A100, L4, L40
- NVIDIA RTX 30 series (3060-3090)
- NVIDIA RTX 40 series (4060-4090)
- NVIDIA H100, H200
Quick Start
from quarterbit import Axiom
# Drop-in replacement for AdamW
optimizer = Axiom(
model.parameters(),
lr=5e-4,
weight_decay=0.1,
total_steps=10000
)
# Train as usual
for batch in dataloader:
loss = model(batch)
loss.backward()
optimizer.step()
optimizer.zero_grad()
Why AXIOM?
1. 73% Memory Savings
Train larger models on the same GPU. AXIOM uses FP4 quantization with per-parameter Log8 variance to reduce optimizer state from 8 bytes to 2.14 bytes per parameter.
2. Drift-Free Training
Error-Free Transformations eliminate floating-point accumulation errors. Your model trains with perfect numerical stability from step 1 to step 1,000,000+.
3. Better Convergence
Two-Step Nesterov momentum, temporal coherence, and soft cautious masking deliver 3-4% better validation loss than AdamW.
4. Drop-In Replacement
Same API as PyTorch optimizers. Change one line of code.
Benchmarks
Validated on GPT-2 Small (124M parameters), WikiText-2, 500 steps:
| Optimizer | Val Loss | Memory/Param | Savings |
|---|---|---|---|
| AdamW | 4.89 | 8.0 B | baseline |
| AXIOM | 4.72 | 2.14 B | 73% |
AXIOM consistently beats AdamW by 3-4% on validation loss across architectures.
See our benchmark results for full results including multi-model comparisons.
Requirements
- Python 3.8+
- PyTorch 1.8+
- NVIDIA GPU with CUDA support
- Linux or Windows
Pricing
| Tier | Price | Use Case |
|---|---|---|
| Free | $0 | Personal, research, evaluation |
| Pro | $299/mo | Commercial use, up to 10 GPUs |
| Team | $2,499/mo | Up to 100 GPUs, priority support |
| Enterprise | Custom | Unlimited, custom SLA |
See quarterbit.dev/pricing for details.
License
Proprietary - see LICENSE for details. Free tier available.
Links
- Website: quarterbit.dev
- Documentation: quarterbit.dev/docs
- Email: info@quarterbit.dev
Copyright (c) 2026 Clouthier Simulation Labs. All rights reserved.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file quarterbit-1.0.0-cp311-cp311-win_amd64.whl.
File metadata
- Download URL: quarterbit-1.0.0-cp311-cp311-win_amd64.whl
- Upload date:
- Size: 1.3 MB
- Tags: CPython 3.11, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
353eb8c5f999ed1a374b1a1120c580d39735fcfd387092c5066066a0556370be
|
|
| MD5 |
7a89fe80a49b285322df88891c8c3a0c
|
|
| BLAKE2b-256 |
eae023808da9667a66ab188f1ce3a48ce2124971c9f7f8e418f0d4a122bbe451
|