SD.Next Quantization Engine

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Disty0

These details have not been verified by PyPI

Project description

SDNQ: SD.Next Quantization Engine

For more info, please check out SD.Next SDNQ wiki page: https://github.com/vladmandic/sdnext/wiki/SDNQ-Quantization

Install command:

pip install sdnq

Example code to load pre-quantized models:

Pre-quantized models can be found here: https://huggingface.co/collections/Disty0/sdnq

from sdnq import SDNQConfig # import sdnq to register it into diffusers and transformers
model = AutoModel.from_pretrained(model_path)

Example code for enabling or disabling quantized matmul with a pre-quantized model:

from sdnq.loader import apply_sdnq_options_to_model
quantized_model = apply_sdnq_options_to_model(quantized_model, use_quantized_matmul=True)

Example quantization config code for Diffusers and Transformers libraries:

from sdnq import SDNQConfig
from sdnq.common import use_torch_compile as triton_is_available

sdnq_config = SDNQConfig(
    weights_dtype="int8",
    group_size=0,
    svd_rank=32,
    svd_steps=8,
    dynamic_loss_threshold=1e-2,
    use_svd=False,
    quant_conv=False,
    use_quantized_matmul=triton_is_available,
    use_quantized_matmul_conv=False,
    use_dynamic_quantization=False,
    dequantize_fp32=False,
    non_blocking=False,
    add_skip_keys=True,
    quantization_device="cuda",
    return_device="cuda",
    modules_to_not_convert=["correction_coefs", "prediction_coefs", "lm_head", "embedding_projection"],
    modules_dtype_dict={"int8": ["lm_head"]},
)

quantized_model = AutoModel.from_pretrained(model_path, quantization_config=sdnq_config)

Example code for saving a quantized model:

from sdnq.loader import save_sdnq_model
# set is_pipeline to True if you want to save the entire diffusers pipeline instead of a single model.
save_sdnq_model(pipe_or_quantized_model, "path_to_save_the_quantized_model", is_pipeline=False)

Example code for quantized training:

Note:

Safetensors serialization is not supported with SDNQ training.
Either don't use Safetensors serialization or convert the quantized model to standard SDNQ model before saving.

from sdnq.training import sdnq_training_post_load_quant
from sdnq.common import use_torch_compile as triton_is_available

quantized_model = sdnq_training_post_load_quant(
    model,
    weights_dtype="uint8",
    quantized_matmul_dtype="int8",
    group_size=32, # 0 means auto, -1 means disabled
    svd_rank=32,
    svd_steps=8,
    use_svd=False,
    use_grad_ckpt=True, # disable this if you are not using gradient checkpointing
    use_quantized_matmul=triton_is_available,
    use_static_quantization=True, # quantize the model weights
    use_stochastic_rounding=True,
    dequantize_fp32=True,
    non_blocking=False,
    add_skip_keys=True,
    quantization_device="cuda",
    return_device="cuda",
    modules_to_not_convert=["correction_coefs", "prediction_coefs", "lm_head", "embedding_projection"],
    modules_dtype_dict={"int8": ["lm_head"]},
)

Example code for converting standard SDNQ model to training SDNQ Model:

from sdnq.training import convert_sdnq_model_to_training
from sdnq.common import use_torch_compile as triton_is_available
quantized_model = convert_sdnq_model_to_training(
    quantized_model,
    quantized_matmul_dtype="int8",
    use_grad_ckpt=True,
    use_quantized_matmul=triton_is_available,
    use_stochastic_rounding=True,
    dequantize_fp32=True,
)

Example code for converting training SDNQ model to standard SDNQ Model:

from sdnq.training import convert_training_model_to_sdnq
quantized_model = convert_training_model_to_sdnq(quantized_model)

Example code for quantized optimizer states:

from sdnq.optim import Adafactor, AdamW, CAME, Lion, Muon
optimizer = AdamW(
    parameters,
    use_stochastic_rounding=True,
    use_stochastic_buffers=True,
    use_quantized_buffers=True,
    use_svd_quantization=False,
    quantized_buffers_dtype="uint8",
    quantized_buffers_group_size=32,
    quantized_buffers_svd_rank=32,
)

Example code for quantized optimizer states for custom optimizers:

from sdnq.training import SDNQTensor

state["exp_avg"] = SDNQTensor.from_float(torch.zeros_like(p), weights_dtype="uint8", group_size=32, use_stochastic_rounding=True)

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Disty0

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.8

May 6, 2026

0.1.7

Apr 14, 2026

0.1.6

Mar 13, 2026

0.1.5

Feb 24, 2026

0.1.4

Jan 19, 2026

This version

0.1.3

Dec 27, 2025

0.1.2

Dec 9, 2025

0.1.1

Nov 29, 2025

0.1.0

Nov 27, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sdnq-0.1.3.tar.gz (61.2 kB view details)

Uploaded Dec 27, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

sdnq-0.1.3-py3-none-any.whl (94.4 kB view details)

Uploaded Dec 27, 2025 Python 3

File details

Details for the file sdnq-0.1.3.tar.gz.

File metadata

Download URL: sdnq-0.1.3.tar.gz
Upload date: Dec 27, 2025
Size: 61.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for sdnq-0.1.3.tar.gz
Algorithm	Hash digest
SHA256	`692a81b2d6b272c7451d8a49f157cbb7cc4b867dc4ecdaaeb222f50fc0e266c3`
MD5	`e72fcb4842724bc1474ce601f1bfd2bc`
BLAKE2b-256	`cc8108f17cfefbcc5de6367a8304038283e086afa6f9c70e0fe4e0d0b9cd38b5`

See more details on using hashes here.

Provenance

The following attestation bundles were made for sdnq-0.1.3.tar.gz:

Publisher: python-publish.yml on Disty0/sdnq

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: sdnq-0.1.3.tar.gz
- Subject digest: 692a81b2d6b272c7451d8a49f157cbb7cc4b867dc4ecdaaeb222f50fc0e266c3
- Sigstore transparency entry: 780443030
- Sigstore integration time: Dec 27, 2025
Source repository:
- Permalink: Disty0/sdnq@c7fcfd2bd63230e6157add808079cbb02059a2ec
- Branch / Tag: refs/tags/v0.1.3
- Owner: https://github.com/Disty0
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@c7fcfd2bd63230e6157add808079cbb02059a2ec
- Trigger Event: release

File details

Details for the file sdnq-0.1.3-py3-none-any.whl.

File metadata

Download URL: sdnq-0.1.3-py3-none-any.whl
Upload date: Dec 27, 2025
Size: 94.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for sdnq-0.1.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`145e458407e5aa841edc93c645a70442e9ebf6f32b6f7d06091ab3ed84a8e2ff`
MD5	`67f00ec59214336d4731d3f1799ef504`
BLAKE2b-256	`e0843d3bad89170897a3da3fc4734ab4fcdb590e63f2881dbf4c92b188e0f3cc`

See more details on using hashes here.

Provenance

The following attestation bundles were made for sdnq-0.1.3-py3-none-any.whl:

Publisher: python-publish.yml on Disty0/sdnq

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: sdnq-0.1.3-py3-none-any.whl
- Subject digest: 145e458407e5aa841edc93c645a70442e9ebf6f32b6f7d06091ab3ed84a8e2ff
- Sigstore transparency entry: 780443031
- Sigstore integration time: Dec 27, 2025
Source repository:
- Permalink: Disty0/sdnq@c7fcfd2bd63230e6157add808079cbb02059a2ec
- Branch / Tag: refs/tags/v0.1.3
- Owner: https://github.com/Disty0
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@c7fcfd2bd63230e6157add808079cbb02059a2ec
- Trigger Event: release

sdnq 0.1.3

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

SDNQ: SD.Next Quantization Engine

Install command:

Example code to load pre-quantized models:

Example code for enabling or disabling quantized matmul with a pre-quantized model:

Example quantization config code for Diffusers and Transformers libraries:

Example code for saving a quantized model:

Example code for quantized training:

Example code for converting standard SDNQ model to training SDNQ Model:

Example code for converting training SDNQ model to standard SDNQ Model:

Example code for quantized optimizer states:

Example code for quantized optimizer states for custom optimizers:

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance