netspresso

PyNetsPresso

These details have not been verified by PyPI

Project links

Homepage

Project description

🤝 Collaboration with partners 🤝

Qualcomm AI Hub x NetsPresso

STM32 x NetsPresso

🔥 NetsPresso Model Optimization Tutorials 🔥
A Practical Guide to Using NetsPresso's Compressor Module
A Practical Guide to Using NetsPresso's Quantizer Module

Use NetsPresso for a seamless model optimization process. NetsPresso resolves AI-related constraints in business use cases and enables cost-efficiency and enhanced performance by removing the requirement for high-spec servers and network connectivity and preventing high latency and personal data breaches.

Easily compress various models with our resources. Please browse the Docs for details, and join our Discussion Forum for providing feedback or sharing your use cases.

To get started with NetsPresso, you'll need to sign up here.

We offer a comprehensive guide to walk you through the process of optimizing an AI model using NetsPresso. A full tutorial can be found Google Colab.

Step	Type	Description
Train	np.trainer	Build and train a model.
Train	Model Zoo Image Classification PyTorch-CIFAR-Models Object Detection YOLO Fastest YOLOX YOLOv5 YOLOv7 Semantic Segmentation PIDNet Pose Estimation YOLOv8	Build and train a model.
Compress	np.compressor	Compress and optimize the user’s model.
Quantize	np.quantizer	Quantize the user’s model.
Convert	np.converter	Convert and quantize the user’s model to run efficiently on device.
Benchmark	np.benchmarker	Benchmark the user's model to measure model inference speed on diverse device.

Installation

Prerequisites

Python 3.8 | 3.9 | 3.10
PyTorch 1.13.0 (recommended) (compatible with: 1.11.x - 1.13.x)
TensorFlow 2.8.0 (recommended) (compatible with: 2.3.x - 2.8.x)

Install with PyPI (stable)

pip install netspresso

To use editable mode or docker, see INSTALLATION.md.

Getting started

Login

Log-in to your netspresso account. Please sign-up here if you need one.

from netspresso import NetsPresso

netspresso = NetsPresso(email="YOUR_EMAIL", password="YOUR_PASSWORD")

Quantizer

Automatic quantization

To start quantize a model, enter the model path, dataset path, and the desired quantization precision.

The quantized model will be saved to the specified output directory (output_dir).

from netspresso.enums import QuantizationPrecision, SimilarityMetric

# 1. Declare quantizer
quantizer = netspresso.quantizer()

# 2. Run automatic quantization
quantization_result = quantizer.automatic_quantization(
    input_model_path="./examples/sample_models/test.onnx",
    output_dir="./outputs/quantized/automatic_quantization",
    dataset_path="./examples/sample_datasets/pickle_calibration_dataset_128x128.npy",
    weight_precision=QuantizationPrecision.INT8,
    activation_precision=QuantizationPrecision.INT8,
    threshold=0,
)

Custom precision quantization by layer name

This method enables you to apply precision settings tailored to each layer, based on the recommendations, to optimize model.

Or, you can modify it to your desired precision and optimize it.

from netspresso.enums import QuantizationPrecision

# 1. Declare quantizer
quantizer = netspresso.quantizer()

# 2. Recommendation precision
metadata = quantizer.get_recommendation_precision(
    input_model_path="./examples/sample_models/test.onnx",
    output_dir="./outputs/quantized/recommendation",
    dataset_path="./examples/sample_datasets/pickle_calibration_dataset_128x128.npy",
    weight_precision=QuantizationPrecision.INT8,
    activation_precision=QuantizationPrecision.INT8,
    threshold=0,
)
recommendation_precisions = quantizer.load_recommendation_precision_result(metadata.recommendation_result_path)

# 2. Run quantization by layer name
quantization_result = quantizer.custom_precision_quantization_by_layer_name(
    input_model_path="./examples/sample_models/test.onnx",
    output_dir="./outputs/quantized/custom_precision_quantization_by_layer_name",
    precision_by_layer_name=recommendation_precisions.layers,
    dataset_path="./examples/sample_datasets/pickle_calibration_dataset_128x128.npy",
)

Trainer

Train

To start training a model, first select a task.

Then configure the dataset, model, augmentation, and hyperparameters.

Once setup is finished, enter the GPU number and project name for training.

from netspresso.enums import Task
from netspresso.trainer.optimizers import AdamW
from netspresso.trainer.schedulers import CosineAnnealingWarmRestartsWithCustomWarmUp
from netspresso.trainer.augmentations import Resize


# 1. Declare trainer
trainer = netspresso.trainer(task=Task.OBJECT_DETECTION)  # IMAGE_CLASSIFICATION, OBJECT_DETECTION, SEMANTIC_SEGMENTATION

# 2. Set config for training
# 2-1. Data
trainer.set_dataset_config(
    name="traffic_sign_config_example",
    root_path="/root/traffic-sign",
    train_image="images/train",
    train_label="labels/train",
    valid_image="images/valid",
    valid_label="labels/valid",
    id_mapping=["prohibitory", "danger", "mandatory", "other"],
)

# 2-2. Model
print(trainer.available_models)  # ['YOLOX-S', 'YOLOX-M', 'YOLOX-L', 'YOLOX-X']
trainer.set_model_config(model_name="YOLOX-S", img_size=512)

# 2-3. Augmentation
trainer.set_augmentation_config(
    train_transforms=[Resize()],
    inference_transforms=[Resize()],
)

# 2-4. Training
optimizer = AdamW(lr=6e-3)
scheduler = CosineAnnealingWarmRestartsWithCustomWarmUp(warmup_epochs=10)
trainer.set_training_config(
    epochs=40,
    batch_size=16,
    optimizer=optimizer,
    scheduler=scheduler,
)

# 3. Train
training_result = trainer.train(gpus="0, 1", project_name="PROJECT_TRAIN_SAMPLE")

Retrain

To start retraining a model, use hparams.yaml file which is one of the artifacts generated during the training of the original model.

Then, enter the compressed model path, which is an artifact of the compressor in fx_model_path.

Adjust the training hyperparameters as needed. (See 2-2. for detailed code.)

from netspresso.trainer.optimizers import AdamW

# 1. Declare trainer
trainer = netspresso.trainer(yaml_path="./temp/hparams.yaml")

# 2. Set config for retraining
# 2-1. FX Model
trainer.set_fx_model(fx_model_path="./temp/FX_MODEL_PATH.pt")

# 2-2. Training
optimizer = AdamW(lr=6e-3)
trainer.set_training_config(
    epochs=30,
    batch_size=16,
    optimizer=optimizer,
)

# 3. Train
retraining_result = trainer.train(gpus="0, 1", project_name="PROJECT_RETRAIN_SAMPLE")

Compressor

Compress (Automatic compression)

To start compressing a model, enter the model path to compress and the appropriate compression ratio.

The compressed model will be saved in the specified output directory (output_dir).

# 1. Declare compressor
compressor = netspresso.compressor_v2()

# 2. Run automatic compression
compression_result = compressor.automatic_compression(
    input_shapes=[{"batch": 1, "channel": 3, "dimension": [224, 224]}],
    input_model_path="./examples/sample_models/graphmodule.pt",
    output_dir="./outputs/compressed/pytorch_automatic_compression",
    compression_ratio=0.5,
)

Converter

Convert

To start converting a model, enter the model path to convert and the target framework and device name.

For NVIDIA GPUs and Jetson devices, enter the software version additionally due to the jetpack version.

The converted model will be saved in the specified output directory (output_dir).

from netspresso.enums import DeviceName, Framework, SoftwareVersion

# 1. Declare converter
converter = netspresso.converter_v2()

# 2. Run convert
conversion_result = converter.convert_model(
    input_model_path="./examples/sample_models/test.onnx",
    output_dir="./outputs/converted/TENSORRT_JETSON_AGX_ORIN_JETPACK_5_0_1",
    target_framework=Framework.TENSORRT,
    target_device_name=DeviceName.JETSON_AGX_ORIN,
    target_software_version=SoftwareVersion.JETPACK_5_0_1,
)

Benchmarker

Benchmark

To start benchmarking a model, enter the model path to benchmark and the target device name.

For NVIDIA GPUs and Jetson devices, device name and software version have to be matched with the target device of the conversion.

TensorRT Model has strong dependency with the device type and its jetpack version.

from netspresso.enums import DeviceName, SoftwareVersion

# 1. Declare benchmarker
benchmarker = netspresso.benchmarker_v2()

# 2. Run benchmark
benchmark_result = benchmarker.benchmark_model(
    input_model_path="./outputs/converted/TENSORRT_JETSON_AGX_ORIN_JETPACK_5_0_1/TENSORRT_JETSON_AGX_ORIN_JETPACK_5_0_1.trt",
    target_device_name=DeviceName.JETSON_AGX_ORIN,
    target_software_version=SoftwareVersion.JETPACK_5_0_1,
)
print(f"model inference latency: {benchmark_result.benchmark_result.latency} ms")
print(f"model gpu memory footprint: {benchmark_result.benchmark_result.memory_footprint_gpu} MB")
print(f"model cpu memory footprint: {benchmark_result.benchmark_result.memory_footprint_cpu} MB")

Supported options for Converter & Benchmarker

Frameworks that support conversion for model's framework

Target / Source Framework	ONNX	TENSORFLOW_KERAS	TENSORFLOW
TENSORRT	✔️
DRPAI	✔️
OPENVINO	✔️
TENSORFLOW_LITE	✔️	✔️	✔️

Devices that support benchmarks for model's framework

Device / Framework	ONNX	TENSORRT	TENSORFLOW_LITE	DRPAI	OPENVINO
RASPBERRY_PI_5	✔️		✔️
RASPBERRY_PI_4B	✔️		✔️
RASPBERRY_PI_3B_PLUS	✔️		✔️
RASPBERRY_PI_ZERO_W	✔️		✔️
RASPBERRY_PI_ZERO_2W	✔️		✔️
ARM_ETHOS_U_SERIES			✔️(only INT8)
ALIF_ENSEMBLE_E7_DEVKIT_GEN2			✔️(only INT8)
RENESAS_RA8D1			✔️(only INT8)
NXP_iMX93			✔️(only INT8)
ARDUINO_NICLA_VISION			✔️(only INT8)
RENESAS_RZ_V2L	✔️			✔️
RENESAS_RZ_V2M	✔️			✔️
JETSON_NANO	✔️	✔️
JETSON_TX2	✔️	✔️
JETSON_XAVIER	✔️	✔️
JETSON_NX	✔️	✔️
JETSON_AGX_ORIN	✔️	✔️
JETSON_ORIN_NANO	✔️	✔️
AWS_T4	✔️	✔️
INTEL_XEON_W_2233					✔️

Software versions that support conversions and benchmarks for specific devices

Software Versions requires for Jetson Device. If you are using a different device, you do not need to enter it.

Software Version / Device	JETSON_NANO	JETSON_TX2	JETSON_XAVIER	JETSON_NX	JETSON_AGX_ORIN	JETSON_ORIN_NANO
JETPACK_4_4_1	✔️
JETPACK_4_6	✔️	✔️	✔️	✔️
JETPACK_5_0_1					✔️
JETPACK_5_0_2				✔️
JETPACK_6_1						✔️

The code below is an example of using software version.

conversion_result = converter.convert_model(
    input_model_path=INPUT_MODEL_PATH,
    output_dir=OUTPUT_DIR,
    target_framework=Framework.TENSORRT,
    target_device_name=DeviceName.JETSON_AGX_ORIN,
    target_software_version=SoftwareVersion.JETPACK_5_0_1,
)
benchmark_result = benchmarker.benchmark_model(
    input_model_path=CONVERTED_MODEL_PATH,
    target_device_name=DeviceName.JETSON_AGX_ORIN,
    target_software_version=SoftwareVersion.JETPACK_5_0_1,
)

Hardware type that support benchmarks for specific devices

Benchmark and compare models with and without Arm Helium.

RENESAS_RA8D1 and ALIF_ENSEMBLE_E7_DEVKIT_GEN2 are available for use.

The benchmark results with Helium can be up to twice as fast as without Helium.

The code below is an example of using hardware type.

benchmark_result = benchmarker.benchmark_model(
    input_model_path=CONVERTED_MODEL_PATH,
    target_device_name=DeviceName.RENESAS_RA8D1,
    target_data_type=DataType.INT8,
    target_hardware_type=HardwareType.HELIUM
)

Guide to Credit Consumption by Module

Module	Feature	Credit
Compressor	Automatic compression	25
Compressor	Advanced compression	50
Converter	Convert	50
Benchmarker	Benchmark	25

Contact

Join our Discussion Forum for providing feedback or sharing your use cases, and if you want to talk more with Nota, please contact us here.
Or you can also do it via email(netspresso@nota.ai) or phone(+82 2-555-8659)!

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

1.17.0

Nov 20, 2025

1.17.0b0 pre-release

Nov 19, 2025

1.16.1

Aug 30, 2025

1.16.0

Jul 29, 2025

1.15.4

May 20, 2025

1.15.3

May 20, 2025

1.15.2

May 20, 2025

1.15.2b1 pre-release

May 20, 2025

1.15.2b0 pre-release

May 19, 2025

1.15.1

May 19, 2025

1.15.0

May 19, 2025

This version

1.15.0b4 pre-release

May 17, 2025

1.15.0b3 pre-release

May 17, 2025

1.15.0b2 pre-release

May 17, 2025

1.15.0b1 pre-release

May 14, 2025

1.15.0b0 pre-release

Apr 3, 2025

1.14.2

Mar 14, 2025

1.14.1

Mar 5, 2025

1.14.1b2 pre-release

Mar 5, 2025

1.14.1b1 pre-release

Mar 4, 2025

1.14.1b0 pre-release

Mar 4, 2025

1.14.0

Feb 13, 2025

1.14.0b7 pre-release

Feb 12, 2025

1.14.0b6 pre-release

Feb 7, 2025

1.14.0b5 pre-release

Nov 29, 2024

1.14.0b4 pre-release

Nov 26, 2024

1.14.0b3 pre-release

Nov 23, 2024

1.14.0b2 pre-release

Nov 23, 2024

1.14.0b1 pre-release

Nov 23, 2024

1.14.0b0 pre-release

Nov 22, 2024

1.13.2

Dec 5, 2024

1.13.1

Nov 15, 2024

1.13.0

Nov 14, 2024

1.12.2

Aug 29, 2024

1.12.2b0 pre-release

Aug 28, 2024

1.12.1

Aug 26, 2024

1.12.0

Aug 23, 2024

1.12.0b11 pre-release

Aug 23, 2024

1.12.0b10 pre-release

Aug 23, 2024

1.12.0b9 pre-release

Aug 23, 2024

1.12.0b8 pre-release

Aug 23, 2024

1.12.0b7 pre-release

Aug 16, 2024

1.12.0b6 pre-release

Aug 16, 2024

1.12.0b5 pre-release

Aug 14, 2024

1.12.0b4 pre-release

Aug 13, 2024

1.12.0b3 pre-release

Aug 9, 2024

1.12.0b2 pre-release

Aug 7, 2024

1.12.0b1 pre-release

Aug 7, 2024

1.12.0b0 pre-release

Aug 7, 2024

1.11.0

Aug 5, 2024

1.10.0b14 pre-release

Feb 4, 2025

1.10.0b13 pre-release

Jan 24, 2025

1.10.0b12 pre-release

Jan 24, 2025

1.10.0b11 pre-release

Jan 23, 2025

1.10.0b10 pre-release

Sep 5, 2024

1.10.0b9 pre-release

Aug 5, 2024

1.10.0b8 pre-release

Aug 1, 2024

1.10.0b7 pre-release

Aug 1, 2024

1.10.0b6 pre-release

Jul 31, 2024

1.10.0b5 pre-release

Jul 31, 2024

1.10.0b4 pre-release

Jul 31, 2024

1.10.0b3 pre-release

Jul 31, 2024

1.10.0b2 pre-release

Jul 30, 2024

1.10.0b1 pre-release

Jul 30, 2024

1.10.0b0 pre-release

Jul 30, 2024

1.9.0

Jul 24, 2024

1.9.0b0 pre-release

Jul 23, 2024

1.8.0

Jul 15, 2024

1.7.1

Jul 8, 2024

1.7.1b3 pre-release

Jul 5, 2024

1.7.1b2 pre-release

Jul 5, 2024

1.7.1b1 pre-release

Jul 5, 2024

1.7.1b0 pre-release

Jul 1, 2024

1.7.0

Jun 27, 2024

1.7.0b9 pre-release

Jun 26, 2024

1.7.0b8 pre-release

Jun 17, 2024

1.7.0b7 pre-release

Jun 17, 2024

1.7.0b6 pre-release

Jun 12, 2024

1.7.0b5 pre-release

Jun 11, 2024

1.7.0b4 pre-release

Jun 11, 2024

1.7.0b3 pre-release

Jun 11, 2024

1.7.0b2 pre-release

Jun 11, 2024

1.7.0b1 pre-release

Jun 5, 2024

1.7.0b0 pre-release

Jun 4, 2024

1.6.0

Apr 5, 2024

1.5.0

Mar 14, 2024

1.4.0

Mar 8, 2024

1.4.0b2 pre-release

Mar 4, 2024

1.4.0b1 pre-release

Mar 4, 2024

1.4.0b0 pre-release

Feb 28, 2024

1.3.2

Mar 31, 2024

1.3.1

Feb 15, 2024

1.3.0

Feb 5, 2024

1.3.0b2 pre-release

Feb 4, 2024

1.3.0b1 pre-release

Feb 4, 2024

1.3.0b0 pre-release

Dec 20, 2023

1.2.2

Jan 31, 2024

1.2.1

Jan 15, 2024

1.2.0

Nov 21, 2023

1.1.7

Oct 19, 2023

1.1.6

Aug 24, 2023

1.1.5

Aug 24, 2023

1.1.4

Aug 16, 2023

1.1.3

Aug 16, 2023

1.1.2

Aug 8, 2023

1.1.1

Aug 8, 2023

1.1.0

Aug 7, 2023

1.0.5

Jul 13, 2023

1.0.4

Jun 26, 2023

1.0.3

Jun 19, 2023

1.0.2

Jun 19, 2023

1.0.1

Jun 19, 2023

1.0.0

Jun 14, 2023

0.1.12

Jun 13, 2023

0.1.11

Jun 12, 2023

0.1.10

Jun 12, 2023

0.1.9

Jun 12, 2023

0.1.8

Jun 12, 2023

0.1.7

Jun 12, 2023

0.1.6

Jun 8, 2023

0.1.5

Jun 7, 2023

0.1.4

Jun 2, 2023

0.1.3

Jun 2, 2023

0.1.2

Jun 2, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

netspresso-1.15.0b4.tar.gz (117.8 kB view details)

Uploaded May 17, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

netspresso-1.15.0b4-py3-none-any.whl (171.4 kB view details)

Uploaded May 17, 2025 Python 3

File details

Details for the file netspresso-1.15.0b4.tar.gz.

File metadata

Download URL: netspresso-1.15.0b4.tar.gz
Upload date: May 17, 2025
Size: 117.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for netspresso-1.15.0b4.tar.gz
Algorithm	Hash digest
SHA256	`634f3afad6fad02ec06e3ce9a5f3cb404ece24696335bd93eb7f87e7ef1b7b9c`
MD5	`1dcd64a66941bad4720f19221f98c918`
BLAKE2b-256	`eeb76678e7163d24af6e919d92441dc095bcd1e3cff2259010c657b5effb5cb3`

See more details on using hashes here.

File details

Details for the file netspresso-1.15.0b4-py3-none-any.whl.

File metadata

Download URL: netspresso-1.15.0b4-py3-none-any.whl
Upload date: May 17, 2025
Size: 171.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for netspresso-1.15.0b4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d8c1afb88e835425b84ba0897fbcd574930030a7f1c32ea5ac8ed5761e87ca31`
MD5	`f39b31558111b2fbd6945726a589f6dd`
BLAKE2b-256	`312c82c346ffa2b69cc0da39d76de403f4c8546df778e905dd8293e6c4e96e54`

See more details on using hashes here.

netspresso 1.15.0b4

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Installation

Prerequisites

Install with PyPI (stable)

Getting started

Login

Quantizer

Automatic quantization

Custom precision quantization by layer name

Trainer

Train

Retrain

Compressor

Compress (Automatic compression)

Converter

Convert

Benchmarker

Benchmark

Frameworks that support conversion for model's framework

Devices that support benchmarks for model's framework

Software versions that support conversions and benchmarks for specific devices

Hardware type that support benchmarks for specific devices

Guide to Credit Consumption by Module

Contact

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes