aftab

A highly configurable implementation of the approach from the Aftab paper, which benchmarks different convolutional neural network encoders and their effect on the final results.

Overview

Aftab (Persian: آفتاب, meaning "sun" or "sun rays") is a benchmarking framework for evaluating CNN-based encoders in PQN (Parallelised Q-Network) across Atari environments.
It provides standardized training, evaluation, and reproducibility tools for deep reinforcement learning research.

Figure: Global performance of base encoders. Panels: IQM HNS; IQM HNS (Last 50M Frames).

Figure: Comparison of two Gamma encoder variants based on findings from "Hadamax Encoding: Elevating Performance in Model-Free Atari". Panels: IQM HNS; IQM HNS (Last 50M Frames).
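
The figures report IQM HNS. HNS (human-normalized score) rescales a raw game score so that 0 corresponds to a random policy and 1 to the human reference, and IQM (interquartile mean) averages the middle 50% of the values. The sketch below shows both calculations in plain Python. The function names and the sample numbers are illustrative only; they are not part of the aftab API and are not results from the paper.

```python
def human_normalized_score(score: float, random_score: float, human_score: float) -> float:
    """Rescale a raw score: 0.0 is random play, 1.0 is the human reference."""
    return (score - random_score) / (human_score - random_score)


def interquartile_mean(values: list[float]) -> float:
    """Mean of the middle 50% of values (bottom and top 25% discarded).

    Simple truncating variant: when len(values) is not divisible by 4,
    the number of values dropped from each end is rounded down.
    """
    ordered = sorted(values)
    cut = len(ordered) // 4  # number of values dropped from each end
    middle = ordered[cut:len(ordered) - cut]
    return sum(middle) / len(middle)


# Hypothetical HNS values for eight runs (not results from the paper).
runs = [0.1, 0.4, 0.5, 0.6, 0.7, 0.8, 1.2, 3.0]
print(interquartile_mean(runs))
print(human_normalized_score(score=50.0, random_score=0.0, human_score=100.0))
```

Because the top and bottom quarters are discarded, a single outlier run such as the 3.0 above does not move the IQM, which is one reason it is often preferred over the plain mean when only a few seeds are available.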

Installation

Install via pip:

pip install aftab

Usage

Note: the JAX API is still under development. With the current PyTorch version, expect training to take up to 13 hours for the best configuration. We hope the JAX version will be considerably faster.

from aftab import Aftab
from aftab import aftab_environments

seeds = [1, 2, 3, 4]

for environment in aftab_environments:
    agent = Aftab(encoder="gamma", frames="pilot")
    for seed in seeds:
        agent.train(environment=environment, seed=seed)
        agent.log()

Defining a Custom Encoder

You can define your own encoder as a PyTorch module and pass it to the agent:

import torch
from aftab import Aftab

class CustomImageEncoder(torch.nn.Module):
    def __init__(self):
        super().__init__()
  
    def forward(self, x):
        pass

agent = Aftab(encoder=CustomImageEncoder, frames="pilot")
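
The CustomImageEncoder stub above leaves forward empty. As a rough illustration of what a filled-in encoder might look like, the sketch below stacks three convolutions and flattens the result. It assumes 84x84 inputs with 4 stacked frames and a flat feature vector as output; the input and output shapes that aftab actually expects are not stated here, so treat these as assumptions and check them before use. SmallImageEncoder is a hypothetical name.

```python
import torch


class SmallImageEncoder(torch.nn.Module):
    """Hypothetical encoder: three conv layers followed by a flatten."""

    def __init__(self, in_channels: int = 4):
        super().__init__()
        self.layers = torch.nn.Sequential(
            torch.nn.Conv2d(in_channels, 32, kernel_size=8, stride=4),
            torch.nn.ReLU(),
            torch.nn.Conv2d(32, 64, kernel_size=4, stride=2),
            torch.nn.ReLU(),
            torch.nn.Conv2d(64, 64, kernel_size=3, stride=1),
            torch.nn.ReLU(),
            torch.nn.Flatten(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Assumed input: float tensor of shape (batch, in_channels, 84, 84).
        return self.layers(x)


encoder = SmallImageEncoder()
features = encoder(torch.zeros(2, 4, 84, 84))
print(features.shape)  # spatial size shrinks 84 -> 20 -> 9 -> 7, so 64 * 7 * 7 features
```

If the shapes match what the agent expects, the class could be passed in the same way as CustomImageEncoder above.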

Results

Encoder Experiments:

Tables:
- HNS
- Scores
Charts:
- Loss Evolution
- IQM HNS

Hadamax Experiments:

Tables:
- HNS
- Scores
Charts:
- Loss Evolution
- IQM HNS

Final Experiments: in progress (the GPUs are still running).

Model Complexity

Base Variants

Variant	Encoder Parameters	Regression Head Parameters	Total Parameters	Encoder FLOPs	Regression Head FLOPs	Total FLOPs
PQN	78,304	1,686,500	1,764,804	7.734	1.610	9.347
Alpha	174,752	1,782,948	1,957,700	27.541	1.610	29.151
Beta	89,008	1,782,948	1,871,956	61.515	1.610	63.126
Gamma	117,168	1,725,364	1,842,532	22.901	1.610	24.512
Delta	78,552	1,850,588	1,929,140	6.143	1.774	7.917
Epsilon	80,112	2,179,828	2,259,940	13.252	2.101	15.354
Zeta	77,232	2,537,396	2,614,628	25.362	2.462	27.824
Eta	78,400	23,739,460	23,817,860	28.422	23.663	52.085
Theta	76,288	1,127,428	1,203,716	9.065	1.053	10.118

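Note: The Eta variant has far more parameters than the other variants. Its encoder is about the same size as the others (78,400 parameters), but it produces a large number of output features, which inflates the regression head to 23,739,460 parameters.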
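
The split between encoder and regression head can be checked directly from the table. The short sketch below copies three rows and prints the share of parameters that sits in the head; the helper name head_share is made up for this illustration.

```python
# Parameter counts copied from the Base Variants table: (encoder, regression head).
parameters = {
    "PQN": (78_304, 1_686_500),
    "Gamma": (117_168, 1_725_364),
    "Eta": (78_400, 23_739_460),
}


def head_share(encoder: int, head: int) -> float:
    """Fraction of all parameters that belong to the regression head."""
    return head / (encoder + head)


for name, (encoder, head) in parameters.items():
    total = encoder + head
    print(f"{name}: total={total:,} head share={head_share(encoder, head):.1%}")
```

For Eta the encoder is smaller than Gamma's, yet the total is roughly thirteen times larger, so the growth comes from the head rather than from the encoder.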

Hadamax Variants

Variant	Encoder Parameters	Regression Head Parameters	Total Parameters	Encoder FLOPs	Regression Head FLOPs	Total FLOPs
PQN Hadamax	156,608	3,968,516	4,125,124	159.014	3.969	162.984
Gamma Hadamax V1	234,336	1,609,220	1,843,556	122.001	1.610	123.611
Gamma Hadamax V2	234,336	3,280,388	3,514,724	129.300	3.281	132.581

Hyperparameters

Hyperparameter	Value
Learning rate	$2.5 \times 10^{-4}$
Training environments	128
Test environments	8
Optimizer	Rectified Adam
Weight decay	0
Optimizer $\epsilon$	$1 \times 10^{-5}$
$\beta_{1}$	0.9
$\beta_{2}$	0.999
Total Frames	200,000,000
Loss function	Mean Squared Error
Scheduler	Linear Annealing
$\epsilon$-greedy exploration	10% of total frames
Discount factor ($\gamma$)	0.99
GAE ($\lambda$)	0.65
Epochs	2
Batch size	4096

These hyperparameters were used in both the encoder and the Hadamax experiments.
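
The table lists an $\epsilon$-greedy exploration phase covering 10% of the total frames. One plausible reading is that the exploration rate decays linearly over the first 20,000,000 frames and is then held constant. The sketch below illustrates that reading only. The start and end values (1.0 and 0.01) are hypothetical placeholders that do not come from the table, and epsilon_at is not an aftab function.

```python
TOTAL_FRAMES = 200_000_000      # from the hyperparameter table
EXPLORATION_FRACTION = 0.10     # from the hyperparameter table
EPSILON_START = 1.0             # assumption, not given in the source
EPSILON_END = 0.01              # assumption, not given in the source


def epsilon_at(frame: int) -> float:
    """Linearly anneal epsilon over the exploration phase, then hold it."""
    decay_frames = TOTAL_FRAMES * EXPLORATION_FRACTION
    progress = min(max(frame / decay_frames, 0.0), 1.0)
    return EPSILON_START + progress * (EPSILON_END - EPSILON_START)


for frame in (0, 10_000_000, 20_000_000, 200_000_000):
    print(frame, round(epsilon_at(frame), 3))
```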

Statistical Significance

	PQN	Alpha	Beta	Gamma	Delta	Epsilon	Zeta	Eta	Theta
PQN	-	-	-	-	-	-	-	-	-
Alpha	0	-	-	-	-	-	-	-	-
Beta	0	0.847	-	-	-	-	-	-	-
Gamma	0	0.295	0.802	-	-	-	-	-	-
Delta	0	0	0	0	-	-	-	-	-
Epsilon	0	0.104	0.068	0.01	0	-	-	-	-
Zeta	0	0.145	0.293	0.024	0	0.552	-	-	-
Eta	0.001	0.337	0.757	0.221	0	0.819	0.967	-	-
Theta	0.431	0	0.004	0	0.046	0.001	0.001	0.002	-

	Gamma	Hadamax Gamma V1	Hadamax Gamma V2	Hadamax Nature DQN
Gamma	-	-	-	-
Hadamax Gamma V1	0	-	-	-
Hadamax Gamma V2	0	0.72	-	-
Hadamax Nature DQN	0	0.078	0.151	-

Reproducibility

Due to the stochastic nature of deep reinforcement learning, exact reproducibility via fixed datasets is not feasible.
Instead, we provide a set of random seeds used in our experiments.

from aftab import aftab_seeds

print(aftab_seeds)

Full experiment replication:

from aftab import Aftab
from aftab import aftab_environments
from aftab import aftab_seeds

for environment in aftab_environments:
    agent = Aftab()
    for seed in aftab_seeds:
        agent.train(environment=environment, seed=seed)
        agent.log()

A comprehensive set of Atari environments is available via EnvPool:
https://envpool.readthedocs.io/en/latest/env/atari.html#available-tasks

Hardware

All experiments were run on NVIDIA A40 GPUs.

Specification	Details
GPU Memory	48 GB GDDR6 with error-correcting code (ECC)
GPU Memory Bandwidth	696 GB/s
Interconnect	NVIDIA NVLink 112.5 GB/s (bidirectional); PCIe Gen4: 64 GB/s
NVLink	2-way low profile (2-slot)
Display Ports	3x DisplayPort 1.4
Max Power Consumption	300 W
Form Factor	4.4" (H) x 10.5" (L), Dual Slot
Thermal	Passive
vGPU Software Support	NVIDIA Virtual PC, NVIDIA Virtual Applications, NVIDIA RTX Virtual Workstation, NVIDIA Virtual Compute Server, NVIDIA AI Enterprise
vGPU Profiles Supported	See the Virtual GPU Licensing Guide
NVENC / NVDEC	1x / 2x (includes AV1 decode)
Secure Boot	Secure and Measured Boot with Hardware Root of Trust (optional)
NEBS Ready	Level 3
Power Connector	8-pin CPU

Citation

@article{aftab2026benchmarking,
  title={Aftab: Benchmarking {CNN} Encoders in {PQN}},
  author={Shieenavaz, Taha and Zareshahraki, Shabnam and Nanni, Loris},
  journal={arXiv preprint arXiv:YYMM.NNNNN},
  year={2026}
}

Related Works

@misc{2407.04811,
  Title = {Simplifying Deep Temporal Difference Learning},
  Author = {Matteo Gallici and Mattie Fellows and Benjamin Ellis and Bartomeu Pou and Ivan Masmitja and Jakob Nicolaus Foerster and Mario Martin},
  Year = {2024},
  Eprint = {arXiv:2407.04811},
}

@misc{2403.03950,
  Title = {Stop Regressing: Training Value Functions via Classification for Scalable Deep RL},
  Author = {Jesse Farebrother and Jordi Orbay and Quan Vuong and Adrien Ali Taïga and Yevgen Chebotar and Ted Xiao and Alex Irpan and Sergey Levine and Pablo Samuel Castro and Aleksandra Faust and Aviral Kumar and Rishabh Agarwal},
  Year = {2024},
  Eprint = {arXiv:2403.03950},
}

@misc{1511.06581,
  Title = {Dueling Network Architectures for Deep Reinforcement Learning},
  Author = {Ziyu Wang and Tom Schaul and Matteo Hessel and Hado van Hasselt and Marc Lanctot and Nando de Freitas},
  Year = {2015},
  Eprint = {arXiv:1511.06581},
}

@misc{1806.04613,
  Title = {Improving Regression Performance with Distributional Losses},
  Author = {Ehsan Imani and Martha White},
  Year = {2018},
  Eprint = {arXiv:1806.04613},
}

@misc{1602.04621,
  Title = {Deep Exploration via Bootstrapped DQN},
  Author = {Ian Osband and Charles Blundell and Alexander Pritzel and Benjamin Van Roy},
  Year = {2016},
  Eprint = {arXiv:1602.04621},
}

License

Release history

Version	Release date
0.1.58	May 21, 2026
0.1.57	May 19, 2026
0.1.56 (this version)	May 19, 2026
0.1.55	May 19, 2026
0.1.54	May 19, 2026
0.1.53	May 18, 2026
0.1.52	May 14, 2026
0.1.51	May 14, 2026
0.1.50	May 14, 2026
0.1.49	May 14, 2026
0.1.48	May 14, 2026
0.1.47	May 14, 2026
0.1.46	May 13, 2026
0.1.45	May 1, 2026
0.1.44	May 1, 2026
0.1.43	May 1, 2026
0.1.42	May 1, 2026
0.1.41	Apr 30, 2026
0.1.40	Apr 30, 2026
0.1.39	Apr 30, 2026
0.1.38	Apr 30, 2026
0.1.37	Apr 30, 2026
0.1.36	Apr 30, 2026
0.1.35	Apr 30, 2026
0.1.34	Apr 30, 2026
0.1.33	Apr 29, 2026
0.1.31	Apr 29, 2026
0.1.30	Apr 29, 2026
0.1.29	Apr 29, 2026
0.1.28	Apr 29, 2026
0.1.27	Apr 29, 2026
0.1.26	Apr 29, 2026
0.1.25	Apr 29, 2026
0.1.24	Apr 29, 2026
0.1.23	Apr 29, 2026
0.1.22	Apr 28, 2026
0.1.21	Apr 28, 2026
0.1.20	Apr 27, 2026
0.1.19	Apr 27, 2026
0.1.18	Apr 27, 2026
0.1.16	Apr 27, 2026
0.1.15	Apr 27, 2026
0.1.14	Apr 27, 2026
0.1.13	Apr 27, 2026
0.1.12	Apr 27, 2026
0.1.11	Apr 26, 2026
0.1.1	Apr 26, 2026
0.1.0	Apr 26, 2026
0.0.99	Apr 26, 2026
0.0.97	Apr 25, 2026
0.0.96	Apr 25, 2026
0.0.95	Apr 25, 2026
0.0.94	Apr 25, 2026
0.0.93	Apr 25, 2026
0.0.92	Apr 25, 2026
0.0.91	Apr 25, 2026
0.0.90	Apr 25, 2026
0.0.89	Apr 25, 2026
0.0.88	Apr 25, 2026
0.0.87	Apr 25, 2026
0.0.86	Apr 25, 2026
0.0.85	Apr 25, 2026
0.0.84	Apr 25, 2026
0.0.83	Apr 25, 2026
0.0.82	Apr 24, 2026
0.0.81	Apr 24, 2026
0.0.80	Apr 24, 2026
0.0.79	Apr 24, 2026
0.0.78	Apr 24, 2026
0.0.77	Apr 24, 2026
0.0.76	Apr 24, 2026
0.0.75	Apr 23, 2026
0.0.74	Apr 23, 2026
0.0.73	Apr 23, 2026
0.0.72	Apr 23, 2026
0.0.71	Apr 23, 2026
0.0.70	Apr 23, 2026
0.0.69	Apr 23, 2026
0.0.68	Apr 23, 2026
0.0.67	Apr 23, 2026
0.0.66	Apr 23, 2026
0.0.65	Apr 22, 2026
0.0.64	Apr 22, 2026
0.0.63	Apr 21, 2026
0.0.62	Apr 21, 2026
0.0.61	Apr 21, 2026
0.0.60	Apr 21, 2026
0.0.59	Apr 21, 2026
0.0.58	Apr 21, 2026
0.0.57	Apr 21, 2026
0.0.56	Apr 21, 2026
0.0.55	Apr 21, 2026
0.0.54	Apr 21, 2026
0.0.53	Apr 21, 2026
0.0.52	Apr 21, 2026
0.0.51	Apr 21, 2026
0.0.50	Apr 21, 2026
0.0.49	Apr 21, 2026
0.0.48	Apr 21, 2026
0.0.47	Apr 21, 2026
0.0.46	Apr 21, 2026
0.0.45	Apr 21, 2026
0.0.44	Apr 18, 2026
0.0.43	Apr 18, 2026
0.0.42	Apr 17, 2026
0.0.41	Apr 17, 2026
0.0.40	Apr 17, 2026
0.0.39	Apr 17, 2026
0.0.38	Apr 17, 2026
0.0.37	Apr 17, 2026
0.0.36	Apr 17, 2026
0.0.35	Apr 17, 2026
0.0.34	Apr 17, 2026
0.0.33	Apr 17, 2026
0.0.32	Apr 17, 2026
0.0.31	Apr 17, 2026
0.0.30	Apr 17, 2026
0.0.29	Apr 17, 2026
0.0.28	Apr 17, 2026
0.0.27	Apr 16, 2026
0.0.26	Apr 16, 2026
0.0.25	Apr 13, 2026
0.0.24	Apr 13, 2026
0.0.23	Apr 12, 2026
0.0.22	Apr 12, 2026
0.0.21	Apr 12, 2026
0.0.20	Apr 12, 2026
0.0.19	Apr 12, 2026
0.0.18	Apr 12, 2026
0.0.17	Apr 12, 2026
0.0.16	Apr 12, 2026
0.0.15	Apr 12, 2026
0.0.14	Apr 12, 2026
0.0.13	Apr 12, 2026
0.0.1	Nov 25, 2025

Download files

Source Distribution

aftab-0.1.56.tar.gz (2.2 MB)

Uploaded May 19, 2026 Source

Built Distribution

aftab-0.1.56-py3-none-any.whl (75.4 kB)

Uploaded May 19, 2026 Python 3

File details

Details for the file aftab-0.1.56.tar.gz.

File metadata

Download URL: aftab-0.1.56.tar.gz
Upload date: May 19, 2026
Size: 2.2 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for aftab-0.1.56.tar.gz
Algorithm	Hash digest
SHA256	`135a31aac42a68cd9f6e4f710a519ed04dd467cd70dad333f999ad043a4a4452`
MD5	`ef786caafeb001a1a97d82ab892f3d1f`
BLAKE2b-256	`489333884a68884f1af6da6f1843c1354007c84e55d6289e636d34e2dd8f4da5`
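
A downloaded archive can be compared against the SHA256 digest listed above. The sketch below uses only the standard library; the local file name is an assumption about where the download was saved, and sha256_of is a helper written for this example.

```python
import hashlib
from pathlib import Path

# SHA256 digest copied from the table above.
EXPECTED_SHA256 = "135a31aac42a68cd9f6e4f710a519ed04dd467cd70dad333f999ad043a4a4452"


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


archive = Path("aftab-0.1.56.tar.gz")  # assumed local download location
if archive.exists():
    print("match" if sha256_of(archive) == EXPECTED_SHA256 else "MISMATCH")
```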

File details

Details for the file aftab-0.1.56-py3-none-any.whl.

File metadata

Download URL: aftab-0.1.56-py3-none-any.whl
Upload date: May 19, 2026
Size: 75.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for aftab-0.1.56-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0536c45587fe4a61dd1de5a37f00c7e2ef7f6600414b2b55a6ca5d38c3eede63`
MD5	`36ccefe803c10c59d897b16a8171011f`
BLAKE2b-256	`de905d220f007e2574372196e0cda791b8acb411aa0d84a2da0c34addd189433`
