Package that will offer a small collection of different (Neural) Bandit algorithms with different feedback strategies.

These details have not been verified by PyPI

Project links

Project description

Calvera

Calvera is a Python library offering a collection of neural multi-armed bandit algorithms, designed to integrate seamlessly with PyTorch and PyTorch Lightning. Whether you're exploring contextual bandits or developing new strategies, Calvera provides a flexible, easy-to-use interface. You can bring your own neural networks and datasets while Calvera focuses on the implementation of the bandit algorithms.

Features

Multi-Armed Bandit Algorithms:
- (Approximate + Standard) Linear Thompson Sampling
- (Approximate + Standard) Linear UCB
- Neural Linear
- Neural Thompson Sampling
- Neural UCB
Customizable Selectors:
- ArgMaxSelector: Chooses the arm with the highest score.
- EpsilonGreedySelector: Chooses the best arm with probability 1-epsilon or a random arm with probability epsilon.
- TopKSelector: Selects the top k arms with the highest scores.
- EpsilonGreedyTopKSelector: Selects the top k arms with probability 1-epsilon or k random arms with probability epsilon.
Integration:
- Built on top of PyTorch Lightning for training and inference.
- Minimal adjustments needed to plug into your existing workflow.

Installation

Calvera is available on PyPI. Install it via pip:

pip install calvera

This installs the necessary dependencies for the base library. If you want to use parts of the benchmark subpackage we recommend installing the optional dependencies as well:

pip install calvera[benchmark]

For development you can install the development dependencies via:

pip install calvera[dev]

Quick Start

Below is a simple example using a Linear Thompson Sampling bandit:

import torch
import lightning as pl
from calvera.bandits import LinearTSBandit

# 1. Create a bandit for a linear model with 128 features.
N_FEATURES = 128
bandit = LinearTSBandit(n_features=N_FEATURES)

# 2. Generate sample data (batch_size, n_actions, n_features) and perform inference.
data = torch.randn(100, 1, N_FEATURES)
chosen_arms_one_hot, probabilities = bandit(data)
chosen_arms = chosen_arms_one_hot.argmax(dim=1)

# 3. Retrieve rewards for the chosen arms.
rewards = torch.randn(100, 1)

# 4. Add the data to the bandit.
chosen_contextualized_actions = data[:, :, chosen_arms]
bandit.record_feedback(chosen_contextualized_actions, rewards)

# 5. Train the bandit.
trainer = pl.Trainer(
    max_epochs=1,
    enable_progress_bar=False,
    enable_model_summary=False,
    accelerator=accelerator,
)
trainer.fit(bandit)

# (6. Repeat the process as needed)

For more detailed examples, see the examples page in the documentation.

Documentation

Bandits: Each bandit is implemented as a PyTorch Lightning Module with forward() for inference and training_step() for training.
Buffers: Data is managed via buffers that subclass AbstractBanditDataBuffer.
Selectors: Easily customize your arm selection strategy by using or extending the provided selectors.

Benchmarks & Experimental Results

Detailed benchmarks, datasets, and experimental results are available in the extended documentation. The configuration and even more specific results can be found in ./experiments under the specific sub-directories.

Contributing

Contributions are welcome! For guidelines on how to contribute, please refer to our CONTRIBUTING.md.

License

Calvera is licensed under the MIT License. See the LICENSE file for details.

Contact

For questions or feedback, please reach out to one of the authors:

Philipp Kolbe
Robert Weeke
Parisa Shahabinejad

Link to Agreement

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

1.0.2

Mar 14, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

calvera-1.0.2.tar.gz (66.3 kB view details)

Uploaded Mar 14, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

calvera-1.0.2-py3-none-any.whl (86.7 kB view details)

Uploaded Mar 14, 2025 Python 3

File details

Details for the file calvera-1.0.2.tar.gz.

File metadata

Download URL: calvera-1.0.2.tar.gz
Upload date: Mar 14, 2025
Size: 66.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.5

File hashes

Hashes for calvera-1.0.2.tar.gz
Algorithm	Hash digest
SHA256	`ccd36fb8e851d6c43471447af903eb009c65b430731599fb6396a72ae3e01c68`
MD5	`859e8d0d8e95182437afc119e601a4ba`
BLAKE2b-256	`37b33f1b2a4e1c2b6473d01b42e41a405d56502077ceb442b93d5204b306a3a9`

See more details on using hashes here.

File details

Details for the file calvera-1.0.2-py3-none-any.whl.

File metadata

Download URL: calvera-1.0.2-py3-none-any.whl
Upload date: Mar 14, 2025
Size: 86.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.5

File hashes

Hashes for calvera-1.0.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f8467004651ff86866c5a5ccdcc06df72a717946a7d86484af40626045e8cfa2`
MD5	`3b2865bf5ad4448c190b85fc3e2afe83`
BLAKE2b-256	`eaaeb097b53c923e5f9ac6b216ac8b04f5dcd712a2cec8b2895b029839928220`

See more details on using hashes here.

calvera 1.0.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Calvera

Features

Installation

Quick Start

Documentation

Benchmarks & Experimental Results

Contributing

License

Contact

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes