
Python package for easy, data-efficient RL-based decision-making in business applications.

Project description

🤖 What is pi_optimal?

pi_optimal is an open-source Python library that helps you model, optimize, and control complex systems through Reinforcement Learning (RL). Whether your system involves advertising delivery, energy consumption, inventory management, or any scenario where sequential decision-making is paramount, pi_optimal provides a flexible and modular interface to train, evaluate, and deploy RL-based policies.

Built for data scientists, RL practitioners, and developers, pi_optimal:

  • Offers a time-series aware RL pipeline, handling lookback windows and forecasting future states.
  • Supports various action spaces (continuous, discrete, or multi-dimensional), enabling complex control strategies.
  • Integrates easily with custom reward functions, empowering you to tailor the agent’s objectives to your business goals.
  • Facilitates multi-step planning, allowing you to look ahead and optimize future outcomes, not just the immediate next step.
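To make the lookback idea concrete, here is a small illustration (not pi_optimal internals, just the general technique): with a lookback window, each state the agent sees spans the preceding `lookback` observations rather than only the latest one.

```python
import numpy as np

# Hypothetical temperature readings; each decision at time t is conditioned
# on the previous `lookback` observations, not just the most recent value.
temps = np.array([20.1, 20.4, 21.0, 21.3, 21.1, 20.8])
lookback = 3

# Build one state vector per timestep from the preceding `lookback` values.
states = np.lib.stride_tricks.sliding_window_view(temps, lookback)
print(states.shape)  # (4, 3): four states, each spanning three timesteps
```

This is what `lookback_timesteps` in the dataset configuration controls conceptually: a larger window gives the agent more temporal context at the cost of a larger state representation.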

If you find pi_optimal useful, consider joining our community and giving us a ⭐ on GitHub!


🎯 Why use pi_optimal?

In dynamic and complex systems, even experienced operators can struggle to find the best decisions at every step. pi_optimal helps you:

  • Automate Decision-Making: Reduce human overhead by letting RL agents handle routine optimization tasks.
  • Optimize Performance Over Time: Forecast system states and choose actions that yield smooth, cost-effective, or profit-maximizing trajectories.
  • Incorporate Uncertainty: Account for uncertainty in future outcomes with built-in approaches to handle uncertain environments.
  • Seamlessly Integrate with Your Workflow: pi_optimal fits easily with your existing code, data pipelines, and infrastructure.

🌐 Use Cases

  • Advertising Delivery Optimization: Smooth out ad impressions over time, ensuring efficient, controlled delivery that meets pacing and budget constraints.
  • Energy Management: Balance supply and demand, optimize resource allocation, and reduce operational costs.
  • Inventory and Supply Chain: Manage stock levels, forecast demand, and plan orders for just-in-time deliveries.
  • Dynamic Pricing and Bidding: Adjust bids, prices, and frequency caps in real-time to maximize revenue or reduce costs.

🚀 Getting Started

Installation

Install pi_optimal directly from PyPI using pip:

pip install pi-optimal

Once installed, you can explore the examples in the notebooks directory to see how to integrate pi_optimal into your projects.

Example Usage

Below is a simplified excerpt demonstrating how pi_optimal can be applied to optimize room climate control. For a more detailed walkthrough, refer to the notebooks.

import pandas as pd

from pi_optimal.agents.agent import Agent
from pi_optimal.datasets.timeseries_dataset import TimeseriesDataset
from pi_optimal.utils.trajectory_visualizer import TrajectoryVisualizer

# Load historical room climate control data
df_room_history = pd.read_csv('room_climate_history.csv')

# Prepare dataset: define states (e.g., room conditions), actions (e.g., heater settings), and reward (e.g., comfort level)
climate_dataset = TimeseriesDataset(
    df_room_history,
    state_columns=['temperature', 'humidity'],
    action_columns=['heater_power'],
    reward_column='comfort_score',
    timestep_column='timestamp',
    unit_index='room_id',
    lookback_timesteps=8
)

# Train a reinforcement learning agent for climate control
climate_agent = Agent()
climate_agent.train(dataset=climate_dataset)

# Load current room data to predict next actions
df_current_conditions = pd.read_csv('current_room_conditions.csv')
current_dataset = TimeseriesDataset(
    df_current_conditions,
    dataset_config=climate_dataset.dataset_config,
    train_processors=False,
    is_inference=True
)

# Predict optimal heater settings for improved comfort
optimal_actions = climate_agent.predict(current_dataset)
print(optimal_actions)


# Playground that shows the simulated result of the optimal actions and lets you test different actions
trajectory_visualizer = TrajectoryVisualizer(climate_agent, current_dataset, best_actions=optimal_actions)
trajectory_visualizer.display()
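The snippet above assumes `room_climate_history.csv` is in long format: one row per room per timestep, with the state, action, and reward columns named in the `TimeseriesDataset` call. A minimal synthetic stand-in (column names taken from the example, values invented for illustration) might look like:

```python
import pandas as pd

# Synthetic stand-in for room_climate_history.csv; column names match the
# TimeseriesDataset call above, the values are made up for illustration.
df_room_history = pd.DataFrame({
    "room_id":       [1, 1, 1, 2, 2, 2],
    "timestamp":     pd.date_range("2024-01-01", periods=3, freq="h").tolist() * 2,
    "temperature":   [19.5, 20.2, 20.9, 22.1, 21.8, 21.4],
    "humidity":      [0.45, 0.44, 0.43, 0.50, 0.51, 0.52],
    "heater_power":  [0.8, 0.6, 0.4, 0.0, 0.0, 0.1],
    "comfort_score": [0.6, 0.8, 0.9, 0.7, 0.8, 0.9],
})
print(df_room_history.shape)  # (6, 6)
```

Each `room_id` is treated as a separate unit (`unit_index`), and rows within a unit are ordered by `timestep_column`.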

✨ Features

  1. Time-Series Aware RL:
    Directly handle sequences, lookback windows, and rolling state representations.

  2. Flexible Action Spaces:
    Support for continuous and discrete actions, or complex multidimensional action vectors.

  3. Custom Reward Functions:
    Easily define domain-specific rewards to reflect real-world KPIs.

  4. Multi-Step Planning:
    Implement look-ahead strategies that consider future impacts of current actions.

  5. Data Processing and Visualization:
    Built-in tools for dataset preparation, trajectory visualization, and iterative evaluation.
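Because the reward is read from a column of the dataset, domain-specific rewards can also be computed upstream with plain pandas before constructing the dataset. Here is a hypothetical comfort score (the setpoint, weight, and column names are assumptions for illustration) that penalizes deviation from a 21 °C target and heavy heater use:

```python
import pandas as pd

def comfort_score(df: pd.DataFrame, setpoint: float = 21.0,
                  energy_weight: float = 0.1) -> pd.Series:
    """Hypothetical reward: highest (zero) when the temperature sits at the
    setpoint, with a small additional penalty for heater energy use."""
    temp_penalty = (df["temperature"] - setpoint).abs()
    energy_penalty = energy_weight * df["heater_power"]
    return -(temp_penalty + energy_penalty)

df = pd.DataFrame({"temperature": [21.0, 19.0], "heater_power": [0.0, 1.0]})
rewards = comfort_score(df)
```

The resulting series can be attached as the `reward_column` of the training data, letting the agent optimize exactly the KPI you care about.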


📖 Documentation

  • Tutorials & Examples: Walk through real-world examples to understand how to best apply pi_optimal.
  • API Reference: Detailed documentation for all classes, methods, and functions.
  • Best Practices: Learn recommended strategies for defining rewards, choosing architectures, and tuning hyperparameters.

Read the Docs »


🤝 Contributing and Community

We welcome contributions from the community! If you have feature requests, bug reports, or want to contribute code:

  • Open an issue on GitHub Issues.
  • Submit a pull request with your proposed changes.
  • Join our Discord community to ask questions, share ideas, or get help.

A big thanks to all contributors who make pi_optimal better every day!


🙋 Get Help

If you have questions or need assistance, the fastest way to get answers is via our Discord channel. Drop by and say hello!


🔨 Development

If you want to contribute to pi_optimal, we recommend using Poetry for managing dependencies and environments. Follow these steps to set up your development environment:

  1. Deactivate any active virtual environments:
    Ensure you are not already in a virtual environment (for example, use conda deactivate if you are using Conda).

  2. Install Poetry (if you haven't already):

    pipx install poetry
    
  3. Clone the repository and navigate into its directory:

    git clone https://github.com/pi-optimal/pi-optimal.git
    cd pi-optimal
    
  4. Install the project dependencies using Poetry:

    poetry install
    

Note on PyTorch Versions:

By default, pi_optimal installs the CPU-only version of PyTorch (pytorch-cpu). If you'd like to use a CUDA-enabled (GPU) version of PyTorch, simply run:

poetry remove torch
poetry add torch@^2.3

This ensures you have the correct version of PyTorch for your system's GPU acceleration.

Next Steps: Running Notebooks

Once the installation is complete, you can open any notebook from the notebooks directory. When launching Jupyter, select the virtual environment created by Poetry (it should appear with a name similar to pi-optimal-xyz-py3.10).

If you don’t see this environment, you might need to run:

poetry run ipython kernel install --user --name=pi-optimal

This command will register your new environment as an option in Jupyter.

Now you’re all set—ready to code! 🚀

🌱 Roadmap

We will publish our roadmap in the upcoming weeks. Have suggestions or would like to see a new feature prioritized? Let us know in our Discord or open an issue.


📜 License

pi_optimal is distributed under the GNU Affero General Public License (AGPL). See LICENSE for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pi_optimal-0.1.4.tar.gz (54.7 kB)

Uploaded Source

Built Distribution


pi_optimal-0.1.4-py3-none-any.whl (63.1 kB)

Uploaded Python 3

File details

Details for the file pi_optimal-0.1.4.tar.gz.

File metadata

  • Download URL: pi_optimal-0.1.4.tar.gz
  • Upload date:
  • Size: 54.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.2 CPython/3.10.17 Linux/6.8.0-1021-azure

File hashes

Hashes for pi_optimal-0.1.4.tar.gz
Algorithm Hash digest
SHA256 dba07367e4a2a85bccdb04a05d0302ad3534d4b738be8abdf6c1c74d9192dea0
MD5 0d3c9cd3bafb79bd5581043d190ce1d5
BLAKE2b-256 fb5faae5510e3c003e7bc01da36af58ba565e54f777367831d85ccfbb9d15b09

See more details on using hashes here.

File details

Details for the file pi_optimal-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: pi_optimal-0.1.4-py3-none-any.whl
  • Upload date:
  • Size: 63.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.2 CPython/3.10.17 Linux/6.8.0-1021-azure

File hashes

Hashes for pi_optimal-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 7a1a8660330646fb1b9d5082fac4dcc63f91253280f28cdadde57b5df2dc37f5
MD5 76a7a327d9be444ea7d05c7632cfedb3
BLAKE2b-256 e6e7255cfc1daa1dda89b29921306e54ab51de696af569a9edf10a8aa1bad40a

See more details on using hashes here.
