Open-source framework for Reinforcement Learning integrated with Large Language Models

Project description

RL-LLM Toolkit

Democratizing Reinforcement Learning with Large Language Models

License: MIT Python 3.10+

🚀 Overview

RL-LLM Toolkit is an open-source framework that integrates Reinforcement Learning with Large Language Models to build intelligent agents, aimed at beginners and researchers alike. By simulating human feedback with LLMs, it reduces RLHF annotation costs by up to 50% while maintaining training quality.

Key Features

  • 🎮 Gymnasium-Compatible Environments: Easy-to-use RL environments for games, finance, and robotics
  • 🤖 LLM-Powered Rewards: Generate dense rewards using local or API-based LLMs
  • 📊 State-of-the-Art Algorithms: PPO, DQN, and more with modular architecture
  • 🔧 Plug-and-Play Design: Swap algorithms, environments, and LLMs effortlessly
  • 📚 Educational Focus: Interactive Jupyter notebooks and comprehensive tutorials
  • 🌐 Hugging Face Integration: Share models and datasets with the community

🎯 Quick Start

# Install the toolkit
pip install rl-llm-toolkit

# Run a simple example
python -m rl_llm_toolkit.examples.cartpole

📦 Installation

From PyPI (Coming Soon)

pip install rl-llm-toolkit

From Source

git clone https://github.com/tonipcv/hugo.git
cd hugo
pip install -e .

Optional Dependencies

# For LLM integration
pip install -e ".[llm]"

# For development
pip install -e ".[dev]"

# For all features
pip install -e ".[all]"

💡 Usage Example

from rl_llm_toolkit import RLEnvironment, PPOAgent, LLMRewardShaper
from rl_llm_toolkit.llm import OllamaBackend

# Create environment
env = RLEnvironment("CartPole-v1")

# Set up LLM-based reward shaping
llm = OllamaBackend(model="llama3")
reward_shaper = LLMRewardShaper(llm, prompt_template="custom_template")

# Train agent
agent = PPOAgent(env, reward_shaper=reward_shaper)
agent.train(total_timesteps=100000)

# Evaluate
agent.evaluate(episodes=10, render=True)

๐Ÿ—๏ธ Architecture

rl-llm-toolkit/
├── rl_llm_toolkit/          # Core package
│   ├── agents/              # RL algorithms (PPO, DQN, etc.)
│   ├── environments/        # Custom environments
│   ├── llm/                 # LLM integrations
│   ├── rewards/             # Reward shaping utilities
│   ├── utils/               # Helper functions
│   └── cli/                 # Command-line tools
├── examples/                # Example scripts and notebooks
├── tests/                   # Test suite
└── docs/                    # Documentation
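The plug-and-play design implied by this layout — separate `agents/`, `environments/`, and `llm/` packages — typically rests on small shared interfaces: every LLM backend exposes the same method, so reward shapers and agents do not care whether Ollama or OpenAI is plugged in. A hedged sketch of that idea, with illustrative names rather than the toolkit's actual classes:

```python
from abc import ABC, abstractmethod


class LLMBackend(ABC):
    """Common interface implemented by every backend, local or API-based."""
    @abstractmethod
    def complete(self, prompt: str) -> str: ...


class EchoBackend(LLMBackend):
    """Trivial offline backend used here in place of Ollama or OpenAI."""
    def complete(self, prompt: str) -> str:
        # A real backend would query a model; this one returns a fixed score.
        return "score: 1.0"


def shape_reward(backend: LLMBackend, observation) -> float:
    """Ask whichever backend is plugged in for a score, parse it to a float."""
    reply = backend.complete(f"Rate this state: {observation}")
    return float(reply.split()[1])


bonus = shape_reward(EchoBackend(), [0.1, 0.2])
```

Swapping backends then means constructing a different `LLMBackend` subclass; nothing downstream changes.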

🎓 Examples

  • CartPole with LLM Feedback: Train a classic control agent with GPT-4 reward shaping
  • Crypto Trading Bot: Build a trading agent using historical data and LLM market analysis
  • Multi-Agent Game: Coordinate multiple agents in a competitive environment

๐Ÿค Contributing

We welcome contributions! See CONTRIBUTING.md for guidelines.

📊 Roadmap

Now (0-3 months)

  • ✅ Core RL framework with PPO/DQN
  • ✅ Basic LLM integration (Ollama, OpenAI)
  • 🔄 Interactive examples and tutorials
  • 🔄 Comprehensive documentation

Next (3-6 months)

  • Offline RL support
  • Financial trading environments
  • Hugging Face model hub integration
  • Community leaderboards

Later (6-12 months)

  • Real-time collaboration features
  • Video reasoning integration
  • Advanced multi-agent systems
  • Research partnerships

📄 License

MIT License - see LICENSE for details.

๐Ÿ™ Acknowledgments

Inspired by projects like PufferLib, Neural MMO, and the broader open-source RL community.

📬 Contact


Star โญ this repo if you find it useful!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rl_llm_toolkit-0.2.0.tar.gz (101.6 kB)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

rl_llm_toolkit-0.2.0-py3-none-any.whl (66.3 kB)

Uploaded Python 3

File details

Details for the file rl_llm_toolkit-0.2.0.tar.gz.

File metadata

  • Download URL: rl_llm_toolkit-0.2.0.tar.gz
  • Upload date:
  • Size: 101.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.3

File hashes

Hashes for rl_llm_toolkit-0.2.0.tar.gz
Algorithm Hash digest
SHA256 bff53b096b2f1b4df344f57397455ea788364b4eaacdf849933d8964be901c1a
MD5 cdefe9af2d82245d339018e1db3fc50b
BLAKE2b-256 6215b09fa5c021c5221a2ab6277817e6a003539fe004198711a1be48c8d461d4

See the PyPI documentation for more details on using hashes.
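The SHA256 digest above can be checked locally before installing from a downloaded archive. A small verification helper using only the standard library (the filename matches this listing; the assertion at the end is commented out because it assumes the sdist has already been downloaded):

```python
import hashlib

# SHA256 digest published for rl_llm_toolkit-0.2.0.tar.gz on this page.
EXPECTED_SHA256 = "bff53b096b2f1b4df344f57397455ea788364b4eaacdf849933d8964be901c1a"


def sha256_of(path, chunk_size=8192):
    """Hash the file in chunks so large archives need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Usage, assuming the sdist sits in the current directory:
# assert sha256_of("rl_llm_toolkit-0.2.0.tar.gz") == EXPECTED_SHA256
```

Alternatively, pip's hash-checking mode (`--require-hashes` with a pinned requirements file) performs the same verification automatically at install time.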

File details

Details for the file rl_llm_toolkit-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: rl_llm_toolkit-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 66.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.3

File hashes

Hashes for rl_llm_toolkit-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 5f53ec60604f9024017c0c392712701270ded6a3beec202395ad7917faf45e2b
MD5 3d9caa5e4a35a07c9c1d1814d6ebc9fc
BLAKE2b-256 13d0207d489f2e3b10f48de749a857f41d07455d6827ba8956a01b3123896af3

See the PyPI documentation for more details on using hashes.
