Agent Gym - PyTorch
Agent Gym
Agent Gym is a framework for training and evaluating reinforcement learning agents in a gym-like environment.
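For readers unfamiliar with the term, a "gym-like" environment usually means an object with a `reset` method that starts an episode and a `step` method that takes an action and returns an observation, a reward, and a done flag. The sketch below only illustrates that convention in plain Python. It is not agentgym's API: `CoinFlipEnv` and `run_episode` are hypothetical names invented for this example.

```python
import random


class CoinFlipEnv:
    """Hypothetical toy environment: guess a coin flip for a fixed number of steps."""

    def __init__(self, episode_length=10, seed=0):
        self.episode_length = episode_length
        self.rng = random.Random(seed)
        self.steps_taken = 0

    def reset(self):
        """Start a new episode and return the initial observation."""
        self.steps_taken = 0
        return 0  # the observation carries no information in this toy task

    def step(self, action):
        """Apply an action (0 or 1) and return (observation, reward, done)."""
        coin = self.rng.randint(0, 1)
        reward = 1.0 if action == coin else 0.0
        self.steps_taken += 1
        done = self.steps_taken >= self.episode_length
        return 0, reward, done


def run_episode(env, policy):
    """Roll out one episode with the given policy and return the total reward."""
    observation = env.reset()
    total_reward, done = 0.0, False
    while not done:
        observation, reward, done = env.step(policy(observation))
        total_reward += reward
    return total_reward


# Evaluate a trivial policy that always guesses 1.
env = CoinFlipEnv(episode_length=10)
print(run_episode(env, policy=lambda observation: 1))
```

Evaluating an agent then amounts to calling `run_episode` repeatedly and averaging the returns. A training loop would additionally update the policy between episodes.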
Usage
from agentgym.r1_pipeline import R1Pipeline, SFTConfig
r1_pipeline = R1Pipeline(sft_model="gpt2", sft_dataset="stanfordnlp/imdb", sft_args=SFTConfig(output_dir="/tmp"))
r1_pipeline.run()
License
MIT
Download files
Source Distribution
agentgym-0.0.1.tar.gz (7.0 kB)
Built Distribution
agentgym-0.0.1-py3-none-any.whl (7.6 kB)
File details
Details for the file agentgym-0.0.1.tar.gz.
File metadata
- Download URL: agentgym-0.0.1.tar.gz
- Upload date:
- Size: 7.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.3 CPython/3.12.8 Darwin/23.3.0
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 6e0211b994debbda1f8f46743f8bae04dbde8f31ce207b383fe857d2dd53390d |
| MD5 | af94e627ea43c9a121eedfd591b849e5 |
| BLAKE2b-256 | 2fe80e269d0ec92674f939dfaf4916c4f8b77e0ee09a6af495b41c230acf0804 |
File details
Details for the file agentgym-0.0.1-py3-none-any.whl.
File metadata
- Download URL: agentgym-0.0.1-py3-none-any.whl
- Upload date:
- Size: 7.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.3 CPython/3.12.8 Darwin/23.3.0
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 515ed5165d01a45137407bb238202c0d424fd294a451dfd0901278c9e33525fb |
| MD5 | 345ad0a526594a518e3a234c3924b2c2 |
| BLAKE2b-256 | 0c2608ded15bfdbc0219ba1e9c96c4ed7cec63d3e18b71dfafb39a4974d02939 |
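
The SHA256 digests listed above can be used to check that a downloaded file is intact. The following is a minimal sketch using only Python's standard `hashlib` module. The helper names `sha256_of` and `matches_published` are made up for this example, and it assumes the file was saved locally under its original file name.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the file hash tables for release 0.0.1.
PUBLISHED_SHA256 = {
    "agentgym-0.0.1.tar.gz": "6e0211b994debbda1f8f46743f8bae04dbde8f31ce207b383fe857d2dd53390d",
    "agentgym-0.0.1-py3-none-any.whl": "515ed5165d01a45137407bb238202c0d424fd294a451dfd0901278c9e33525fb",
}


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at `path`, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published(path):
    """Return True if the file's digest equals the published digest for its file name."""
    expected = PUBLISHED_SHA256.get(Path(path).name)
    return expected is not None and sha256_of(path) == expected
```

For example, `matches_published("agentgym-0.0.1.tar.gz")` should return `True` for an unmodified download of the source distribution, and `False` for a corrupted file or an unknown file name.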