Skip to main content

Unified API for training and inference

Project description

SkyRL: A Modular Full-stack RL Library for LLMs

| Documentation | Twitter/X | Huggingface | Slack Workspace |


Overview

[!IMPORTANT] Note: SkyRL is undergoing a repo reorganization into the skyrl/ folder, which unifies the skyrl libraries below into a single package. The existing packages below are fully functional but will be migrated to new paths shortly. For full Tinker API support please use the skyrl/ folder. See the Tinker Quickstart docs to get started. See issue: https://github.com/NovaSky-AI/SkyRL/issues/1145

SkyRL is a full-stack RL library that provides the following components:

  • skyrl: Our new unified library for RL on your own hardware, with support for the Tinker API. skyrl combines our previous work:

    • skyrl-train: A modular, performant training framework for RL.
    • skyrl-tx: A cross-platform library implementing a backend for the Tinker API, with a unified engine for training and inference.
  • skyrl-agent: Our agent layer for training long-horizon, real-world agents. For exact reproduction of SkyRL-v0 results, please checkout to commit a0d50c482436af7fac8caffa4533616a78431d66.

  • skyrl-gym: Our gymnasium of tool-use tasks, including a library of math, coding, search and SQL environments implemented in the Gymnasium API.

Getting Started

For a guide on developing with SkyRL, take at look at our Development Guide docs.

For model training, checkout skyrl to start using, modifying, or building on top of the SkyRL training stack. See our quickstart docs to ramp up!

For building environments, checkout skyrl-gym to integrate your task in the simple gymnasium interface.

For agentic pipelines, check out skyrl-agent for our work on optimizing and scaling pipelines for multi-turn tool use LLMs on long-horizon, real-environment tasks.

News

  • [2026/02/17] 🎉 SkyRL is officially integrated with Harbor! Train your terminal-use agent! [Blog]
  • [2026/02/13] 🎉 SkyRL now implements the Tinker API! Run any training script written in the Tinker API on your local GPUs with SkyRL! [Blog]
  • [2025/11/26] 🎉 We released SkyRL-Agent: An agent layer for efficient, multi-turn, long-horizon agent training and evaluation. [Paper]
  • [2025/10/06] 🎉 We released SkyRL tx: An open implementation of a backend for the Tinker API to run a Tinker-like service on their own hardware. [Blog]
  • [2025/06/26] 🎉 We released SkyRL-v0.1: A highly-modular, performant RL training framework. [Blog]
  • [2025/06/26] 🎉 We released SkyRL-Gym: A library of RL environments for LLMs implemented with the Gymnasium API. [Blog]
  • [2025/05/20] 🎉 We released SkyRL-SQL: a multi-turn RL training pipeline for Text-to-SQL, along with SkyRL-SQL-7B — a model trained on just 653 samples that outperforms both GPT-4o and o4-mini!
  • [2025/05/06] 🎉 We released SkyRL-v0: our open RL training pipeline for multi-turn tool use LLMs, optimized for long-horizon, real-environment tasks like SWE-Bench!

Links

Projects using SkyRL

  • Biomni-R0: Using RL to Hill-Climb Biomedical Reasoning Agents to Expert-Level GitHub Repo stars
  • How to Train Your Advisor: Steering Black-Box LLMs with Advisor Models GitHub Repo stars
  • OpenThoughts-Agent: Data recipes and robust infrastructure for training AI agents GitHub Repo stars
  • Endless Terminals: A fully autonomous pipeline that procedurally generates terminal tasks for RL training with no human annotation needed GitHub Repo stars

Acknowledgement

This work is done at Berkeley Sky Computing Lab in collaboration with Anyscale, with generous compute support from AnyscaleDatabricks, NVIDIA, Lambda Labs, AMD, AWS, Modal, and Daytona.

We adopt many lessons and code from several great projects such as veRL, OpenRLHF, Search-R1, OpenReasonerZero, and NeMo-RL. We appreciate each of these teams and their contributions to open-source research!

Citation

If you find the work in this repository helpful, please consider citing:

@misc{cao2025skyrl,
  title     = {SkyRL-v0: Train Real-World Long-Horizon Agents via Reinforcement Learning},
  author    = {Shiyi Cao and Sumanth Hegde and Dacheng Li and Tyler Griggs and Shu Liu and Eric Tang and Jiayi Pan and Xingyao Wang and Akshay Malik and Graham Neubig and Kourosh Hakhamaneshi and Richard Liaw and Philipp Moritz and Matei Zaharia and Joseph E. Gonzalez and Ion Stoica},
  year      = {2025},
}
@misc{liu2025skyrlsql,
      title={SkyRL-SQL: Matching GPT-4o and o4-mini on Text2SQL with Multi-Turn RL},
      author={Shu Liu and Sumanth Hegde and Shiyi Cao and Alan Zhu and Dacheng Li and Tyler Griggs and Eric Tang and Akshay Malik and Kourosh Hakhamaneshi and Richard Liaw and Philipp Moritz and Matei Zaharia and Joseph E. Gonzalez and Ion Stoica},
      year={2025},
}
@misc{griggs2025skrylv01,
      title={Evolving SkyRL into a Highly-Modular RL Framework},
      author={Tyler Griggs and Sumanth Hegde and Eric Tang and Shu Liu and Shiyi Cao and Dacheng Li and Charlie Ruan and Philipp Moritz and Kourosh Hakhamaneshi and Richard Liaw and Akshay Malik and Matei Zaharia and Joseph E. Gonzalez and Ion Stoica},
      year={2025},
      note={Notion Blog}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

skyrl-0.1.0.tar.gz (580.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

skyrl-0.1.0-py3-none-any.whl (417.5 kB view details)

Uploaded Python 3

File details

Details for the file skyrl-0.1.0.tar.gz.

File metadata

  • Download URL: skyrl-0.1.0.tar.gz
  • Upload date:
  • Size: 580.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.6 {"installer":{"name":"uv","version":"0.10.6","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"22.04","id":"jammy","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for skyrl-0.1.0.tar.gz
Algorithm Hash digest
SHA256 4d9fd2bf7b4a0decfa4d841b23114cc61f4f23174bb50f09c014e5f1d17a7a81
MD5 211467047d5fc8204ca786dd43b4f68e
BLAKE2b-256 82f3f4183b2faf9bcb551e7ccbe86357df43d8d72551a1af45e8dc3dee8647cf

See more details on using hashes here.

File details

Details for the file skyrl-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: skyrl-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 417.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.6 {"installer":{"name":"uv","version":"0.10.6","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"22.04","id":"jammy","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for skyrl-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ac1b8d6203f3003e61fe353e9dbfdd1165abe91907238ac2f8fc259a3b2bb528
MD5 f05ad67231904324d8e48d5f87c215b5
BLAKE2b-256 4e6106045245beed5bac6aa4e5055ac8a6afc0f74a330d140ec5e3194964c7a1

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page