Skip to main content

No project description provided

Project description

AgentJet

Benchmarking Docs License Python

AgentJet (AJet) is a cutting-edge, user-friendly training framework designed to optimize agents and workflows (built with OpenAI SDK, AgentScope, and even vllm http requests), fine-tuning language model weights behind the scenes.

Simply provide your Agent workflow, training data, and reward function, and we will be ready to enhance your agents to their optimal performance!

💡 Minimum Example

Let's begin with the simplest example: a math agent with a tool call.

  • First, please check out the installation guide to set up the training environment.
  • Then, tune your first model using the minimum example.
    ajet --conf tutorial/example_math_agent/math_agent.yaml --backbone='verl' --with-ray
    

Features

We aim to build a easy-to-learn Agent tuner that unlock more possibilities for agent developers:

  • Easy and Friendly. AgentJet helps you tune models behind your agent workflows easily, optimizing your agents for top performance with minimal effort.
  • Rich Tutorial Library. AgentJet provides a rich library of examples as tutorials.
  • Efficient and Scalable. AgentJet uses [verl] as the default backbone (--backbone=verl). However, we also support trinity as alternative backbone, accelerating your tuning process via fully asynchronous RFT.
  • Flexible and Fast. AgentJet supports multi-agent workflows and adopts a context merging technique, accelerating training by 1.5x to 10x when the workflow involves multi-turn (or multi-agent) conversations.
  • Reliability and Reproducibility. Our team keeps track of framework performance across multiple tasks + major-git-version + training-backbones (under construction, still gathering data, comming soon).

For advanced researchers, AgentJet also provides high-resolution logging and debugging solutions:

  • High-Resolution Logging: AgentJet allows users to save and inspect token-level rollout details, recording token IDs, token loss masks, and even token logprobs to facilitate workflow development and agent diagnostics.
  • Fast Debugging: AgentJet also provides the --backbone=debug option for the best debugging experience, shortening your wait period from minutes to seconds after code changes and enabling breakpoint debugging in IDEs.

🚀 Quick Start

Installation

We recommend using uv for dependency management.

  1. Clone the Repository:
git clone https://github.com/modelscope/AgentJet.git
cd AgentJet
  1. Set up Environment:
uv venv --python=3.10.16 && source .venv/bin/activate
uv pip install -e .[trinity]
# Note: flash-attn must be installed after other dependencies
uv pip install flash_attn==2.8.1 --no-build-isolation --no-cache-dir

Run Training

You can start training your first agent with a single command using a pre-configured YAML file. Take the Math agent as an example:

ajet --conf tutorial/example_math_agent/math_agent.yaml

Example Library

Explore our rich library of examples to kickstart your journey:


🧩 Core Concepts

AgentJet makes agent fine-tuning straightforward by separating the developer interface from the internal execution logic.

image

1. The User-Centric Interface

To optimize an agent, you provide three core inputs:

  • Trainable Workflow: Define your agent logic by inheriting the Workflow class, supporting both simple agent setups and advanced multi-agent collaborations.
  • Task Reader: Load training tasks from JSONL files, HuggingFace datasets, interactive environments, or auto-generate them from documents.
  • Task Judger: Evaluates agent outputs and assigns rewards to guide training.

2. Internal System Architecture

The internal system orchestrates several specialized modules to handle the complexities of RL training and agent interactions.

  • Launcher: Manages background service processes (Ray, vLLM) and routes the backbone.
  • Task Reader: Handles data ingestion, augmentation, and filtering.
  • Task Rollout: Bridges LLM engines and manages the Gym environment lifecycle.
  • Task Runner: Executes the Agent workflow and calculates rewards.
  • Model Tuner: Forwards inference requests from the workflow to the LLM engine.
  • Context Tracker: Monitors LLM calls and automatically merges shared-history timelines to improve training efficiency by 1.5x to 10x.

🚦 Navigation

🗺️ Roadmap

AgentJet is a constantly evolving project. We are planning to add the following features in the near future.

  • Advanced LLM-based multi-agent reinforcement learning.
  • Training dataset generation from few-shot samples.
  • Prompt tuning.
  • Multi-modal training support.
  • Cross-process Tuner wrapper to pass though process forking.
  • Providing training → user feedback → data augmentation → retraining data flywheel example.
  • Optimize configurations for long-context adaptation on smaller GPUs.
  • Add LoRA training examples.
  • Covering LangGraph and AutoGen frameworks.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

agentjet-0.0.1-py3-none-any.whl (172.8 kB view details)

Uploaded Python 3

File details

Details for the file agentjet-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: agentjet-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 172.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.12

File hashes

Hashes for agentjet-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 4a2b4f7c6360360d06e53ceeadc9834e6216e93a1e2bb81de501c3d9d41df968
MD5 4bf8d2277f465515942f529b0f2cb23b
BLAKE2b-256 e658409b4527356c5b81098cbc7e8ec7230644dcd4b42ac2d83814cb16bd2f44

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page