Agentic Research and Evaluation Suite
Project description
ARES: Agentic Research & Evaluation Suite
ARES is an RL-first framework for training and evaluating LLM agents, especially coding agents.
It is a modern gym: the environment layer powering RL research.
Quick Start
Pre-requisites
- Python >= 3.12
Getting Started
Install with uv:
uv add martian-ares
ARES comes packaged with useful presets for different code agent & environment configurations. List them with:
uv run python -c "import ares; print(ares.list_presets())"
You can get started by using this minimal loop to run mini-swe-agent on SWE-bench Verified sequentially.
Note: to run this particular example you will need:
- Docker (with the daemon running)
- A Martian API key (see below)
import asyncio
import ares
from ares import llms
async def main():
# This requires `CHAT_COMPLETION_API_KEY` to be set with a Martian API key--see below.
agent = llms.ChatCompletionCompatibleLLMClient(model="openai/gpt-5-mini")
async with ares.make("sbv-mswea") as env:
ts = await env.reset()
while not ts.last():
action = await agent(ts.observation) # observation = LLM request
ts = await env.step(action) # action = LLM response
print(f"{action}\n{ts}")
if __name__ == "__main__":
asyncio.run(main())
To run the example above you'll need a Martian API key set in your .env file. To get a key:
- Go to https://app.withmartian.com
- on the
Billingtab, add a payment method + top up some credits. - on the
API Keystab create an API key. - write
CHAT_COMPLETION_API_KEY={your-key}in your.env
Alternatively, you can use another chat completions-compatible endpoint by setting both:
CHAT_COMPLETION_API_BASE_URLCHAT_COMPLETION_API_KEY
Next Steps
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file martian_ares-0.0.2.tar.gz.
File metadata
- Download URL: martian_ares-0.0.2.tar.gz
- Upload date:
- Size: 112.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.1
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9ba59d95a7e243a2618bfb50d5b3b08086cb486eb16446a79bc1a81670b2dec7
|
|
| MD5 |
879fa306babb6c38c9ebc8b32e91b044
|
|
| BLAKE2b-256 |
cabd297b46d9f5cf7e4d226e8b95dc27a930ed2ec6a16c3663e80a65dffb82b3
|
File details
Details for the file martian_ares-0.0.2-py3-none-any.whl.
File metadata
- Download URL: martian_ares-0.0.2-py3-none-any.whl
- Upload date:
- Size: 103.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.1
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
641a586e96a5ac618bb4f07215345d3b63e59477988d1c17905e16b9136dd016
|
|
| MD5 |
3dbfd71cccfa2710644f398f8e804b7e
|
|
| BLAKE2b-256 |
081d41a7767c1690546be8006f05bc76051ea4c998a78f153d2c51e653378e6b
|