Skip to main content

Agentic Research and Evaluation Suite

Project description

ARES: Agentic Research & Evaluation Suite

Documentation PyPI version License

image

ARES is an RL-first framework for training and evaluating LLM agents, especially coding agents.

It is a modern gym: the environment layer powering RL research.

Quick Start

Pre-requisites

  • Python >= 3.12

Getting Started

Install with uv:

uv add martian-ares

ARES comes packaged with useful presets for different code agent & environment configurations. List them with:

uv run python -c "import ares; print(ares.list_presets())"

You can get started by using this minimal loop to run mini-swe-agent on SWE-bench Verified sequentially.

Note: to run this particular example you will need:

  • Docker (with the daemon running)
  • A Martian API key (see below)
import asyncio

import ares
from ares import llms

async def main():
    # This requires `CHAT_COMPLETION_API_KEY` to be set with a Martian API key--see below.
    agent = llms.ChatCompletionCompatibleLLMClient(model="openai/gpt-5-mini")

    async with ares.make("sbv-mswea") as env:
        ts = await env.reset()
        while not ts.last():
            action = await agent(ts.observation)   # observation = LLM request
            ts = await env.step(action)            # action = LLM response
            print(f"{action}\n{ts}")

if __name__ == "__main__":
    asyncio.run(main())

To run the example above you'll need a Martian API key set in your .env file. To get a key:

  1. Go to https://app.withmartian.com
  2. on the Billing tab, add a payment method + top up some credits.
  3. on the API Keys tab create an API key.
  4. write CHAT_COMPLETION_API_KEY={your-key} in your .env

Alternatively, you can use another chat completions-compatible endpoint by setting both:

  • CHAT_COMPLETION_API_BASE_URL
  • CHAT_COMPLETION_API_KEY

Next Steps

  1. Check out the examples
  2. Read the docs to understand ARES and its key abstractions

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

martian_ares-0.0.2.tar.gz (112.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

martian_ares-0.0.2-py3-none-any.whl (103.9 kB view details)

Uploaded Python 3

File details

Details for the file martian_ares-0.0.2.tar.gz.

File metadata

  • Download URL: martian_ares-0.0.2.tar.gz
  • Upload date:
  • Size: 112.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.1

File hashes

Hashes for martian_ares-0.0.2.tar.gz
Algorithm Hash digest
SHA256 9ba59d95a7e243a2618bfb50d5b3b08086cb486eb16446a79bc1a81670b2dec7
MD5 879fa306babb6c38c9ebc8b32e91b044
BLAKE2b-256 cabd297b46d9f5cf7e4d226e8b95dc27a930ed2ec6a16c3663e80a65dffb82b3

See more details on using hashes here.

File details

Details for the file martian_ares-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: martian_ares-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 103.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.1

File hashes

Hashes for martian_ares-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 641a586e96a5ac618bb4f07215345d3b63e59477988d1c17905e16b9136dd016
MD5 3dbfd71cccfa2710644f398f8e804b7e
BLAKE2b-256 081d41a7767c1690546be8006f05bc76051ea4c998a78f153d2c51e653378e6b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page