Tools for running a SWE agent using Composio platform

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

Project description

SWE Development Kit

SWE Development Kit

Overview

swekit is a framework for building SWE agents on by utilising composio tooling ecosystem. SWE Kit allows you to

Scaffold agents which works out-of-the-box with choice of your agentic framework, crewai, llamaindex, etc...
Tools to add or optimise your agent's abilities
Benchmark your agents against SWE-bench

Dependencies

Before getting started, ensure you have the following set up:

Installation:
```
pip install swekit composio-core
```
Install agentic framework of your choice and the Composio plugin for the same: Here we're using crewai for the example:
```
pip install crewai composio-crewai
```
GitHub Access Token:

The agent requires a github access token to work with your repositories, You can create one at https://github.com/settings/tokens with necessary permissions and export it as an environment variable using export GITHUB_ACCESS_TOKEN=<your_token>
LLM Configuration: You also need to setup API key for the LLM provider you're planning to use. By default the agents scaffolded by swekit uses openai client, so export OPENAI_API_KEY before running your agent

Getting Started

Creating a new agent

Scaffold your agent using:
```
swekit scaffold crewai -o <path>
```
This creates a new agent in <path>/agent with four key files:
- main.py: Entry point to run the agent on your issue
- agent.py: Agent definition (edit this to customise behaviour)
- prompts.py: Agent prompts
- benchmark.py: SWE-Bench benchmark runner
Run the agent:
```
cd agent
python main.py
```
You'll be prompted for the repository name and issue.

Docker Environment

The SWE-agent runs in Docker by default for security and isolation. This sandboxes the agent's operations, protecting against unintended consequences of arbitrary code execution.

To run locally instead, modify workspace_env in agent/agent.py. Use caution, as this bypasses Docker's protective layer.

Running the Benchmark

SWE-Bench is a comprehensive benchmark designed to evaluate the performance of software engineering agents. It comprises a diverse collection of real-world issues from popular Python open-source projects, providing a robust testing environment.

To run the benchmark:

Ensure Docker is installed and running on your system.
Execute the following command:
```
cd agent
python benchmark.py --test-split=<test_split>
```
- By default, python benchmark.py runs only 1 test instance.
- Specify a test split ratio to run more tests, e.g., --test-split=1:300 runs 300 tests.

To run the benchmarks in E2B or FlyIO sandbox, you can set the workspace_env in the evaluate function call in benchmark.py

from composio import ExecEnv

(...)

    evaluate(
        bench,
        dry_run=False,
        test_range=test_range,
        test_instance_ids=test_instance_ids_list,
        workspace_env=ExecEnv.E2B
    )

To use E2B or FlyIO sandboxes you'll require API key for respective platforms, to use E2B export your API key as E2B_API_KEY and to use FlyIO export your API token as FLY_API_TOKEN.

Note: We utilize SWE-Bench-Docker to ensure each test instance runs in an isolated container with its specific environment and Python version.

To extend the functionality of the SWE agent by adding new tools or extending existing ones, refer to the Development Guide.

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

Release history Release notifications | RSS feed

0.1.1

Jul 23, 2024

0.1.0

Jul 23, 2024

This version

0.1.0rc4 pre-release

Jul 16, 2024

0.1.0rc3 pre-release

Jul 12, 2024

0.1.0rc2 pre-release

Jul 10, 2024

0.1.0rc1 pre-release

Jul 9, 2024

0.1.0rc0 pre-release

Jul 9, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

swekit-0.1.0rc4.tar.gz (47.7 kB view hashes)

Uploaded Jul 16, 2024 Source

Built Distribution

swekit-0.1.0rc4-py3-none-any.whl (57.6 kB view hashes)

Uploaded Jul 16, 2024 Python 3

Hashes for swekit-0.1.0rc4.tar.gz

Hashes for swekit-0.1.0rc4.tar.gz
Algorithm	Hash digest
SHA256	`09004abaac00e270c058d85a16f3ee2ad50485e83af5c72c93c5b2c0c73d5ef5`
MD5	`5d18ab07a1f735ce52fb1bedd7ffd7d7`
BLAKE2b-256	`dd3a6ec7c82224837370d069094a95f839342dcf92f5e7513cf4fcb16b4bc436`

Hashes for swekit-0.1.0rc4-py3-none-any.whl

Hashes for swekit-0.1.0rc4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2622c3ec13d4db2ab2cb26b193b8d6b0b46386e4d202fd5bf135f46e3f988db3`
MD5	`273f71d572d538d2b017fcde7267a0a4`
BLAKE2b-256	`e7022f343844ed2e1a9dd93d684fb7ebc25930c0d827934f3fcd1daa51002e84`