
Open Agent Tools Coder

Open Agent Tools (oats) enables small-to-large self-hosted AI models to use local source code when running tool-calling agentic workloads. We actively data mine 20,970+ popular GitHub repos (2+ TB) using large and small AI models to create reusable JSON, Markdown, and Parquet files for local-first tool-calling models.

How does it work? Over multiple passes, we compile and export a fast, compressed prompt index for all Python source code in any repo. Agents consult the local prompt index to reuse already-written source code on disk, instead of going over HTTP with MCP or paying an expensive frontier AI model to rebuild something that already works locally. We use oats to free up large-model token usage by delegating local tool-calling to smaller, open-source AI models.
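At its core, a prompt index scores a query against short function descriptions and returns the best match. The real oats index uses BM25 retrieval; the snippet below is a simplified token-overlap stand-in with a made-up two-entry index, just to illustrate the idea:

```python
# Simplified sketch of a prompt index (the real index uses BM25):
# score each function's description against a query by token overlap
# and return the best-matching function name.
INDEX = {
    "get_third_friday_dates": "generate third Friday dates for the next 6 months",
    "get_utc_datetime": "get the current timezone-aware UTC datetime",
}

def best_tool(query: str) -> str:
    q = set(query.lower().split())
    scores = {
        name: len(q & set(desc.lower().split()))
        for name, desc in INDEX.items()
    }
    return max(scores, key=scores.get)
```

With this stand-in, the query "get third friday" overlaps the first description on two tokens and the second on one, so the first function wins.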

Open Agent Tools (oats) - Architecture - Intro Tool Calling Pipeline for Powering Up Small AI Models

Example Knowledge Graph with Semantic Tree for Litigation Tool-Calling

Supported Coder Slash Commands

By default, if the prompt does not start with a / character, coder treats the message as a chat message.

Here are the supported internal slash commands:

  • /help - supported usage
  • /mode - change mode
  • /approve - toggle auto approval mode
  • /browse - browse to a URL using Playwright, with support for storing results as JSON or Parquet on S3
  • /clear - clear the session
  • /session - view the session
  • /cost - view token usage
  • /config - view the config
  • /profile - view the coder profile feature flags
  • /files - view the current files
  • /diff - view the git diff for the repo (assuming coder is running in a git repo)
  • /log - view the logs
  • /history - view the chat history
  • /tools - view the default tools
  • /model - view the current provider model
  • /models - view the models
  • /new - new session
  • /switch - switch provider
  • /provider - view the current provider
  • /compact - compact the chat session to reduce the token context window. This happens automatically, but this command allows for manual context control.
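The dispatch rule above is simple to sketch for anyone writing a client. The helper below is illustrative (not the coder source): it routes a message to a slash-command handler when it starts with /, and treats everything else as a chat message.

```python
# Illustrative routing sketch, not the actual coder implementation:
# a leading "/" selects a slash command; anything else is chat.
def route_message(message: str) -> tuple[str, str]:
    """Return ("command", name) for slash commands, ("chat", text) otherwise."""
    text = message.strip()
    if text.startswith("/"):
        # "/compact now" -> command name is the first token without "/"
        name = text.split()[0][1:]
        return ("command", name)
    return ("chat", text)
```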

Install

Here is a recording showing how to install and get started quickly:

Getting Started with Open Agent Tools Agentic Coder - Install, Chats and Tool-Calling with Qwen36 27B and FunctionGemma using vLLM

git clone https://github.com/district-solutions/open-agent-tools-coder oats
cd oats
pip install -e .
# litellm installs an older aiohttp version; upgrade it and ignore the warning
pip install --upgrade aiohttp

Setup

Local Tool Calling Alignment and Prompt Index Validation with RLHF Curation

This section does not require any AI models; it validates that your local Python runtime is ready for matching prompts to local tools. You can modify the prompt index file locally to map functions to different prompts. Let us know what you find!

We do this before deploying AI models so we can confirm the prompt-to-tool mapping works before adding the complexity of multiple self-hosted local AI models.

Confirm your local repo is set up to use the included repo_uses prompt index file. This command lets you quickly check which tools will show up for any prompt before burning tokens on AI messages. Use it to validate that a prompt maps to the expected tool before chatting with an AI model:

get-tools -p 'get third friday'

The output should be a valid JSON dictionary containing minimal choices for a small agentic AI model to process locally with local source-code tool-calling:

{
  "status": true,
  "actions": [
    "get_third_friday"
  ],
  "prompts": [
    "generate third Friday dates for the next 6 months in YYYYMMDD format"
  ],
  "src_files": [
    "coder/date.py"
  ],
  "partial_actions": [],
  "partial_prompts": [],
  "partial_src_files": [],
  "index_files": [
    "/opt/ds/coder/.ai/AGENT.repo_uses.python.tools.json"
  ],
  "tool_data": {
    "query": "get third friday",
    "model": "bm25",
    "reranked": false,
    "best_files": [
      "coder/date.py"
    ],
    "best_uses": {
      "coder/date.py": {
        "utc": "utc datetime",
        "get_utc_str": "get utc",
        "get_utc_datetime": "get the current timezone-aware UTC datetime",
        "get_naive_datetime": "get the current timezone-naive datetime from UTC",
        "get_third_friday_dates": "generate third Friday dates for the next 6 months in YYYYMMDD format",
        "run_date_tool": "run the date module to print third Friday dates for the next 6 months"
      }
    },
    "results": [
      {
        "file": "coder/date.py",
        "func": "get_third_friday_dates",
        "description": "generate third Friday dates for the next 6 months in YYYYMMDD format",
        "score": 1.0,
        "retrieval_score": 1.0
      }
    ]
  },
  "version": "9"
}
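A downstream agent or script can consume this output directly. The snippet below is an illustrative consumer of the JSON format shown above (using a trimmed copy of the output); it picks the highest-scoring entry from tool_data.results:

```python
import json

# Trimmed copy of the get-tools output shown above.
raw = """
{"status": true,
 "actions": ["get_third_friday"],
 "src_files": ["coder/date.py"],
 "tool_data": {"results": [
   {"file": "coder/date.py", "func": "get_third_friday_dates",
    "description": "generate third Friday dates for the next 6 months in YYYYMMDD format",
    "score": 1.0, "retrieval_score": 1.0}]}}
"""

data = json.loads(raw)
# Pick the highest-scoring result for the agent to call.
best = max(data["tool_data"]["results"], key=lambda r: r["score"])
print(f"call {best['func']} in {best['file']} (score {best['score']})")
```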

Start vLLM Chat and Tool Calling Models

cd stack

Deploy vLLM with Qwen36 27B or the Qwen36 35B model

We only need one of these models loaded on an NVIDIA 5090 or a Blackwell RTX 6000 to run completely locally:

  • Deploying the Qwen36 27B with vLLM:

./restart-vllm-qwen36-27b.sh

  • Deploying the Qwen36 35B with vLLM requires >35 GB VRAM:

./restart-vllm-qwen36-35b.sh

Deploy vLLM with FunctionGemma 270m Instruct

  • Download the model:

git clone https://huggingface.co/google/functiongemma-270m-it stack/models/hf/google/functiongemma-270m-it

  • Now that the model is ready, deployment requires ~6 GB RAM/VRAM:

./restart-tool-functiongemma-1.sh

Local Model Setup with the coder.json

To use local models from any directory on disk, set the CODER_CONFIG_FILE environment variable to the path of the coder.json config file:

# you may want to add this to your ~/.bashrc to always load the absolute path on disk:
# export CODER_CONFIG_FILE=PATH/coder.json
# for testing from the repo's root directory:
export CODER_CONFIG_FILE=$(pwd)/oats/config/coder.json
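Before launching the coder, it can help to sanity-check that the variable points at readable JSON. The checker below is an illustrative helper (not part of oats):

```python
import json
import os

# Illustrative sanity check: confirm CODER_CONFIG_FILE points at a
# readable, valid JSON file before starting the coder.
def check_coder_config(env) -> str:
    path = env.get("CODER_CONFIG_FILE")
    if not path:
        return "CODER_CONFIG_FILE is not set"
    if not os.path.isfile(path):
        return f"config file not found: {path}"
    try:
        with open(path) as f:
            json.load(f)
    except json.JSONDecodeError as exc:
        return f"config is not valid JSON: {exc}"
    return "ok"

print(check_coder_config(os.environ))
```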

Optional - Setup the coder.json File for vLLM or Additional Local Models

We usually keep the credentials outside the repo, for example:

# from the repo root dir
cp ./oats/config/coder.json /opt/oats-coder.json

Then we edit the /opt/oats-coder.json file and set the env variable in our ~/.bashrc:

export CODER_CONFIG_FILE=/opt/oats-coder.json

Chatting with AI

Start the OATs Coder

$ ff
Let's build together!! 🤗 🤖 🔨 🔧
Starting up oats coder please wait...
If you hit an error, please open an issue so we can help fix it:
github.com/district-solutions/open-agent-tools-coder/issues

  coder v1.2.0  ·  chat:latest  ·  vllm-small
  /opt/ds/oats
  ──────────────────────────────────────────────────
  Enter to send · Alt+Enter for newline · /help for commands

  mode: edit — edit — supervised, ask before writes. Switch with /edit /auto /plan /caveman

❯

Verify Chat Works

❯ say hello
  ──────────────────────────────────────────────────
2026-05-12 17:01:23 - sprc - INFO - loading_core_tools: 15
2026-05-12 17:01:23 - sprc - INFO - using_core_tools: {'tool_search', 'websearch', 'grep', 'todowrite', 'read', 'todoread', 'edit',
'memory_write', 'bash', 'webfetch', 'glob', 'multiedit', 'memory_read', 'write', 'question'} model_id: vllm-small@hosted_vllm/chat:latest

Hello! How can I help you today?

  2.0s

Troubleshooting

vllm Unauthorized Error

If you see this error, then you need to ensure your CODER_CONFIG_FILE environment variable is set to the correct file:

LLM error: litellm.AuthenticationError: AuthenticationError: Hosted_vllmException - {"error":"Unauthorized"}

Confirm the providers show up as expected:

$ pv
vllm-small (vllm-small): configured
t1 (t1): configured
ow (ow): not configured
Anthropic (anthropic): not configured
OpenAI (openai): not configured
Azure OpenAI (azure): not configured
Google AI (google): not configured
Mistral (mistral): not configured
Groq (groq): not configured
OpenRouter (openrouter): not configured
Together AI (together): not configured
Cohere (cohere): not configured
Ollama (ollama): configured
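If you script health checks, the pv listing is easy to parse. The parser below is illustrative (not part of oats) and assumes the Name (key): status line format shown above:

```python
# Illustrative parser for the `pv` provider listing: map each
# provider key to a boolean "configured" flag.
pv_output = """vllm-small (vllm-small): configured
Anthropic (anthropic): not configured
Ollama (ollama): configured"""

def parse_providers(text: str) -> dict[str, bool]:
    status = {}
    for line in text.splitlines():
        # "Anthropic (anthropic): not configured" -> key "anthropic"
        name, _, state = line.rpartition(": ")
        key = name[name.rfind("(") + 1 : name.rfind(")")]
        status[key] = state == "configured"
    return status
```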
