Skip to main content

A small, powerful CLI coding agent for open AI models

Project description

Swival Logo

Swival

A coding agent for any model. Documentation

Swival is a CLI coding agent built to be practical, reliable, and easy to use. It works with frontier models, but its main goal is to be as reliable as possible with smaller models, including local ones. It is designed from the ground up to handle tight context windows and limited resources without falling apart.

It connects to LM Studio, HuggingFace Inference API, OpenRouter, Google Gemini, ChatGPT Plus/Pro, or any OpenAI-compatible server (ollama, llama.cpp, mlx_lm.server, vLLM, etc.), sends your task, and runs an autonomous tool loop until it produces an answer. With LM Studio it auto-discovers your loaded model, so there's nothing to configure. A few thousand lines of Python, no framework.

Quickstart

LM Studio

  1. Install LM Studio and load a model with tool-calling support. Recommended first model: qwen3-coder-next (great quality/speed tradeoff on local hardware). Crank the context size as high as your hardware allows.
  2. Start the LM Studio server.
  3. Install Swival (requires Python 3.13+):
uv tool install swival
  1. Run:
swival "Refactor the error handling in src/api.py"

That's it. Swival finds the model, connects, and goes to work.

HuggingFace

export HF_TOKEN=hf_...
uv tool install swival
swival "Refactor the error handling in src/api.py" \
    --provider huggingface --model zai-org/GLM-5

You can also point it at a dedicated endpoint with --base-url and --api-key.

OpenRouter

export OPENROUTER_API_KEY=sk_or_...
uv tool install swival
swival "Refactor the error handling in src/api.py" \
    --provider openrouter --model z-ai/glm-5

ChatGPT Plus/Pro

Use OpenAI models through your existing ChatGPT Plus or Pro subscription -- no API key needed.

uv tool install swival
swival "Refactor the error handling in src/api.py" \
    --provider chatgpt --model gpt-5.4

On first use, a device code and URL are printed to your terminal. Open the URL, enter the code, and authorize with your ChatGPT account. Tokens are cached locally for subsequent runs.

Generic (OpenAI-compatible)

swival "Refactor the error handling in src/api.py" \
    --provider generic \
    --base-url http://127.0.0.1:8080 \
    --model my-model

Works with ollama, llama.cpp, mlx_lm.server, vLLM, and anything else that speaks the OpenAI chat completions protocol. No API key required for local servers.

Interactive sessions

swival --repl

The REPL carries conversation history across questions, which makes it good for exploratory work and longer tasks.

Updates and uninstall

uv tool upgrade swival    # update
uv tool uninstall swival  # remove

What makes it different

Reliable with small models. Context management is one of Swival's strengths. It keeps things clean and focused, which is especially important when you are working with models that have tight context windows. Graduated compaction, persistent thinking notes, and a todo checklist all survive context resets, so the agent doesn't lose track of multi-step plans even under pressure.

Your models, your way. Works with LM Studio, HuggingFace Inference API, OpenRouter, Google Gemini, ChatGPT Plus/Pro, and any OpenAI-compatible server. With LM Studio, it auto-discovers whatever model you have loaded. With HuggingFace or OpenRouter, point it at any supported model. With Google Gemini, use Gemini models through Google's native API. With ChatGPT Plus/Pro, authenticate through your browser and use OpenAI's models through your existing subscription. With the generic provider, connect to ollama, llama.cpp, mlx_lm.server, vLLM, or any other compatible server. You pick the model and the infrastructure.

Review loop and LLM-as-a-judge. Swival has a configurable review loop that can run external reviewer scripts or use a built-in LLM-as-judge to automatically evaluate and retry agent output. Good for quality assurance on tasks that matter.

Built for benchmarking. Pass --report report.json and Swival writes a machine-readable evaluation report with per-call LLM timing, tool success/failure counts, context compaction events, and guardrail interventions. Useful for comparing models, settings, skills, and MCP servers systematically on real coding tasks.

Skills, MCP, and A2A. Extend the agent with SKILL.md-based skills for reusable workflows, connect to external tools via the Model Context Protocol, and talk to remote agents via the Agent-to-Agent (A2A) protocol.

Small enough to read and hack. A few thousand lines of Python across a handful of files, with no framework underneath. Read the whole agent in an afternoon. If something doesn't work the way you want, change it.

CLI-native. stdout is exclusively the final answer. All diagnostics go to stderr. Pipe Swival's output straight into another command or a file.

Documentation

Full documentation is available at swival.dev.

  • Getting Started -- installation, first run, what happens under the hood
  • Usage -- one-shot mode, REPL mode, CLI flags, piping, exit codes
  • Tools -- what the agent can do: file ops, search, editing, web fetching, thinking, task tracking, command execution
  • Safety and Sandboxing -- path resolution, symlink protection, command whitelisting, YOLO mode
  • Skills -- creating and using SKILL.md-based agent skills
  • Customization -- config files, project instructions, system prompt overrides, tuning parameters
  • Context Management -- compaction, snapshots, knowledge survival, and how Swival handles tight context windows
  • Providers -- LM Studio, HuggingFace, OpenRouter, Google Gemini, ChatGPT Plus/Pro, and generic OpenAI-compatible server configuration
  • MCP -- connecting external tool servers via the Model Context Protocol
  • A2A -- connecting to remote agents via the Agent-to-Agent protocol
  • Reports -- JSON reports for benchmarking and evaluation
  • Reviews -- external reviewer scripts for automated QA and LLM-as-judge evaluation
  • Using Swival with AgentFS -- copy-on-write filesystem sandboxing for safe agent runs

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

swival-0.1.25.tar.gz (1.9 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

swival-0.1.25-py3-none-any.whl (141.9 kB view details)

Uploaded Python 3

File details

Details for the file swival-0.1.25.tar.gz.

File metadata

  • Download URL: swival-0.1.25.tar.gz
  • Upload date:
  • Size: 1.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for swival-0.1.25.tar.gz
Algorithm Hash digest
SHA256 051de2c26a821c9b1bd8f786f0fd1b75f656a4a8b57367395a03eb26b3f48c11
MD5 4857cf12091a5acc3bf11b003ce676d2
BLAKE2b-256 d2ad8d2af81054c957483c934be2bbab12cce51932d4c1eda536cbf09b17f6c5

See more details on using hashes here.

File details

Details for the file swival-0.1.25-py3-none-any.whl.

File metadata

  • Download URL: swival-0.1.25-py3-none-any.whl
  • Upload date:
  • Size: 141.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for swival-0.1.25-py3-none-any.whl
Algorithm Hash digest
SHA256 fa93b509c508fb48668530159e9b4e703638939a84d2d712d220926b8c329de0
MD5 f3b876046d9373f88c454fc0b59c41a9
BLAKE2b-256 3827df40e64b840f4822d5dfc0ce1aeece89e5f3500215edb909267db7406da7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page