Multi-LLM Council Backend for AGENT-K - Three-stage consensus with GPT, Gemini, and Claude

These details have not been verified by PyPI

Project links

Project description

AGENT-K

Multi-Agent Claude Code Terminal Suite

╭─────────────────────────────────────────────────────────────────────────────────╮
│                              AGENT-K v1.0                                       │
╰─────────────────────────────────────────────────────────────────────────────────╯

Transform your terminal into a team of specialized AI agents. AGENT-K orchestrates multiple Claude instances working in parallel on your software development and ML research tasks.

What is AGENT-K?

AGENT-K extends Claude Code with multi-agent orchestration. Instead of a single AI assistant, you get a coordinated team of specialists - each with domain expertise, working together on complex tasks.

╭─ You ────────────────────────────────────────────────────────────────────────────╮
│ Build a secure REST API with user authentication                                 │
╰──────────────────────────────────────────────────────────────────────────────────╯

╭─ Orchestrator ───────────────────────────────────────────────────────────────────╮
│ Breaking down task...                                                            │
│   → Engineer: Implement REST endpoints and JWT authentication                    │
│   → Tester: Write API integration tests                                          │
│   → Security: Review for OWASP vulnerabilities                                   │
│   → Scout: Find latest best practices for JWT in 2025                            │
╰──────────────────────────────────────────────────────────────────────────────────╯

[●] Orchestrator   Coordinating
[◐] Engineer       Writing src/api/auth.py...
[◐] Scout          Searching for JWT best practices...
[ ] Tester         Waiting for implementation
[ ] Security       Queued for review

Why AGENT-K?

The Problem

Claude is brilliant, but complex tasks often require:

Multiple perspectives (implementation, testing, security)
Parallel work (why write tests sequentially after code?)
Up-to-date information (Claude's training data becomes stale)
Specialized focus (security reviews need different prompts than coding)

The Solution

AGENT-K spawns specialized Claude agents that:

Work in parallel on different aspects of your task
Coordinate automatically through a central orchestrator
Stay current with a dedicated Scout agent for real-time research
Follow best practices with domain-specific system prompts

Features

Feature	Description
Multi-Agent Orchestration	Coordinate 5-6 specialized agents working in parallel
Two Modes	Software Development (default) and ML Research & Training
Interactive Chat	Familiar interface like `claude` but with a whole team
Visual Mode	tmux-based multi-pane view of all agents working
Scout Agent	Real-time web/GitHub/paper search to stay current
Date Awareness	Agents know when to verify potentially outdated info
Focus Mode	Talk directly to any specialist agent
File-Based IPC	Agents coordinate through structured JSON messages

Agent Teams

Development Mode (Default)

Agent	Specialty
Orchestrator	Task decomposition, coordination, result aggregation
Engineer	Code implementation, debugging, refactoring
Tester	Unit/integration tests, coverage analysis
Security	OWASP vulnerability review, secrets detection
Scout	Real-time search for current best practices

ML Mode (`--mode ml`)

Agent	Specialty
Orchestrator	ML project lifecycle management
Researcher	Literature review, SOTA analysis, paper summaries
ML Engineer	Model implementation, training loops, optimization
Data Engineer	Data pipelines, preprocessing, augmentation
Evaluator	Metrics, benchmarking, experiment tracking
Scout	arXiv, HuggingFace, Papers With Code search

Installation

Homebrew (macOS/Linux)

brew tap de5truct0/agentk
brew install agentk8

npm

npm install -g agentk8

pip

pip install agentk8

Quick Install Script

curl -sSL https://raw.githubusercontent.com/de5truct0/agentk/main/install.sh | bash

From Source

git clone https://github.com/de5truct0/agentk.git
cd agentk
make install

Note: Package name is agentk8 on all registries. The installed command is agentk.

Requirements

jq - JSON processing (brew install jq)
claude - Claude Code CLI (Install here)
tmux - Optional, for visual mode (brew install tmux)

Quick Start

# Start interactive session
agentk

# Start ML research mode
agentk --mode ml

# Start with visual panels (requires tmux)
agentk --visual

# One-shot task
agentk -c "Refactor the user service to use async/await"

Usage

Interactive Session

$ agentk

╭─────────────────────────────────────────────────╮
│  AGENT-K v1.0                                   │
╰─────────────────────────────────────────────────╯
Mode: Software Development Mode

Type your request or /help for commands.

╭─ You ─────────────────────────────────────────────────
│

Session Commands

Command	Description
`/status`	Show all agent states and current tasks
`/logs <agent>`	View agent output
`/kill <agent\|all>`	Stop agent(s)
`/focus <agent>`	Talk directly to one agent
`/unfocus`	Return to orchestrator
`/visual`	Toggle tmux panel view
`/clear`	Clear screen
`/help`	Show all commands
`/exit`	End session

Scout Commands (Both Modes)

Command	Description
`/search <query>`	Web search for latest info
`/github <query>`	Search GitHub repos and code
`/papers <topic>`	Search arXiv/Semantic Scholar
`/libs <task>`	Find best libraries for task
`/sota <topic>`	Get state-of-the-art approaches

ML-Specific Commands

Command	Description
`/experiment <name>`	Start a new experiment
`/metrics`	Show current training metrics
`/tensorboard`	Open TensorBoard
`/checkpoint`	Save model state
`/compare <e1> <e2>`	Compare experiments
`/huggingface <query>`	Search HuggingFace Hub

Visual Mode

Launch with --visual to see all agents in a tmux layout:

┌───────────────────┬───────────────────┬───────────────────┐
│   ORCHESTRATOR    │     ENGINEER      │      TESTER       │
│                   │                   │                   │
│ Breaking down     │ Implementing      │ Waiting for       │
│ task into         │ auth module...    │ implementation... │
│ subtasks...       │                   │                   │
├───────────────────┼───────────────────┼───────────────────┤
│     SECURITY      │      SCOUT        │      [MAIN]       │
│                   │                   │                   │
│ Queued for        │ Searching JWT     │ You: _            │
│ review            │ best practices... │                   │
│                   │                   │                   │
└───────────────────┴───────────────────┴───────────────────┘

How It Works

                          ┌─────────────┐
                          │    User     │
                          └──────┬──────┘
                                 │
                                 ▼
                    ┌────────────────────────┐
                    │     Orchestrator       │
                    │  (task decomposition)  │
                    └───────────┬────────────┘
                                │
           ┌────────────────────┼────────────────────┐
           │                    │                    │
           ▼                    ▼                    ▼
   ┌───────────────┐   ┌───────────────┐   ┌───────────────┐
   │   Engineer    │   │    Tester     │   │   Security    │
   │ (implements)  │   │  (validates)  │   │  (reviews)    │
   └───────┬───────┘   └───────┬───────┘   └───────┬───────┘
           │                    │                    │
           └────────────────────┴────────────────────┘
                                │
                                ▼
                    ┌────────────────────────┐
                    │    File-Based IPC      │
                    │ workspace/tasks/*.json │
                    └────────────────────────┘

You enter a request in the interactive session
Orchestrator analyzes and breaks it into subtasks
Specialist agents spawn as parallel Claude subprocesses
Agents work on your actual project files
Results aggregate back through the orchestrator
You see the combined output with full context

Configuration

Create ~/.agentk/config.sh:

# Custom model (default: claude-3-sonnet)
export AGENTK_MODEL="claude-3-opus-20240229"

# Log level: debug, info, warn, error
export LOG_LEVEL="info"

# Custom workspace location
export AGENTK_WORKSPACE="/custom/path/workspace"

# Parallel agent limit (default: 4)
export AGENTK_MAX_PARALLEL=6

Project Structure

agentk/
├── agentk                # Main CLI entry point
├── lib/
│   ├── core.sh          # Core utilities, logging, date context
│   ├── ui.sh            # Terminal UI, spinners, chat boundaries
│   ├── ipc.sh           # Inter-process communication
│   ├── spawn.sh         # Agent subprocess management
│   └── visual.sh        # tmux integration
├── modes/
│   ├── shared/
│   │   └── scout.md     # Scout agent system prompt
│   ├── dev/             # Development mode prompts
│   │   ├── orchestrator.md
│   │   ├── engineer.md
│   │   ├── tester.md
│   │   └── security.md
│   └── ml/              # ML mode prompts
│       ├── orchestrator.md
│       ├── researcher.md
│       ├── ml-engineer.md
│       ├── data-engineer.md
│       └── evaluator.md
└── workspace/           # Runtime data (gitignored)
    ├── tasks/           # Task queue (JSON)
    ├── results/         # Agent outputs
    ├── logs/            # Agent logs
    └── experiments/     # ML experiment tracking

Known Limitations

Limitation	Workaround
File conflicts when agents edit same file	Use `/focus` to serialize work on critical files
Each agent = separate API call (cost)	Use orchestrator's judgment on when to parallelize
Agents don't share real-time context	Orchestrator maintains shared state in workspace
Rate limiting with many parallel agents	`AGENTK_MAX_PARALLEL` limits concurrent spawns

Roadmap

Web UI dashboard
Custom agent definitions
Persistent conversation history
Cost tracking per agent
Team collaboration mode
Plugin system for custom tools

Contributing

Contributions welcome!

Fork the repository
Create a feature branch (git checkout -b feature/amazing)
Make your changes
Run tests (make test)
Submit a pull request

License

MIT License - see LICENSE for details.

Acknowledgments

Inspired by Boris Cherny's parallel Claude workflow
Built for the Claude Code community
Powered by Anthropic's Claude

AGENT-K - Because one Claude is good, but a team of Claudes is better.

GitHub | npm | PyPI

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

2.3.8

Jan 16, 2026

2.3.7

Jan 16, 2026

2.3.6

Jan 15, 2026

2.3.5

Jan 15, 2026

This version

2.3.4

Jan 15, 2026

2.3.3

Jan 15, 2026

2.3.2

Jan 15, 2026

2.3.1

Jan 15, 2026

1.0.5

Jan 12, 2026

1.0.4

Jan 12, 2026

1.0.3

Jan 12, 2026

1.0.2

Jan 12, 2026

1.0.1

Jan 12, 2026

1.0.0

Jan 12, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agentk8-2.3.4.tar.gz (24.5 kB view details)

Uploaded Jan 15, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

agentk8-2.3.4-py3-none-any.whl (22.7 kB view details)

Uploaded Jan 15, 2026 Python 3

File details

Details for the file agentk8-2.3.4.tar.gz.

File metadata

Download URL: agentk8-2.3.4.tar.gz
Upload date: Jan 15, 2026
Size: 24.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for agentk8-2.3.4.tar.gz
Algorithm	Hash digest
SHA256	`b32a0dd33c0d8810ca8f07b5ec3af1a0f24de96da16ff685fec3e57a98de51be`
MD5	`9de0cb349686ef343ec201c8d3084ba1`
BLAKE2b-256	`20772f4c88558d19a4b331e59a85210c0e97e478a97b842d3426294f9dfe9a32`

See more details on using hashes here.

File details

Details for the file agentk8-2.3.4-py3-none-any.whl.

File metadata

Download URL: agentk8-2.3.4-py3-none-any.whl
Upload date: Jan 15, 2026
Size: 22.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for agentk8-2.3.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d47dfa9820bb0b1768a99cdb699ad6f5b8b0eaa704f2fb48cf57432973e38466`
MD5	`192e890a0d73f52943cc8e25aac626d2`
BLAKE2b-256	`433db6c222fe6507f0bdb1456bed47a0f566bca18064d70f028c20ff40f0b562`

See more details on using hashes here.

agentk8 2.3.4

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

AGENT-K

What is AGENT-K?

Why AGENT-K?

The Problem

The Solution

Features

Agent Teams

Development Mode (Default)

ML Mode (--mode ml)

Installation

Homebrew (macOS/Linux)

npm

pip

Quick Install Script

From Source

Requirements

Quick Start

Usage

Interactive Session

Session Commands

Scout Commands (Both Modes)

ML-Specific Commands

Visual Mode

How It Works

Configuration

Project Structure

Known Limitations

Roadmap

Contributing

License

Acknowledgments

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

ML Mode (`--mode ml`)