A local autonomous coding agent CLI powered by Ollama.

🧠 DevAgent

A Lightweight Local Open-Source Miniature of Claude Code CLI

A production-grade local coding agent that finds bugs, writes patches, reviews its own code, and validates with tests — all offline, all local, zero API costs.

Quick Start • Architecture • Benchmarks • Roadmap • Contributing

🛡️ Why DevAgent?

DevAgent is built on a "Safety-First" architecture. Unlike a chatbot that only suggests code, it is a local autonomous agent that edits a project inside an observed environment: changes can be confined to a sandbox, a snapshot is taken before each run, and every step is logged.

📊 Empirical Validation

We don't hide our limitations. DevAgent v3.2.3 has been stress-tested against real-world, "messy" repositories, and the results are published in a transparent Benchmark Report.

Capability	Other Agents	DevAgent
Safety Isolation	❌	✅ Strict Sandbox Mode
Recovery	❌	✅ Git-native + Snapshot Rollback
Validation	❌	✅ Empirical Stress Tests
Transparency	❌	✅ Visible Failure Taxonomy
Privacy	❌	✅ 100% Local (Ollama)
Costs	💸	✅ Zero API Costs

Philosophy: Safety > Intelligence. Observability > Reasoning. Retrieval > Huge Context. Reliability > Hype.

✨ Features

🔁 ReAct Loop — Thought → Action → Observation → Fix → Review → Test cycle

🧠 Planner — LLM generates an action plan before coding

🔍 Semantic Search — FAISS + sentence-transformers code retrieval

🔎 Code Search — ripgrep-powered with cross-platform fallback

📝 Self-Review — LLM critiques its own fixes, revises until approved

🩹 Patch Engine — Line-level unified diffs instead of full file rewrites

🧪 Test-Driven — Runs pytest after every fix, retries on failure

🏖️ Sandbox Mode — Agent works in an isolated copy, applies changes only on success

📊 Benchmarks — 5 built-in benchmark suites with automated evaluation

📈 Metrics — Latency, token estimates, retries, and performance tracking

📋 Full Audit Trail — Every step logged to logs/run.json

🔒 100% Offline — Runs on Ollama with small models (2-4 GB)

⚡ Low Resource — Works on RTX 3050 (4 GB VRAM) / 16 GB RAM
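
The audit trail and metrics features above can be pictured with a small sketch. It is not DevAgent's actual logger: the `RunLogger` class, its method names, and the record fields are hypothetical, and only the `logs/run.json` destination comes from this README.

```python
import json
import time
from pathlib import Path


class RunLogger:
    """Hypothetical logger: collects step records and writes one JSON file."""

    def __init__(self, path):
        self.path = Path(path)
        self.steps = []
        self.started = time.perf_counter()

    def log(self, kind, detail):
        # Each record notes what happened and when, relative to run start.
        self.steps.append({
            "step": len(self.steps) + 1,
            "kind": kind,
            "detail": detail,
            "elapsed_s": round(time.perf_counter() - self.started, 3),
        })

    def save(self, status):
        # Create the logs/ directory on first use, then write the whole run.
        self.path.parent.mkdir(parents=True, exist_ok=True)
        payload = {"status": status, "steps": self.steps}
        self.path.write_text(json.dumps(payload, indent=2), encoding="utf-8")
        return payload
```

Calling `log("tool", "search_code(divide)")` during a run and `save("success")` at the end would leave a file that can be replayed step by step.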

🚀 Quick Start

Prerequisites

Python 3.11+
Ollama installed and running

Install & Setup

# 1. Clone
git clone https://github.com/VedantJadhav701/Developer-Code-Intelligence-Agent.git
cd Developer-Code-Intelligence-Agent

# 2. Install from PyPI
pip install devagent-cli
# Or install from the cloned source instead: pip install -e .

# 3. Verify System (CRITICAL)
# This checks your Python environment, Ollama connection, and dependencies
devagent doctor

# 4. Pull the model
ollama pull qwen2.5-coder:3b

# 5. Run!
devagent run --task "Fix the divide-by-zero bug" --root ./demo_project

CLI Subcommands

Command	Description
`devagent run`	Execute a coding task on a project
`devagent benchmark`	Run the automated benchmark suite
`devagent doctor`	Check system health and dependencies
`devagent models`	List available Ollama models
`devagent version`	Show current version

🛡️ Reliability & Safety

DevAgent is built for production-grade reliability:

Isolated Sandbox: Agent works in sandbox_workspace/, keeping your source clean until success.
Auto-Snapshot: Creates a safety restore point before every execution.
Instant Rollback: Revert agent changes with devagent rollback.
Traceability: Every thought and tool call is logged to logs/run.json.
Environment Awareness: Detects and uses your project's Python environment automatically.
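
A minimal sketch of the "work in a copy, apply only on success" idea, assuming a plain directory copy. The function `run_in_sandbox` and its `change` and `check` callbacks are hypothetical names for illustration; DevAgent's own sandbox manager may work differently.

```python
import shutil
import tempfile
from pathlib import Path


def run_in_sandbox(project_root, change, check):
    """Apply `change` to a copy of the project; sync back only if `check` passes."""
    project_root = Path(project_root)
    with tempfile.TemporaryDirectory() as tmp:
        sandbox = Path(tmp) / "sandbox_workspace"
        shutil.copytree(project_root, sandbox)
        change(sandbox)            # the agent edits files in the copy
        if not check(sandbox):     # e.g. run the test suite inside the copy
            return False           # the original project is left untouched
        # Success: copy the sandbox contents over the original files.
        shutil.copytree(sandbox, project_root, dirs_exist_ok=True)
        return True
```

This sketch only overwrites and adds files; it does not delete files that were removed in the copy, which a real implementation would need to handle.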

🕹️ Interactive Mode

Run with --interactive (or -i) to review colorized diffs before they are applied to your project.

devagent run --task "Fix bug" --interactive
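
To illustrate what a colorized diff preview involves, here is a sketch built on the standard library's `difflib`. The function name `colorized_diff` and the color choices are assumptions, not DevAgent's implementation.

```python
import difflib

GREEN, RED, CYAN, RESET = "\033[32m", "\033[31m", "\033[36m", "\033[0m"


def colorized_diff(old, new, filename):
    """Return a unified diff of two strings with ANSI colors per line type."""
    lines = difflib.unified_diff(
        old.splitlines(), new.splitlines(),
        fromfile=f"a/{filename}", tofile=f"b/{filename}", lineterm="",
    )
    out = []
    for i, line in enumerate(lines):
        if i < 2:
            out.append(line)                  # the two file header lines
        elif line.startswith("+"):
            out.append(GREEN + line + RESET)  # added line
        elif line.startswith("-"):
            out.append(RED + line + RESET)    # removed line
        elif line.startswith("@@"):
            out.append(CYAN + line + RESET)   # hunk header
        else:
            out.append(line)                  # unchanged context
    return "\n".join(out)
```

Printing the result in a terminal shows additions in green and removals in red; identical inputs produce an empty string, so there is nothing to confirm.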

🏗️ Architecture

graph TD
    CLI[DevAgent CLI] --> Orchestrator[ReAct Orchestrator]
    Orchestrator --> Safety[Safety Manager: Snapshots]
    Orchestrator --> Memory[Working Memory]
    Orchestrator --> Retrieval[Semantic Retrieval FAISS]
    Orchestrator --> Tools[Tool Suite: pytest, ripgrep, git]
    Orchestrator --> Reviewer[Self-Review Loop]
    Reviewer --> Patch[Surgical Patch Engine]
    Patch --> Sandbox[Sandbox Environment]

No API keys. No sign-ups. No cloud.

Optional: Enable Semantic Search

pip install faiss-cpu sentence-transformers

Without these, DevAgent falls back to keyword search — still fully functional.
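
The fallback described above can be sketched as follows. Both function names are hypothetical, and the word-overlap scoring is a stand-in chosen for illustration, not necessarily how DevAgent ranks keyword matches.

```python
import importlib.util
import re


def semantic_available():
    """True when both optional packages can be found on the import path."""
    return all(
        importlib.util.find_spec(name) is not None
        for name in ("faiss", "sentence_transformers")
    )


def keyword_search(chunks, query, top_k=3):
    """Rank text chunks by how many distinct query words they contain."""
    words = set(re.findall(r"\w+", query.lower()))
    scored = []
    for chunk in chunks:
        tokens = set(re.findall(r"\w+", chunk.lower()))
        score = len(words & tokens)
        if score:
            scored.append((score, chunk))
    # Highest overlap first; ties keep their original order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [chunk for _, chunk in scored[:top_k]]
```

A caller would check `semantic_available()` once at startup and route queries to the embedding index or to `keyword_search` accordingly.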

🎬 Demo

 ____              _                    _
|  _ \  _____   __/ \   __ _  ___ _ __ | |_
| | | |/ _ \ \ / / _ \ / _` |/ _ \ '_ \| __|
| |_| |  __/\ V / ___ \ (_| |  __/ | | | |_
|____/ \___| \_/_/   \_\__, |\___|_| |_|\__|
                       |___/

+==========================================================+
|        DEVELOPER CODE INTELLIGENCE AGENT                 |
|        Model: qwen2.5-coder:3b                          |
|        Sandbox: OFF                                      |
+==========================================================+

  [PLAN] LIKELY_FILES: calculator.py
  1. search_code: divide
  2. read_file: calculator.py
  3. run_tests

  ----------------------------------------
  ITERATION 1/5
  ----------------------------------------
  [TOOL] Executing: search_code(divide)
  >> Found: calculator.py:10:def divide(a, b)
  [REVIEW] #1: APPROVED
  >> Tests: 5 passed ✓

  [OK]  AGENT COMPLETED SUCCESSFULLY

  Status:     success
  Steps used: 1/5
  Patches:    1
  Time:       8.2s

🏗️ Architecture

                    ┌─────────────────────────────┐
                    │       CLI (main.py)          │
                    │  --task --root --model       │
                    │  --sandbox --benchmark       │
                    │  --auto-commit --auto-push   │
                    └──────────┬──────────────────┘
                               │
                    ┌──────────▼──────────────────┐
                    │     Planner Layer            │
                    │  Identifies files + strategy │
                    └──────────┬──────────────────┘
                               │
                    ┌──────────▼──────────────────┐
                    │  Retrieval Layer (Memory)    │
                    │  FAISS + Sentence-Transformers│
                    │  Chunk → Embed → Top-K       │
                    └──────────┬──────────────────┘
                               │
                    ┌──────────▼──────────────────┐
                    │     ReAct Agent Loop         │
                    │                              │
                    │  1. THOUGHT   (LLM)          │
                    │  2. ACTION    (Tool)          │
                    │  3. OBSERVATION               │
                    │  4. FIX       (LLM)          │
                    │  5. REVIEW    (LLM)          │
                    │  6. PATCH     (Diff Engine)   │
                    │  7. TEST      (pytest)        │
                    │                              │
                    │  if FAIL → retry              │
                    │  if PASS → done ✓            │
                    └──┬──────────────┬───────────┘
                       │              │
              ┌────────▼──┐    ┌──────▼──────┐
              │   Tools   │    │   Ollama    │
              │           │    │  (Local)    │
              │ • search  │    │             │
              │ • semantic│    │ qwen2.5-    │
              │ • read    │    │  coder:3b   │
              │ • patch   │    │ phi3:mini   │
              │ • pytest  │    │ mistral:7b  │
              │ • flake8  │    │             │
              │ • git_diff│    └─────────────┘
              │ • sandbox │
              └───────────┘
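
The loop in the diagram can be condensed into a short sketch. It collapses the THOUGHT, ACTION, and OBSERVATION steps into a single `propose_fix` callback and uses stand-in callables where the real agent would call the LLM and tools. `AgentState`, `react_loop`, and the callback names are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class AgentState:
    """Hypothetical shared state for one run."""
    task: str
    max_steps: int = 5
    history: list = field(default_factory=list)
    status: str = "running"


def react_loop(state, propose_fix, review, apply_patch, run_tests):
    """Propose, review, patch, and test until tests pass or steps run out."""
    for step in range(1, state.max_steps + 1):
        fix = propose_fix(state)          # THOUGHT + FIX (an LLM call in a real agent)
        verdict = review(fix)             # REVIEW: "APPROVED" or "REVISE"
        state.history.append((step, fix, verdict))
        if verdict != "APPROVED":
            continue                      # ask for a revised fix on the next iteration
        apply_patch(fix)                  # PATCH
        if run_tests():                   # TEST
            state.status = "success"
            return state
    state.status = "failed"
    return state
```

Passing plain functions for each stage keeps the loop testable without a model: a fake `propose_fix` that returns canned patches is enough to exercise the retry path.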

9-Layer Architecture

Layer	Module	Purpose
1. CLI	`main.py`	Argument parsing, mode selection, banner
2. Planner	`app/planner.py`	Task interpretation, file identification
3. Retrieval	`app/memory.py`	FAISS index, semantic chunking, Top-K search
4. Tools	`tools/*`	8 real tools: search, semantic_search, read, write, test, lint, git, sandbox
5. Agent	`app/agent.py`	ReAct orchestration loop
6. Review	`app/reviewer.py`	Self-critique with APPROVED/REVISE
7. Validation	`tools/test_runner.py`	pytest + flake8 execution feedback
8. Logging	`utils/logger.py`	Structured JSON audit trail
9. Safety	`app/sandbox.py`	Isolated workspace, path validation
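
As an illustration of the Validation layer, the sketch below runs pytest in a subprocess and reports whether it passed. The function `run_pytest` and its return shape are assumptions; the actual `tools/test_runner.py` is not shown in this README.

```python
import subprocess
import sys


def run_pytest(project_root, timeout=120):
    """Run pytest in `project_root`; return (passed, combined output)."""
    proc = subprocess.run(
        [sys.executable, "-m", "pytest", "-q"],
        cwd=project_root, capture_output=True, text=True, timeout=timeout,
    )
    # pytest exits with 0 only when every collected test passed.
    return proc.returncode == 0, proc.stdout + proc.stderr
```

The captured output is what an agent would feed back to the model as the observation when a fix fails.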

📁 Project Structure

Developer-Code-Intelligence-Agent/
├── app/
│   ├── agent.py            # Core ReAct agent engine
│   ├── planner.py          # Task planning layer
│   ├── reviewer.py         # Self-review module
│   ├── llm.py              # Ollama integration
│   ├── memory.py           # FAISS retrieval + working memory
│   ├── patcher.py          # Unified diff patch engine
│   ├── sandbox.py          # Sandbox workspace manager
│   └── state.py            # Shared state dataclass
├── tools/
│   ├── search.py           # Code search (ripgrep + fallbacks)
│   ├── semantic_search.py  # FAISS semantic search
│   ├── file_ops.py         # Safe file read/write
│   ├── test_runner.py      # pytest runner
│   ├── linter.py           # flake8 linter
│   ├── git_tools.py        # Git diff/commit/push
│   └── benchmark_runner.py # Benchmark evaluation
├── utils/
│   ├── logger.py           # Structured JSON logger
│   ├── config.py           # Centralized configuration
│   └── metrics.py          # Performance metrics
├── benchmarks/
│   ├── divide_by_zero/     # Benchmark: zero division guard
│   ├── missing_validation/ # Benchmark: input validation
│   ├── syntax_error/       # Benchmark: syntax fix
│   ├── import_bug/         # Benchmark: wrong import
│   └── edge_case/          # Benchmark: empty list handling
├── demo_project/           # Sample buggy project
├── docs/
│   └── USER_GUIDE.md       # Full usage guide
├── main.py                 # CLI entry point
├── devagent.py             # Global CLI wrapper
├── devagent.bat            # Windows global shortcut
├── requirements.txt
├── CONTRIBUTING.md
├── CHANGELOG.md
├── CODE_OF_CONDUCT.md
├── SECURITY.md
├── LICENSE
└── README.md

💻 CLI Reference

python main.py --task "TASK" --root ./project [OPTIONS]

Flag	Default	Description
`--task`, `-t`	(required)	The coding task for the agent
`--root`, `-r`	`.`	Project root directory
`--model`	`qwen2.5-coder:3b`	Any Ollama model
`--max-steps`, `-m`	`5`	Max ReAct iterations
`--benchmark`	off	Run benchmark suite
`--sandbox`	off	Run in isolated sandbox
`--auto-commit`	off	Git commit on success
`--auto-push`	off	Git push after commit
`--verbose`, `-v`	off	Verbose output
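
For readers who want to see how the flag table maps onto a parser, here is a sketch using `argparse`. It mirrors the flags and defaults listed above but is not the project's `main.py`; treating `--task` as optional when `--benchmark` is given is an assumption based on the `python main.py --benchmark` example.

```python
import argparse


def build_parser():
    parser = argparse.ArgumentParser(prog="main.py")
    parser.add_argument("--task", "-t", help="The coding task for the agent")
    parser.add_argument("--root", "-r", default=".", help="Project root directory")
    parser.add_argument("--model", default="qwen2.5-coder:3b", help="Any Ollama model")
    parser.add_argument("--max-steps", "-m", type=int, default=5, help="Max ReAct iterations")
    # Boolean switches that are off unless given on the command line.
    for flag in ("--benchmark", "--sandbox", "--auto-commit", "--auto-push"):
        parser.add_argument(flag, action="store_true")
    parser.add_argument("--verbose", "-v", action="store_true", help="Verbose output")
    return parser


def parse_args(argv):
    parser = build_parser()
    args = parser.parse_args(argv)
    # Assumption: --task is required for normal runs but not for --benchmark.
    if not args.benchmark and not args.task:
        parser.error("--task is required unless --benchmark is given")
    return args
```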

Examples

# Fix a specific bug
python main.py -t "Fix the TypeError in user_service.py" -r ./backend

# Run in sandbox mode (safe — doesn't touch real files until success)
python main.py -t "Fix divide-by-zero bug" -r ./project --sandbox

# Auto-commit changes on success
python main.py -t "Add input validation" -r ./api --auto-commit

# Use a stronger model
python main.py -t "Refactor auth middleware" -r ./server --model mistral:7b

# Run benchmarks
python main.py --benchmark

# More retries for complex tasks
python main.py -t "Make all tests pass" -r ./project --max-steps 10

📖 Full User Guide: docs/USER_GUIDE.md

📊 Benchmarks

DevAgent includes 5 built-in benchmarks to evaluate agent performance:

Benchmark	Bug Type	Difficulty
`divide_by_zero`	Missing guard clause	Easy
`missing_validation`	No input validation	Medium
`syntax_error`	Broken syntax	Medium
`import_bug`	Wrong module name	Easy
`edge_case`	Empty list crash	Medium

Run benchmarks:

python main.py --benchmark
python main.py --benchmark --model phi3:mini
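
To make the `divide_by_zero` case concrete, the sketch below shows the kind of bug and fix it targets, plus a pass/fail check. The function names and the choice of `ValueError` are hypothetical; the real benchmark files and their tests may differ.

```python
def divide_buggy(a, b):
    return a / b                      # raises ZeroDivisionError when b == 0


def divide_fixed(a, b):
    if b == 0:                        # the missing guard clause
        raise ValueError("Cannot divide by zero")
    return a / b


def passes_benchmark(divide):
    """Hypothetical check: normal division works and zero raises ValueError."""
    if divide(10, 2) != 5:
        return False
    try:
        divide(1, 0)
    except ValueError:
        return True
    except ZeroDivisionError:
        return False
    return False
```

An automated evaluation would run a check like this before and after the agent's patch and count the benchmark as solved only when the result flips from failing to passing.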

🔧 Supported Models

Model	Size	Speed	Quality	Best For
`qwen2.5-coder:3b`	1.9 GB	⚡ Fast	★★★★	Default — best for code
`qwen2.5:3b`	1.9 GB	⚡ Fast	★★★☆	General fallback
`phi3:mini`	2.2 GB	⚡ Fast	★★★☆	Good reasoning
`qwen3:4b`	2.5 GB	⚡ Fast	★★★★	Better understanding
`gemma2:2b`	1.6 GB	⚡⚡	★★☆☆	Ultra-low resource
`mistral:7b`	4.4 GB	🐢	★★★★★	Best quality (8 GB+ RAM)

🗺️ Roadmap

✅ Completed (v2.0)

Core ReAct agent loop
Self-review module
Tool system (8 tools)
Planner layer
Semantic retrieval (FAISS)
Patch engine (unified diffs)
Sandbox mode
Benchmark system (5 suites)
Metrics + structured logging
Git integration
CLI with all flags

🔜 Coming Next

Multi-file support — Agent works across multiple files simultaneously
Language support — JavaScript, TypeScript, Go, Rust
Plugin system — Custom tools via YAML/Python
Watch mode — Auto-fix on test failure (--watch)
VS Code extension — Run agent from your editor
Conversation memory — Learn from past runs
Multi-agent mode — Planner + Coder + Reviewer + Evaluator agents

🤝 Contributing

We welcome contributions! See CONTRIBUTING.md for details.

git checkout -b feature/your-feature
# ... make changes ...
python -m pytest demo_project/ -v
git commit -m "feat: your feature"
git push origin feature/your-feature

Good first issues are tagged in the repository's issue tracker.

📜 License

MIT — use it however you want. See LICENSE.

⭐ Star History

If DevAgent helps you, give it a star! It helps others discover the project.

Built with 🧠 by Vedant Jadhav

A lightweight local open-source miniature of Claude Code CLI.

Release history

Version	Released
3.7.5	May 6, 2026
3.3.1 (this version)	May 6, 2026
3.3.0	May 6, 2026
3.2.3	May 6, 2026
3.2.2	May 6, 2026
3.2.1	May 6, 2026

Download files

Source Distribution

devagent_cli-3.3.1.tar.gz (45.3 kB)

Uploaded May 6, 2026 Source

Built Distribution

devagent_cli-3.3.1-py3-none-any.whl (49.2 kB)

Uploaded May 6, 2026 Python 3

File details

Details for the file devagent_cli-3.3.1.tar.gz.

File metadata

Download URL: devagent_cli-3.3.1.tar.gz
Upload date: May 6, 2026
Size: 45.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for devagent_cli-3.3.1.tar.gz
Algorithm	Hash digest
SHA256	`8de46c4203554c99ad217aefe3f74d1cb067e55f5833c345eab37f5827c8cd0f`
MD5	`5ab1974ebc3b27bdce5580d60576beee`
BLAKE2b-256	`8133aeda10d81473e9ab7a3622de5fb900755f4cb8ac315031dbd263313aeb33`

Provenance

The following attestation bundles were made for devagent_cli-3.3.1.tar.gz:

Publisher: publish.yml on VedantJadhav701/Developer-Code-Intelligence-Agent

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: devagent_cli-3.3.1.tar.gz
- Subject digest: 8de46c4203554c99ad217aefe3f74d1cb067e55f5833c345eab37f5827c8cd0f
- Sigstore transparency entry: 1452673878
- Sigstore integration time: May 6, 2026
Source repository:
- Permalink: VedantJadhav701/Developer-Code-Intelligence-Agent@3b903b0eea44baa7915c115180a510b71d2daf7f
- Branch / Tag: refs/tags/v3.3.1
- Owner: https://github.com/VedantJadhav701
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@3b903b0eea44baa7915c115180a510b71d2daf7f
- Trigger Event: push

File details

Details for the file devagent_cli-3.3.1-py3-none-any.whl.

File metadata

Download URL: devagent_cli-3.3.1-py3-none-any.whl
Upload date: May 6, 2026
Size: 49.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for devagent_cli-3.3.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3eb13b61f1c33f87cd99791c3ce66236b28b3ff61a0d1e77128c9330c2ff0912`
MD5	`4ba789c36e84f4c37bc0b46ed6e755b5`
BLAKE2b-256	`7ddf12f42630ec1926451f3b7bc02d5b505b14f02118f16ff66fc83cd674826a`

Provenance

The following attestation bundles were made for devagent_cli-3.3.1-py3-none-any.whl:

Publisher: publish.yml on VedantJadhav701/Developer-Code-Intelligence-Agent

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: devagent_cli-3.3.1-py3-none-any.whl
- Subject digest: 3eb13b61f1c33f87cd99791c3ce66236b28b3ff61a0d1e77128c9330c2ff0912
- Sigstore transparency entry: 1452674469
- Sigstore integration time: May 6, 2026
Source repository:
- Permalink: VedantJadhav701/Developer-Code-Intelligence-Agent@3b903b0eea44baa7915c115180a510b71d2daf7f
- Branch / Tag: refs/tags/v3.3.1
- Owner: https://github.com/VedantJadhav701
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@3b903b0eea44baa7915c115180a510b71d2daf7f
- Trigger Event: push
