An autonomous, LangGraph-powered AI development agency.

Project description

My Dev Team 🚀

An autonomous, LangGraph-powered AI development agency. My Dev Team takes raw project requirements and processes them through a multi-agent workflow (Product Manager, System Architect, Developers, and QA) to incrementally build, test, and deliver production-ready code.

Features

Multi-Agent Architecture: Specialized AI agents handle distinct phases of the software development lifecycle.
Semantic Model Routing: Automatically routes tasks to the most cost-effective or capable LLMs based on the task type (reasoning, coding, or fast-utility).
Strict Test-Driven Development (TDD): Testing is never an afterthought. Tasks are generated with embedded testing criteria, and the Developer writes unit tests alongside implementation code for immediate QA validation.
State Recovery & Resiliency: Powered by asynchronous SQLite checkpointing. If an API rate limit is hit or a workflow is interrupted, you can resume the exact thread without losing a single token of progress.
Telemetry & Cost Tracking: Automatically tallies prompt and completion tokens across the entire workflow. Calculates exact USD costs dynamically using LiteLLM's live pricing registry, printing a detailed receipt at the end of every run.
Incremental Development: The System Architect breaks down requirements into a manageable backlog of strictly formatted JSON tasks.
Self-Healing Code: The Developer, Reviewer, and QA Engineer agents continuously loop until unit tests pass and code meets specifications.
Structured Outputs: Powered by Pydantic and LangChain, ensuring zero "Markdown spillage" and robust state management.
Extensible: Easily add custom tools like HumanInTheLoop or WorkspaceSaver.

Sandboxed QA Execution

The QA Engineer agent does not rely on LLM "guesswork" or mental simulation to test code. It executes the generated code in reality.

Zero Hallucinations: The QA node mounts the active workspace into a temporary directory and runs the actual test suite (e.g., pytest, npm test). It reads the exact stdout/stderr tracebacks to accurately report bugs back to the Developer.
Ephemeral Isolation: Code is executed securely using the Docker SDK. Containers are strictly isolated, resource-limited (CPU/RAM), and immediately destroyed after the test run, ensuring your host machine is never at risk.
Universal Runtime Auto-Detection: The sandbox dynamically inspects the workspace or takes explicit direction from the System Architect to pull the correct Docker image (Python, Node.js, etc.) on the fly.

Centralized Configuration

Code and configuration are strictly separated to make the framework maintainable and extensible.

Model Routing (config/llms.yaml): All provider definitions (Groq, OpenAI, Ollama) and model routing logic (reasoning, coding, fast-utility) are centralized in a single YAML file, making it trivial to update models as new ones are released.
Agent Prompts (config/agents/**): Every agent's persona, system instructions, and constraints are stored as clean Markdown files with YAML frontmatter. No massive, hardcoded prompt strings cluttering the Python logic!
Sandbox Environments (config/sandbox.yaml): Docker base images and test execution commands for various runtimes (Python, Node.js) are completely decoupled. You can easily add support for entirely new programming languages by simply defining the image and test command in YAML, without touching the core Python engine.

Installation

You can install the package directly via pip:

pip install my-dev-team

(For local development, clone the repository and run pip install -e .)

1. Preparing Your Project File

The crew requires a text file outlining your project requirements. By default, it looks for a specific header format to extract the project name and thread ID.

Create a file named project.txt:

Subject: NEW PROJECT: Web Scraper CLI

I need a Python command-line tool that scrapes articles from a given URL.
It should extract the title, author, and main body text, and save the output as a JSON file.

Requirements:
- Use BeautifulSoup4 for parsing.
- Include a `--url` argument and an `--output` argument.
- Write unit tests for the parsing logic.

2. Usage (CLI)

The fastest way to use the framework is via the terminal command included in the package.

devteam project.txt

Advanced CLI Options

You can easily switch between cloud providers and local models, and adjust rate limits based on your API tier:

# Run entirely locally for free using Ollama, with no rate limit!
devteam project.txt --provider ollama

# Run using OpenAI's flagship models, limited to 15 requests per minute
devteam project.txt --provider openai --rpm 15

# Resume an interrupted run exactly where it left off
devteam --resume web_scraper_cli_20260312_083500

Available Arguments:

project_file: (Optional if resuming) Path to your project requirements text file.
--resume: Resume a specific thread ID (e.g., my_app_20260312_083500).
--provider: Choose the LLM backend. Options: groq, ollama (default), openai.
--rpm: API requests per minute. Set to 0 to disable rate limiting (default: 0).

Note: Ensure you have the corresponding API keys (e.g., GROQ_API_KEY, OPENAI_API_KEY) set in your .env file, or ensure your local Ollama instance is running.

3. Intelligent Model Routing (LLM Factory)

My Dev Team doesn't just use one model for everything. It uses an advanced Semantic Routing architecture via LLMFactory.

Instead of hardcoding a specific model (like gpt-5.3-codex), each agent requests a specific capability category and temperature. The Factory evaluates your chosen --provider and dynamically spins up the most cost-effective, capable model for that exact task.

The Categories

reasoning: For the System Architect and Product Manager. Maps to deep-thinking models.
code-generator: For the Senior Developer. Maps to strict, syntax-heavy models.
code-analyzer: For the QA and Reviewer agents. Maps to deep-context evaluation models.
fast-utility: For the Reporter. Maps to blazing-fast, ultra-cheap models for simple text summarization.

4. Usage (Python API)

If you want to integrate the crew into your own application, customize the LLM Factory's routing table, or override specific agent behaviors, use the clean Python API:

import asyncio
import aiosqlite
from pathlib import Path
from dotenv import load_dotenv

from langgraph.checkpoint.sqlite.aio import AsyncSqliteSaver
from devteam import VirtualCrew, ProjectManager, LLMFactory
from devteam.agents import ProductManager, SystemArchitect, SeniorDeveloper, CodeReviewer, QAEngineer, FinalQAEngineer, Reporter
from devteam.extensions import HumanInTheLoop, WorkspaceSaver
from devteam.utils import RateLimiter, TelemetryTracker

load_dotenv()

def build_crew(project_folder: Path, llm_factory: LLMFactory, checkpointer: AsyncSqliteSaver, rpm: int = 0) -> VirtualCrew:
    # Initialize agents using built-in prompt templates
    agents = {
        'pm': ProductManager.from_config('product-manager.md'),
        'architect': SystemArchitect.from_config('system-architect.md'),
        'developer': SeniorDeveloper.from_config('senior-developer.md'),
        'reviewer': CodeReviewer.from_config('code-reviewer.md'),
        'qa': QAEngineer.from_config('qa-engineer.md'),
        'final_qa': FinalQAEngineer.from_config('final-qa-engineer.md'),
        # Example: Forcing the reporter to use a more creative reasoning model
        'reporter': Reporter.from_config('reporter.md', model_category='reasoning', temperature=0.7)
    }
    # Add extensions like saving files to disk or requiring human approval
    extensions = [
        WorkspaceSaver(workspace_dir=workspace_dir),
        HumanInTheLoop()
    ]
    return VirtualCrew(
        manager=ProjectManager(),
        agents=agents,
        extensions=extensions,
        checkpointer=checkpointer,
        rate_limiter=RateLimiter(requests_per_minute=rpm) if rpm > 0 else None
    )

async def main():
    requirements = "Build a simple Python calculator CLI with basic arithmetic."
    workspace = Path('./workspaces/calculator_app')
    workspace.mkdir(parents=True, exist_ok=True)
    db_path = workspace / 'state.db'
    telemetry = TelemetryTracker()
    factory = LLMFactory(provider='groq', callbacks=[telemetry])
    try:
        async with aiosqlite.connect(db_path) as conn:
            checkpointer = AsyncSqliteSaver(conn)
            crew = build_crew(workspace, provider='groq', checkpointer=checkpointer, rpm=30)
            print("🚀 Starting the AI Dev Team...")
            final_state = await crew.execute(
                thread_id="calc_run_01",
                requirements=requirements
            )
        if final_state.abort_requested:
            print("❌ Workflow aborted by user or validation failure.")
        elif final_state.success:
            print("🎉 Project completed successfully!")
            print(f"Total Revisions: {final_state.total_revisions}")
            if final_state.final_report:
                print(final_state.final_report)
        else:
            print("🚨 Release failed: Integration bugs found!")
            for bug in final_state.integration_bugs:
                print(f" - {bug}")
    except KeyboardInterrupt:
        print("\n\n🛑 Workflow interrupted by user (Ctrl+C).")
        print(f"💡 You can resume this exact state later by running:")
        print(f"   dev-team --resume {thread_id}")

    finally:
        telemetry.print_receipt()

if __name__ == "__main__":
    asyncio.run(main())

AI Agents

Product Manager: Analyzes requirements, asks clarifying questions, and writes detailed Technical Specifications.
System Architect: Breaks specifications down into a cohesive backlog of developer tasks.
Senior Developer: Incrementally writes code and unit tests for the current task.
Code Reviewer: Analyzes the generated code for security, style, and logic issues.
QA Engineer: Mentally simulates execution and evaluates the code against the task requirements.
Final QA Engineer: Performs a full-repository integration test once all tasks are complete.
Reporter: Generates a comprehensive final Markdown report for stakeholders.

Project details

Release history Release notifications | RSS feed

0.12.4

Apr 21, 2026

0.12.3

Apr 20, 2026

0.12.2

Apr 16, 2026

0.12.1

Apr 16, 2026

0.12.0

Apr 12, 2026

0.11.5

Apr 7, 2026

0.11.3

Apr 6, 2026

0.11.2

Apr 4, 2026

0.11.1

Apr 3, 2026

0.11.0

Apr 3, 2026

0.10.0

Apr 1, 2026

0.9.2

Mar 30, 2026

0.9.1

Mar 29, 2026

0.9.0

Mar 29, 2026

0.8.0

Mar 28, 2026

0.7.1

Mar 26, 2026

0.7.0

Mar 25, 2026

0.6.1

Mar 24, 2026

0.6.0

Mar 23, 2026

0.5.1

Mar 22, 2026

0.5.0

Mar 18, 2026

0.4.2

Mar 17, 2026

0.4.1

Mar 17, 2026

0.4.0

Mar 15, 2026

This version

0.3.0

Mar 14, 2026

0.2.0

Mar 13, 2026

0.1.0

Mar 12, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

my_dev_team-0.3.0.tar.gz (42.3 kB view details)

Uploaded Mar 14, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

my_dev_team-0.3.0-py3-none-any.whl (51.4 kB view details)

Uploaded Mar 14, 2026 Python 3

File details

Details for the file my_dev_team-0.3.0.tar.gz.

File metadata

Download URL: my_dev_team-0.3.0.tar.gz
Upload date: Mar 14, 2026
Size: 42.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.12

File hashes

Hashes for my_dev_team-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`2c25bb25926b26a4205b6c2ecc02b33ee632d935e3161e0837fce289f8ae28ef`
MD5	`0c704362062d82537c35e5a457393c39`
BLAKE2b-256	`a61e6d176c9b69423b238a5613256a06a23314a3c81eaf1bec4a935515bd8865`

See more details on using hashes here.

File details

Details for the file my_dev_team-0.3.0-py3-none-any.whl.

File metadata

Download URL: my_dev_team-0.3.0-py3-none-any.whl
Upload date: Mar 14, 2026
Size: 51.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.12

File hashes

Hashes for my_dev_team-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f1ca711efc79d44fdf26fab8bbb5e471b5bcf995acc52a70a50f326c1f600dd9`
MD5	`137d2c3824d013a44985d9e04c9f7d94`
BLAKE2b-256	`cada4a2d52ceb00c75546ae5fe904aa33205454d1093bead921e7d6ab5471631`

See more details on using hashes here.

my-dev-team 0.3.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

My Dev Team 🚀

Features

Sandboxed QA Execution

Centralized Configuration

Installation

1. Preparing Your Project File

2. Usage (CLI)

Advanced CLI Options

Available Arguments:

3. Intelligent Model Routing (LLM Factory)

The Categories

4. Usage (Python API)

AI Agents

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes