A terminal-based AI agent that decomposes projects into epics, user stories, tasks, and sprint plans.
Project description
๐ Scrum AI Agent
AI-powered Scrum Master that decomposes projects into epics, stories, tasks, and sprint plans โ right from your terminal.
From project description to sprint plan in under a minute.
๐ Quick Start
Homebrew (macOS)
brew tap omardin14/tap && brew install scrum-agent
scrum-agent --setup # configure your API key
scrum-agent # launch the interactive TUI
pipx
pipx install scrum-agent
scrum-agent --setup
scrum-agent
From source
git clone https://github.com/omardin14/scrum-planning-ai-agent.git
cd scrum-planning-ai-agent
make install # installs uv, creates venv, installs dependencies
make env # creates .env from .env.example โ add your API key
make run # launch the CLI
Headless / CI mode
scrum-agent --non-interactive --description "Build a todo app" --output json
scrum-agent --non-interactive --description @project-brief.txt --output html --team-size 5
๐ Table of Contents
- Quick Start
- Features
- Getting Started
- Deploy on AWS Lightsail
- CLI Reference
- Intake Modes
- Pipeline
- Team Analysis Mode
- Analysis-Calibrated Planning
- Export Formats
- Session Management
- Usage Page
- Settings Page
- Tools
- Architecture
- Project Intake Questionnaire
- Scrum Standards
- Prompt Construction
- Guardrails
- Multi-Provider LLM Support
- Development
- Evaluation & Testing
- Tech Stack
- Agentic Blueprint Reference
- License
โจ Features
๐ฅ๏ธ Full-screen TUI โ Animated splash, mode selection with ASCII art titles, pipeline progress with spinners and elapsed time, dark/light themes
๐ง Smart Intake โ Extracts answers from your description, asks only what's missing, adaptive follow-ups with question-specific probes
๐ Human-in-the-Loop โ Accept, edit, or reject at every pipeline stage with numbered review menus
๐ Capacity Planning โ Bank holidays (100+ countries), PTO/leave tracking, unplanned absence %, onboarding ramp-up, per-sprint velocity
๐ 30 Tools โ GitHub, Azure DevOps, Jira, Confluence, local codebase scanning, bank holiday detection, LLM-powered estimation
๐ค 5 Export Formats โ Markdown, HTML, JSON, Jira sync, Azure DevOps Boards sync
๐ค 4 LLM Providers โ Claude (default), GPT, Gemini, AWS Bedrock
๐พ Session Persistence โ SQLite-backed sessions that survive terminal restarts; resume with --resume
๐ก๏ธ Guardrails โ Input validation (injection, profanity, relevance), output validation (story format, AC coverage, sprint capacity)
๐ท๏ธ Task Enrichment โ Auto-tagged labels (Code/Docs/Infra/Testing), test plans, AI coding prompts for Cursor/Claude Code/Copilot
๐ Offline Questionnaire โ Export a blank template, fill in at your own pace, import to skip interactive flow
๐ SCRUM.md Context โ Drop a SCRUM.md in your project directory; the agent reads it to pre-fill answers and ground output
๐ฌ Team Analysis Mode โ Connect your Jira or Azure DevOps board to analyze your team's real patterns: velocity, sprint cadence, story structure, acceptance criteria style, naming conventions, and per-developer breakdown
๐ฏ Analysis-Calibrated Planning โ Select a team analysis profile when planning and the agent auto-fills intake questions, matches your team's story/task counts, enforces your AC format, uses your Definition of Done, and shows calibration banners at every pipeline stage
๐ฅ Team Member Selection โ Pick specific developers from your analysis profile for a project; velocity is calculated from their individual averages, not the whole team total
๐ Usage Dashboard โ Track token consumption per session, per provider, and lifetime totals with a dedicated usage page
๐ Getting Started
Prerequisites
- Python 3.11+
- An API key for at least one LLM provider:
- Anthropic (recommended)
- OpenAI
- Google AI Studio
- AWS Bedrock (IAM credentials โ no API key needed on EC2/Lightsail)
Installation (development)
make install # installs uv, creates venv, installs dependencies
make env # creates .env from .env.example
make pre-commit # installs pre-commit hooks
First-run setup wizard
On first launch (or with --setup), an interactive wizard walks you through:
- LLM provider selection โ choose Anthropic, OpenAI, Google, or AWS Bedrock
- API key entry โ with format validation hints (e.g., Anthropic keys start with
sk-ant-) - Issue tracking โ Jira or Azure DevOps Boards (with org URL, project, and PAT verification)
- Version control โ GitHub PAT token (or skip)
- Credential storage โ saved to
~/.scrum-agent/.env
scrum-agent --setup # re-run anytime to update credentials
API keys
Anthropic (default)
ANTHROPIC_API_KEY=sk-ant-...
OpenAI (alternative)
LLM_PROVIDER=openai
OPENAI_API_KEY=sk-...
Google (alternative)
LLM_PROVIDER=google
GOOGLE_API_KEY=AIza...
๐ LangSmith (optional tracing)
LangSmith provides tracing and observability. Add to .env:
LANGSMITH_TRACING=true
LANGSMITH_API_KEY=lsv2_pt_...
LANGSMITH_PROJECT=scrum-agent
โ๏ธ Deploy on AWS Lightsail (OpenClaw) โ 14 steps
โ๏ธ Deploy on AWS Lightsail (OpenClaw)
Run scrum-agent as a cloud service via OpenClaw on AWS Lightsail. OpenClaw comes pre-installed on the Lightsail blueprint and uses Amazon Bedrock (Claude Sonnet 4.6) as its model provider.
1. Create the instance
- Open the AWS Lightsail console
- Create instance โ choose the OpenClaw blueprint under "Apps + OS"
- Select your region (e.g.,
eu-west-2) and instance plan (2 GB RAM minimum recommended) - Name the instance (e.g.,
OpenClaw-1) and create it
2. Attach a static IP
Your public IP changes on every stop/start. Attach a static IP in the Lightsail Networking tab to keep it stable.
3. Enable Bedrock access
OpenClaw uses Amazon Bedrock as its default AI model provider. Grant Bedrock API access by running the setup script in AWS CloudShell:
curl -s https://d25b4yjpexuuj4.cloudfront.net/scripts/lightsail/setup-lightsail-openclaw-bedrock-role.sh \
| bash -s -- OpenClaw-1 eu-west-2
Replace OpenClaw-1 with your instance name and eu-west-2 with your region.
Note: If this is your first time using Anthropic models in Amazon Bedrock, you'll need to complete the First Time Use (FTU) form to gain access.
4. Pair your browser
- Click Connect using SSH in the Lightsail console (or use your own SSH client)
- Follow the on-screen instructions to pair your browser with the OpenClaw dashboard
- Click Open dashboard to access the OpenClaw web UI
5. Install scrum-agent on the instance
SSH into the instance and install scrum-agent with Bedrock support:
# Install pipx if not present
sudo apt update && sudo apt install -y pipx
pipx ensurepath
source ~/.bashrc # reload PATH so pipx-installed binaries are found
# Install scrum-agent with Bedrock provider
pipx install "scrum-agent[bedrock]"
Tip: If
scrum-agent: command not foundafter install, runsource ~/.bashrc(or start a new SSH session) to pick up the updated PATH.
6. Run the setup wizard
scrum-agent --setup
- Select Bedrock as your LLM provider
- The AWS region is auto-detected from
~/.aws/config(e.g.,eu-west-2) โ press Enter to confirm - The wizard verifies Bedrock access using the IAM role attached by the Bedrock setup script โ no API key needed
- If OpenClaw is installed, the Bedrock model ID (e.g.,
global.anthropic.claude-sonnet-4-6) is auto-detected and saved to~/.scrum-agent/.env
7. Test headless mode
Verify scrum-agent works end-to-end before installing the skill:
scrum-agent --non-interactive --description "Build a todo app" --output json
You should see JSON output with features, stories, tasks, and sprints.
8. Install the OpenClaw skill
The scrum-planner skill lets OpenClaw conduct conversational scrum planning โ it asks intake questions (or skips them in quick mode), generates a temp SCRUM.md, invokes scrum-agent --non-interactive --output json, and presents results phase-by-phase with accept/edit/regenerate options.
Tip: After every
pipx install --force(e.g., updating to a new version), re-runscrum-agent --install-skillto update the skill files and refresh the configuration.
A single command handles the full setup:
scrum-agent --install-skill
This will:
- Copy the skill files to the skills registry at
/usr/lib/node_modules/openclaw/skills/scrum-planner/(may prompt for sudo) - Copy the skill files into the sandbox workspace at
~/.openclaw/workspace/skills/scrum-planner/ - Sync the Bedrock model ID and region from OpenClaw's config into
~/.scrum-agent/.env - Disable the Docker sandbox so
scrum-agentruns directly on the host - Restart the OpenClaw gateway to load the new skill
[1/5] Skill registry: /usr/lib/node_modules/openclaw/skills/scrum-planner
[2/5] Sandbox workspace: /home/ubuntu/.openclaw/workspace/skills/scrum-planner
[3/5] Bedrock config synced: model=global.anthropic.claude-sonnet-4-6, region=eu-west-2
[4/5] Sandbox disabled โ scrum-agent will run on host
[5/5] Restart OpenClaw gateway to load the skill? [Y/n]
Security note: This disables OpenClaw's Docker sandbox isolation, meaning tools execute directly on the host. This is safe for dedicated Lightsail instances running only the scrum-planner skill. For shared or multi-tenant setups, consider building a custom sandbox image with Python pre-installed instead.
To install to a custom skills directory:
scrum-agent --install-skill /path/to/openclaw/skills
9. Verify the skill is loaded
Open the OpenClaw dashboard and check the Skills page. You should see scrum-planner listed under Installed Skills with an "eligible" badge.
10. Test the skill
Start a new conversation in the OpenClaw dashboard or Slack. Try a detailed project:
"Plan an e-commerce marketplace โ React + Next.js frontend, Python FastAPI backend, PostgreSQL, Redis, Stripe for payments, Auth0 for auth. 4 engineers, 2-week sprints. Deployed on AWS ECS."
You should see:
- Smart extraction โ the skill detects project, tech stack, team size, integrations from your message
- Follow-up questions โ only asks what's missing (project type, definition of done, target sprints)
- Confirmation summary โ your answers + defaults, with option to override
- TUI recommendation โ for complex projects (3+ features, 5+ engineers), the skill suggests installing the full TUI via
brew install scrum-agentorpipx install scrum-agentfor interactive editing, sprint visualisation, and capacity planning. This is informational only โ generation always proceeds. - Background generation โ progress updates as each phase completes (~3-5 minutes)
- Phase-by-phase review โ features, stories, tasks, sprint plan โ each with accept/edit/regenerate
For a faster test, try quick mode:
"just plan it โ todo app, FastAPI + PostgreSQL, 1 engineer"
11. Troubleshooting
If the skill doesn't appear in the dashboard:
# Re-install and restart
scrum-agent --install-skill
If scrum-agent fails inside the skill:
# Test headless mode directly
scrum-agent --non-interactive --description "Build a todo app" --team-size 3 --sprint-length 2 --output json
# Check logs
ls -lt ~/.scrum-agent/logs/ | head -5
tail -50 ~/.scrum-agent/logs/*.log
# Check credentials
grep LLM_PROVIDER ~/.scrum-agent/.env
12. Connect Slack (optional)
Connect OpenClaw to your Slack workspace so users can trigger the scrum-planner skill via @mention.
Quick setup (interactive)
openclaw channels add
Follow the interactive prompts โ OpenClaw walks you through each step.
Manual setup
If you prefer to set up the Slack App manually:
- Go to Slack API โ Create App โ From manifest and paste the JSON below
- Socket Mode โ Enable it โ copy the App-Level Token (
xapp-...) - Install App to your workspace โ copy the Bot Token (
xoxb-...) - Event Subscriptions โ already configured via the manifest (socket mode)
- App Home โ Messages tab is enabled for DMs
Slack App manifest (click to expand)
{
"display_information": {
"name": "OpenClaw",
"description": "OpenClaw connector"
},
"features": {
"bot_user": {
"display_name": "OpenClaw",
"always_online": false
},
"app_home": {
"messages_tab_enabled": true,
"messages_tab_read_only_enabled": false
},
"slash_commands": [
{
"command": "/openclaw",
"description": "Send a message to OpenClaw",
"should_escape": false
}
]
},
"oauth_config": {
"scopes": {
"bot": [
"chat:write",
"channels:history",
"channels:read",
"groups:history",
"im:history",
"mpim:history",
"users:read",
"app_mentions:read",
"reactions:read",
"reactions:write",
"pins:read",
"pins:write",
"emoji:read",
"commands",
"files:read",
"files:write",
"canvases:read",
"canvases:write"
]
}
},
"settings": {
"socket_mode_enabled": true,
"event_subscriptions": {
"bot_events": [
"app_mention",
"message.channels",
"message.groups",
"message.im",
"message.mpim",
"reaction_added",
"reaction_removed",
"member_joined_channel",
"member_left_channel",
"channel_rename",
"pin_added",
"pin_removed"
]
}
}
}
Important: The manifest includes
canvases:readandcanvases:writescopes. These are required for OpenClaw to post the finalized sprint plan as a Slack Canvas. Without them, the plan will be posted as threaded messages instead (which also works, but Canvas gives a better reading experience for large plans).
Set the tokens on the Lightsail instance:
openclaw channels add
# When prompted, enter:
# App-Level Token: xapp-...
# Bot Token: xoxb-...
13. Use the skill in Slack
Once Slack is connected, @mention the bot in a channel to invite it. Send a casual first message to establish the thread โ the bot's first reply won't appear in a thread, so keep it simple:
You: @OpenClaw hey, how's it going?
Then start your planning session in a new message (this one will create a proper thread):
You: @OpenClaw Plan a mobile banking app โ React Native, Node.js, PostgreSQL, 6 engineers
The skill runs the same conversational intake as the dashboard, directly in a Slack thread:
- Smart extraction โ the bot detects answers from your initial message and shows what it found
- Follow-up questions โ numbered choices for project type, sprint length, and target sprints
- Adaptive probes โ "You said 6 engineers โ what are their roles?"
- Confirmation โ summary list with answer sources and defaults before generating
After confirmation, the bot runs scrum-agent in the background (~3-5 minutes), then presents results phase-by-phase with accept/edit/regenerate options.
14. Next steps
- Push to Jira โ Configure Jira credentials in
scrum-agent --setup, then the skill offers "Push to Jira" after plan finalization. - Customize the skill โ Edit
~/.openclaw/workspace/skills/scrum-planner/SKILL.mdto adjust question flow, add domain-specific defaults, or change the output format. - Review diagnostics โ Check
~/.scrum-agent/logs/for detailed run logs if anything looks off. - Secure with Teleport โ For production use, add Teleport for identity-aware access to the Lightsail instance and OpenClaw dashboard.
See skills/scrum-planner/README.md for full skill documentation, question-to-CLI mapping, and troubleshooting.
โจ๏ธ CLI Reference
scrum-agent [OPTIONS]
Interactive modes
| Flag | Description |
|---|---|
| (no flags) | Launch the full-screen TUI with mode selection |
--quick |
Quick intake โ 2 questions only (team size + tech stack), auto-fill rest |
--full-intake |
Full 30-question intake (standard mode) |
--mode project-planning |
Skip the startup menu, go directly to project planning |
--questionnaire PATH |
Import a filled-in questionnaire Markdown file |
--export-only |
Auto-accept all review checkpoints and exit after plan generation |
Non-interactive / headless
| Flag | Description |
|---|---|
--non-interactive |
Run headlessly (requires --description) |
--description TEXT |
Project description. Use @file.txt to read from a file |
--output {markdown,json,html} |
Output format (default: markdown). Only valid with --non-interactive or --export-only |
--team-size N |
Team size (maps to intake Q6) |
--sprint-length {1,2,3,4} |
Sprint length in weeks (maps to intake Q8) |
Session management
| Flag | Description |
|---|---|
--resume [ID] |
Resume a session. No argument = interactive picker. latest = most recent. Or pass a session ID |
--list-sessions |
List all saved sessions and exit |
--clear-sessions |
Interactively delete saved sessions |
Configuration
| Flag | Description |
|---|---|
--setup |
Re-run the first-time setup wizard |
--theme {dark,light} |
Terminal colour theme (default: dark) |
--no-bell |
Disable terminal bell after pipeline steps |
--dry-run |
Run TUI with mock data and fake delays โ no LLM calls |
--version |
Print version and exit |
Questionnaire export
| Flag | Description |
|---|---|
--export-questionnaire [PATH] |
Export a blank questionnaire template as Markdown |
๐ป In-session commands
These commands are available at the scrum> prompt during an interactive session:
| Command | Description |
|---|---|
help, ? |
Show available commands |
skip |
Skip the current intake question (uses a sensible default) |
defaults |
Apply defaults for all remaining questions in the current phase |
export |
Export current artifacts as HTML report + Markdown |
/compact |
Switch to compact output (hide secondary columns) |
/verbose |
Switch to verbose output (full detail, default) |
/resume |
Load a previously saved session |
/clear |
Delete saved sessions (pick one or all) |
Q6: answer |
Edit Q6 inline from the summary |
edit Q6 |
Re-answer Q6 interactively from the summary |
exit, quit |
Exit the agent |
Ctrl+C, Ctrl+D |
Exit the agent |
The status bar at the bottom of the terminal shows project name, current phase, and session info. It updates automatically as you progress through the pipeline.
๐ Examples
scrum-agent # interactive TUI (recommended)
scrum-agent --quick # quick intake (2 questions only)
scrum-agent --full-intake # full 30-question intake
scrum-agent --questionnaire intake.md # import pre-filled questionnaire
scrum-agent --export-only --quick # non-interactive, auto-accept all
scrum-agent --resume # resume last session (picker)
scrum-agent --resume latest # resume most recent session
scrum-agent --list-sessions # list all saved sessions
scrum-agent --clear-sessions # delete saved sessions
scrum-agent --non-interactive --description "Build X" # headless mode
scrum-agent --non-interactive --description @brief.txt --output json # JSON to stdout
scrum-agent --dry-run # TUI with mock data
๐ฏ Intake Modes
The agent supports four intake modes, each balancing thoroughness with speed.
Smart mode (default)
The recommended mode. The agent:
- Reads your initial project description and extracts answers to as many questions as possible
- When an analysis profile is selected, auto-fills team size, sprint length, velocity, tech stack, and integrations from real data
- Asks only the remaining essential questions (typically 2โ4)
- Uses answer provenance tracking to tag how each answer was obtained:
DIRECTโ you explicitly answeredEXTRACTEDโ parsed from your initial descriptionDEFAULTEDโ filled with a sensible defaultPROBEDโ filled via a targeted follow-up questionSCRUM_MDโ loaded from aSCRUM.mdfile in the current directory
- Applies conditional essentials โ questions that only appear when relevant (e.g., "What are their roles?" only asked after you give a team size)
- Runs cross-question validation โ catches contradictions (e.g., team size of 1 but multiple roles listed)
- Generates adaptive follow-ups using question-specific templates (not generic "tell me more")
- Accepts "any" / "no preference" for tech stack (Q11) and "none" for integrations (Q12) without triggering follow-up probes
Quick mode (--quick)
Two questions only: team size and tech stack. Everything else gets sensible defaults. Best for rapid prototyping or CI pipelines.
Standard mode (--full-intake)
Six questions are rendered as numbered selection menus instead of free text:
| Q | Topic | Options |
|---|---|---|
| Q2 | Project type | Greenfield / Existing codebase / Hybrid |
| Q8 | Sprint length | 1 week / 2 weeks (default) / 3 weeks / 4 weeks |
| Q16 | Code hosting | GitHub / Azure DevOps / GitLab / Bitbucket / Local |
| Q18 | Repo structure | Monorepo / Multi-repo / Microservices / Monolith |
| Q24 | Estimation style | Fibonacci points / T-shirt sizes / No estimates |
| Q26 | Output format | Jira / Markdown / Both |
Type defaults at any question to batch-accept all defaults for the current phase and skip ahead.
All 30 questions asked one-at-a-time in a conversational flow. Seven phases:
- Project Context (Q1โQ5) โ name, type, goals, users, deadlines
- Team & Capacity (Q6โQ10) โ engineers, roles, sprint length, velocity, target sprints
- Technical Context (Q11โQ14) โ tech stack, integrations, constraints, docs
- Codebase Context (Q15โQ20) โ repo, structure, CI/CD, tech debt
- Risks & Unknowns (Q21โQ23) โ risks, blockers, out-of-scope
- Preferences (Q24โQ26) โ estimation, DoD, output format
- Capacity Planning (Q27โQ30) โ sprint selection, bank holidays, unplanned absence %, onboarding
Offline import (--questionnaire)
- Export a blank template:
scrum-agent --export-questionnaire - Fill it in at your own pace in any editor
- Import:
scrum-agent --questionnaire intake.md - Review the summary and confirm before proceeding
The format is round-trippable (export โ edit โ import preserves answers exactly).
๐ SCRUM.md context
Drop a SCRUM.md file in your project directory with any relevant context โ project notes, design decisions, URLs, architecture diagrams. The agent reads it automatically and uses it to pre-fill answers and ground its output. Answers extracted from SCRUM.md are tagged with *(from SCRUM.md)* provenance markers in the intake summary. Your typed description always takes priority over SCRUM.md when both provide the same information.
๐ scrum-docs/ directory
Place PRDs, design docs, or reference material in a scrum-docs/ directory. Supported formats: .md, .txt, .rst, .pdf. PDF support requires the pymupdf optional dependency:
uv sync --extra pdf
Files are automatically ingested and fed into the project analyzer for grounded output.
๐ Pipeline
After intake confirmation, the agent runs a 5-stage pipeline with a human-in-the-loop checkpoint after each stage:
graph LR
A[๐ Project Intake] --> B[๐ Project Analyzer]
B --> C[๐งฉ Feature Generator]
C --> D[๐ Story Writer]
D --> E[โ
Task Decomposer]
E --> F[๐
Sprint Planner]
C -.-> |accept/edit/reject| C
D -.-> |accept/edit/reject| D
E -.-> |accept/edit/reject| E
F -.-> |accept/edit/reject| F
| Stage | What it does |
|---|---|
| Project Analyzer | Synthesizes all 30 intake answers into a structured ProjectAnalysis โ name, type, goals, tech stack, constraints, risks, out-of-scope |
| Feature Generator | Decomposes the analysis into high-level features with priorities (Critical/High/Medium/Low) |
| Story Writer | Breaks features into user stories with persona/goal/benefit format, Given/When/Then acceptance criteria, Fibonacci story points (1โ8, auto-split if >8), discipline tagging, and Definition of Done flags |
| Task Decomposer | Breaks stories into concrete tasks with labels (Code/Documentation/Infrastructure/Testing), test plans, and AI coding prompts. Auto-generates documentation sub-tasks for stories with Documentation in their DoD |
| Sprint Planner | Allocates stories to sprints respecting per-sprint net velocity (deducted for bank holidays, PTO, unplanned absence, onboarding, KTLO). Handles capacity overflow with 3 options: extend sprints, increase team, or keep as-is |
At each checkpoint, you can:
[1] Acceptโ proceed to the next stage[2] Editโ modify specific artifacts inline[3] Rejectโ re-generate with your feedback
Task enrichment
Every task generated by the Task Decomposer includes:
| Field | Description |
|---|---|
| Label | Auto-tagged: Code, Documentation, Infrastructure, or Testing โ colour-coded in all views |
| Test plan | Auto-generated for Code and Infrastructure tasks โ lists what to test (unit, integration, edge cases) |
| AI prompt | ARC-structured instruction for Cursor/Claude Code/Copilot, including project name, tech stack, and specific guidance |
Stories with "Documentation" marked as applicable in their DoD get a consolidated documentation sub-task referencing Confluence/README URLs from intake.
๐น Small project handling
For projects with โค2 sprints and โค3 goals, the analyzer sets skip_epics. Instead of multi-epic decomposition, a single sentinel epic is created using the project name as its title. The rest of the pipeline (stories, tasks, sprints) proceeds normally.
๐ Prompt quality rating
After intake, the analysis review screen shows a deterministic quality rating:
- Letter grade (A/B/C/D) with percentage score
- Breakdown: answered, extracted, defaulted, skipped, probed counts
- Actionable suggestions: "Add a SCRUM.md file", "Answer Q11 (tech stack) for better stories", etc.
- Low-confidence areas: defaulted essential questions flagged for downstream spike recommendations
๐ฌ Team Analysis Mode
Team Analysis connects to your Jira or Azure DevOps board and produces a comprehensive analysis of your team's real delivery patterns. The analysis becomes a reusable profile that calibrates future planning sessions.
What gets analyzed
| Category | Details |
|---|---|
| Velocity | Per-sprint velocity, team average, per-developer breakdown with delivery vs KTLO split |
| Sprint Cadence | Sprint length, seasonal patterns, velocity trends over time |
| Story Structure | Average stories per epic, story point distribution, typical story size |
| Task Patterns | Average tasks per story, type distribution (Development %, Testing %, Deploy %) |
| Acceptance Criteria | AC count per story, writing style (Given/When/Then vs flexible), coverage patterns |
| Definition of Done | Team's actual DoD items extracted from completed stories |
| Naming Conventions | Epic naming style (quarter-scoped, feature-based), ticket organization patterns |
| Team Members | Per-contributor analysis with velocity, discipline (backend/frontend/etc.), stories completed, sprint participation |
| Estimation Bias | Whether the team tends to over- or under-estimate |
| Board Workflow | Column workflow analysis showing how work items flow through states |
How it works
- Select a board โ pick your Jira project or Azure DevOps team board
- Choose sprint scope โ analyze last 3, 5, 10, or all sprints
- Review the analysis โ 6-page walkthrough: Instructions, Sample Epic, Sample Stories, Sample Tasks, Sample Sprint Plan, Analysis Report
- Save as a profile โ stored in SQLite, reusable across planning sessions
Per-developer breakdown
Each team member gets individual analysis:
| Metric | Description |
|---|---|
| Velocity | Individual points per sprint average |
| Discipline | Primary skill area (backend, frontend, fullstack, infrastructure, etc.) โ LLM-classified from actual work |
| Stories completed | Total stories delivered in the analysis window |
| Sprint participation | Number of sprints active |
| Delivery vs KTLO | Split between feature delivery and keeping-the-lights-on work |
Analysis exports
Analysis results can be exported as HTML or Markdown reports. Both formats include all analysis sections plus a Team Members table. When an analysis profile is used during planning, the export includes a provenance note linking back to the analysis.
๐ฏ Analysis-Calibrated Planning
When you start a planning session, you can select a saved analysis profile. This calibrates the entire planning pipeline to match your team's real patterns.
Intake auto-fill
The analysis profile auto-fills intake questions so you don't have to answer them manually:
| Question | Source |
|---|---|
| Q6 โ Team size | Replaced with a team member multi-select picker showing each developer's velocity and discipline |
| Q7 โ Team roles | Auto-filled from selected members' disciplines |
| Q8 โ Sprint length | Derived from sprint date ranges in the analysis |
| Q9 โ Velocity | Calculated from selected team members' individual per-sprint averages |
| Q11 โ Tech stack | Auto-filled from the analysis profile's tech stack (shown as suggestion, user can override) |
| Q12 โ Integrations | Auto-filled from the analysis profile's integrations list |
| Q27 โ Sprint selection | Falls back to analysis sprint data when live tracker is unavailable |
Team member multi-select
When an analysis profile has contributor data, Q6 (team size) becomes a multi-select picker instead of a free-text number:
- Each member shows their velocity (pts/sprint) and discipline
- Select specific developers for this project with Space, confirm with Enter
- Velocity is calculated from the selected members' individual averages
- Team roles (Q7) auto-populated from selected members' disciplines
Calibration banners
Each pipeline stage shows a calibration banner explaining what analysis data influenced the output:
| Stage | Banner shows |
|---|---|
| Project Analyzer | Selected profile name and source board |
| Story Writer | Target stories/epic, AC count, points scale, DoD source |
| Task Decomposer | Target tasks/story, type distribution (Dev/Test/Deploy %) |
| Sprint Planner | Selected team members with individual velocities, total team velocity |
What gets calibrated
| Artifact | Calibration |
|---|---|
| Epics | LLM-reformatted to match team's naming convention (quarter-scoped, feature-based, etc.) and template sections |
| Stories per feature | Matches team's historical average stories per epic |
| AC count | Enforces team's median acceptance criteria count per story |
| AC format | Respects team's writing style (Given/When/Then or flexible) |
| Definition of Done | Uses team's actual DoD items instead of generic checklist |
| Story points | Includes confidence scoring and references to similar stories from the analysis |
| Tasks per story | Matches team's average task count |
| Task type distribution | Matches team's Development/Testing/Deploy percentages |
Epic review
Before the feature generator runs, an epic review page lets you review and edit the project epic. When an analysis profile is active, the epic is LLM-reformatted to match your team's style:
- Quarter-scoped naming when the team uses that convention (e.g., "Q2 2026 โ Customer Portal")
- Template sections matching the team's epic structure
- Sync to Jira or Azure DevOps directly from the review page
๐ค Export Formats
Markdown (default)
Writes scrum-plan.md with all artifacts structured as headings, tables, and lists.
scrum-agent --export-only --quick
HTML
Self-contained single-file HTML report with embedded CSS, collapsible sections, and a table of contents. No external dependencies.
scrum-agent --non-interactive --description "Build a todo app" --output html
JSON
Clean, pipeable JSON schema for CI/CD integration. No internal state fields โ just the plan artifacts:
{
"version": "1.0.0",
"project": { "name", "description", "type", "goals", "tech_stack", "team_size", "sprint_length_weeks" },
"features": [...],
"stories": [...],
"tasks": [...],
"sprints": [...]
}
scrum-agent --non-interactive --description "Build a todo app" --output json | jq '.stories | length'
When using --output json, Rich console output goes to stderr so stdout is clean JSON.
Jira
Batch sync with idempotent creation, available from TUI pipeline review at any stage or from the project list:
| Artifact | Jira Mapping |
|---|---|
| Features | Jira Labels (not separate issues) |
| Epic | 1 project-level Epic |
| Stories | Issues linked to Epic, with story points, priority, acceptance criteria, feature labels |
| Tasks | Sub-tasks linked to parent Stories, with task labels |
| Sprints | Created with name, goal, dates; stories assigned to sprints |
Key behaviors:
- Idempotency โ checks
jira_*_keysstate before creating; skips already-synced artifacts - Cascade creation โ Task stage auto-creates Stories if not yet synced; Sprint stage auto-creates Stories if not yet synced
- Project type detection โ discovers issue types dynamically (handles team-managed vs. classic Jira projects)
- Confirmation screen โ shows what will be created/skipped before any write operation
- Progress screen โ animated per-item status during creation
- Jira button โ disabled/dimmed in TUI when
JIRA_API_TOKENis not configured
๐ท Azure DevOps Boards
Azure DevOps Boards
Batch sync with idempotent creation, available from TUI pipeline review at any stage:
| Artifact | Azure DevOps Mapping |
|---|---|
| Features | Tags (System.Tags, semicolon-separated) |
| Epic | 1 project-level Epic work item |
| Stories | User Story work items linked to Epic via System.LinkTypes.Hierarchy-Reverse, with story points, priority (1โ4), HTML descriptions |
| Tasks | Task work items linked to parent Stories |
| Sprints | Iterations (classification nodes created via REST API); stories assigned via System.IterationPath |
Key behaviors:
- Idempotency โ checks
azdevops_*_keysstate before creating; skips already-synced artifacts - Cascade creation โ Task stage auto-creates Stories if not yet synced; Iteration stage auto-creates Stories if not yet synced
- Team area path โ sets
System.AreaPathto{project}\{team}so work items appear on the correct team board - Description updates on re-sync โ already-created items get descriptions updated (DoD, rationale, AI prompts added later)
- Sprint naming convention โ detects board's existing iteration naming pattern and renames LLM-generated names to match
- Iteration dates โ sets start/finish dates on iterations based on sprint start date and sprint length
- Velocity auto-detection โ fetches velocity from past iterations during intake (falls back from Jira to AzDO)
- HTML descriptions โ acceptance criteria, Definition of Done, and points rationale rendered as
<h3>,<strong>,<ul><li>(not Jira wiki markup) - Priority mapping โ
critical โ 1,high โ 2,medium โ 3,low โ 4 - Confirmation screen โ shows what will be created/skipped before any write operation
- Progress screen โ animated per-item status during creation
- Azure DevOps button โ disabled/dimmed in TUI when credentials are not configured
PAT permissions required
Create a Personal Access Token with the following scopes:
| Scope | Access | Used for |
|---|---|---|
| Work Items | Read & Write | Reading board/backlog, creating Epics/Stories/Tasks |
| Project and Team | Read | Listing iterations, team settings, velocity data |
| Code | Read | Reading repo file tree and file contents (optional, for repo context tools) |
AZURE_DEVOPS_TOKEN=your-pat-token
AZURE_DEVOPS_ORG_URL=https://dev.azure.com/your-org
AZURE_DEVOPS_PROJECT=MyProject
AZURE_DEVOPS_TEAM=MyProject Team # optional โ defaults to "{project} Team"
๐พ Session Management
Sessions are persisted to SQLite at ~/.scrum-agent/data/sessions.db. Every terminal session gets a unique ID (new-<8hex>-<YYYY-MM-DD>) and a human-readable display name derived from the project slug (todoapp-2026-03-19). Team analysis profiles are stored in the same database.
Directory structure
~/.scrum-agent/
data/
sessions.db # SQLite โ planning sessions, analysis profiles, token usage
exports/
analysis/ # HTML + Markdown analysis reports
planning/ # HTML + Markdown planning exports
logs/
tui/ # TUI application logs (rotates at 2 MB)
analysis/ # Per-analysis session logs
planning/ # Per-planning session logs
.env # API keys and credentials
Resume a session
scrum-agent --resume # interactive picker
scrum-agent --resume latest # most recent session
scrum-agent --resume <id> # specific session ID
Resumed sessions pick up exactly where you left off โ mid-questionnaire, mid-review, or between pipeline stages.
List sessions
scrum-agent --list-sessions
Shows a table with project name, date, last completed step, and session ID.
Delete sessions
scrum-agent --clear-sessions
Interactive picker to delete one session or clear all.
Auto-pruning
Sessions older than 30 days are auto-pruned at startup. Configure via SESSION_PRUNE_DAYS in .env (set to 0 to disable).
๐ Usage Page
The Usage page tracks token consumption across all sessions, accessible from the main menu.
| Metric | Description |
|---|---|
| Session tokens | Input and output tokens for the current session |
| Lifetime tokens | Cumulative total across all sessions, persisted in SQLite |
| Per-provider breakdown | Tokens split by LLM provider (Anthropic, OpenAI, Google, Bedrock) |
| Session history | Recent sessions with their token counts and timestamps |
| Cost estimate | Approximate cost based on provider pricing |
Token usage is tracked automatically via track_usage() on every LLM call and persisted to the token_usage table in SQLite. The page uses a dedicated amber colour theme to distinguish it from Planning (blue) and Analysis (green).
โ๏ธ Settings Page
The Settings page provides a read-only view of your current configuration and a shortcut to the setup wizard.
| Section | What it shows |
|---|---|
| LLM Provider | Active provider and model (e.g., Anthropic / claude-sonnet-4) |
| API Keys | Configured keys with values masked (e.g., sk-ant-***...***abc) |
| Issue Tracking | Jira and/or Azure DevOps connection status, org URL, project |
| Version Control | GitHub token status |
| Paths | Database location, export directories, log directories |
From the Settings page you can launch the setup wizard to reconfigure providers, API keys, and integrations. The page uses a grey colour theme.
๐ง Tools
The agent has access to 30 tools, organized by integration:
๐ GitHub (4 tools)
| Tool | Description |
|---|---|
github_read_repo |
Fetch repo metadata, languages, and file tree |
github_read_file |
Read a single file from a GitHub repo |
github_list_issues |
List open issues with labels |
github_read_readme |
Extract README content |
๐ท Azure DevOps (9 tools)
| Tool | Description | Risk |
|---|---|---|
azdevops_read_repo |
Fetch repo metadata and file tree | Low |
azdevops_read_file |
Read a single file from a repo | Low |
azdevops_list_work_items |
List work items (backlog, active, etc.) | Low |
azdevops_read_board |
Board info, active iteration, average velocity | Low |
azdevops_fetch_velocity |
Team velocity, team size, per-developer velocity | Low |
azdevops_fetch_active_iteration |
Current sprint name, number, start date | Low |
azdevops_create_epic |
Create an Epic work item | High (requires confirmation) |
azdevops_create_story |
Create a User Story linked to Epic, with story points and priority | High (requires confirmation) |
azdevops_create_iteration |
Create an iteration (sprint) with optional start/finish dates | High (requires confirmation) |
๐ซ Jira (6 tools)
| Tool | Description | Risk |
|---|---|---|
jira_read_board |
Fetch board metadata and configuration | Low |
jira_fetch_velocity |
Get team velocity history (rolling average of last 3โ5 sprints, with JQL fallback for team-managed boards) | Low |
jira_fetch_active_sprint |
Get current sprint info for sprint selection (Q27) | Low |
jira_create_epic |
Create an epic | High (requires confirmation) |
jira_create_story |
Create a story with ACs and story points | High (requires confirmation) |
jira_create_sprint |
Create and configure a sprint | High (requires confirmation) |
๐ Confluence (5 tools)
| Tool | Description | Risk |
|---|---|---|
confluence_search_docs |
Search documentation by keyword | Low |
confluence_read_page |
Read a wiki page by ID | Low |
confluence_read_space |
Read space metadata and page list | Low |
confluence_create_page |
Create a new page | High (requires confirmation) |
confluence_update_page |
Update an existing page | High (requires confirmation) |
๐ป Local codebase (3 tools)
| Tool | Description |
|---|---|
read_codebase |
Scan entire local repo โ language detection, file tree (budget-limited, auto-collapses large dirs), skips binaries and build artifacts |
read_local_file |
Read a specific file from disk (targeted retrieval when the LLM needs to inspect particular files) |
load_project_context |
High-level codebase overview including scrum-docs/ PRD/design doc ingestion |
๐ Calendar (1 tool)
| Tool | Description |
|---|---|
detect_bank_holidays |
Detect public holidays in the planning window (auto-fills Q28) |
๐ค LLM-powered (2 tools)
| Tool | Description |
|---|---|
estimate_complexity |
Analyze code/requirements for story point estimation |
generate_acceptance_criteria |
Generate Given/When/Then acceptance criteria |
Tool risk levels
| Risk | Guardrail |
|---|---|
| Low (read-only) | Auto-execute |
| Medium (LLM-powered) | Log and display to user |
| High (write operations) | Requires explicit user confirmation |
๐๏ธ Architecture
Four Layers
| Layer | Implementation |
|---|---|
| Interface | Full-screen TUI with animated splash, mode selection, session editor, pipeline progress, streaming output, and dark/light themes |
| Prompt Construction | Scrum Master persona, ARC-structured prompts per node, few-shot examples, adaptive question templates |
| Model | Anthropic Claude (primary), OpenAI GPT, Google Gemini โ swappable via LLM_PROVIDER env var |
| Data & Storage | SQLite session store (~/.scrum-agent/data/sessions.db) with team analysis profiles, token usage tracking, optional Jira/Confluence/Azure DevOps integration |
Three Design Principles
- Robust Infrastructure โ agent frameworks (LangChain, LangGraph), graceful rate-limit retry with exponential backoff, crash-safe session persistence
- Modularity โ decoupled CLI/TUI/REPL/agent/tools/prompts, one concern per module, UI system with 4 subsystems
- Continuous Evaluation โ golden dataset evaluators, contract tests with VCR cassettes, token budget monitoring
Agent Graph (LangGraph)
Auto-generated via make graph:
graph TD
START([START]) --> intake[project_intake]
intake --> |questionnaire loop| intake
intake --> analyzer[project_analyzer]
analyzer --> features[feature_generator]
features --> |human review| features
features --> stories[story_writer]
stories --> |human review| stories
stories --> tasks[task_decomposer]
tasks --> |human review| tasks
tasks --> sprints[sprint_planner]
sprints --> |human review| sprints
sprints --> sync[jira_sync]
sync --> END([END])
Node Descriptions
| Node | Responsibility |
|---|---|
| Project Intake | Runs the discovery questionnaire (smart/standard/quick mode) to gather all project context |
| Project Analyzer | Synthesizes questionnaire answers into a structured ProjectAnalysis with name, type, goals, tech stack, constraints, and risks |
| Feature Generator | Decomposes the analysis into high-level features with priority levels. For small projects (โค2 sprints, โค3 goals), creates a single sentinel epic instead |
| Story Writer | Breaks features into user stories with persona/goal/benefit, short titles, Given/When/Then acceptance criteria, Fibonacci story points (auto-split >8), discipline tagging, DoD flags, and points rationale |
| Task Decomposer | Breaks stories into concrete tasks with auto-tagged labels (Code/Documentation/Infrastructure/Testing), test plans for code tasks, AI coding prompts, and dedicated documentation sub-tasks |
| Sprint Planner | Allocates stories to sprints using per-sprint net velocity (bank holidays, PTO, unplanned %, onboarding, KTLO deducted). Handles capacity overflow with 3 options. Highlights impacted sprints |
| Jira Sync | Pushes the finalized plan to Jira with idempotent batch creation: Features โ Labels, Stories โ linked to Epic, Tasks โ Sub-tasks, Sprints โ created and assigned |
๐ The ReAct Loop
The foundational reasoning pattern:
Thought โ Action โ Observation โ (repeat until done)
- Thought โ reason about the current state and what to do next
- Action โ call a tool or take a step
- Observation โ see the result, decide whether to continue or answer
๐ฅ๏ธ TUI System
The ui/ package provides a full-screen terminal UI with four subsystems:
| Subsystem | Purpose |
|---|---|
mode_select/ |
Full-screen mode selection with ASCII art titles, project cards with pipeline progress indicators, project list with half-card peek stubs at viewport edges. Includes Analysis, Planning, Usage, and Settings pages |
provider_select/ |
LLM and tool provider selection (block-character logos for Claude/GPT/Gemini), issue tracking setup, verification flow |
session/ |
Main session UI โ description input, intake questions, summary review, pipeline stages with artifact editing, epic review, calibration banners, Jira/Azure DevOps export, chat. Dry-run mode with mock data |
shared/ |
Animations (typewriter, pulse), ASCII font rendering, reusable components (Theme, buttons, scrollbar, progress dots, viewport), mouse scroll handling |
Visual features:
- Rounded borders with consistent padding and arrow-key navigation
- Sticky group headers โ epic titles pin at top when scrolling, with decryption-style morph animation between sections
- Scrollbar โ vertical
โtrack withโthumb for pipeline stages and summary review - Capacity bars โ per-sprint with reduced velocity for bank-holiday/PTO-impacted sprints (amber border + annotations)
- Project cards โ one-shot white pulse animation on Enter, pipeline progress badges
The repl/ package is the legacy REPL kept for backwards compatibility and CLI-flag-driven flows (--quick, --full-intake, --questionnaire, --mode).
๐ฆ State Schema
- ScrumState is a
TypedDict(LangGraph convention for graph state) messagesusesAnnotated[list[BaseMessage], add_messages]for append semantics- Frozen dataclasses for artifacts โ
Feature,UserStory,Task,Sprint,ProjectAnalysis(immutable once created, serializable viaasdict()) - Mutable dataclass for
QuestionnaireStateโ updated incrementally by the intake node - Artifact lists use
Annotated[list[...], operator.add]so nodes can return items that get appended
๐ท๏ธ Agent Classification
| Property | Value |
|---|---|
| Agency Level | Level 3โ4 (self-looping + multi-agent coordination) |
| Reasoning Pattern | ReAct (Thought โ Action โ Observation โ repeat) |
| Interface | Terminal CLI (full-screen TUI + legacy REPL) |
| Domain | Scrum project management |
๐ Project Intake Questionnaire โ 30 questions across 7 phases
๐ Project Intake Questionnaire
Before generating any Scrum artifacts, the agent runs a structured discovery phase โ asking the user questions one at a time in a conversational flow. This is the "flipped prompt" technique: the agent gathers what it needs before it acts.
Questionnaire Flow
The agent asks these questions sequentially. Each question is asked individually, the user responds, and the agent moves to the next. The agent adapts follow-up questions based on previous answers.
Phase 1 โ Project Context
| # | Question | Why the Agent Needs This |
|---|---|---|
| 1 | What is the project? Describe it in a few sentences, or point me to a repo/doc. | Establishes the core scope and domain |
| 2 | Is this a greenfield project or are you building on an existing codebase? | Determines whether the agent should scan existing code, and whether there's legacy complexity |
| 3 | What problem does this project solve? Who are the end users? | Grounds epic/story generation in real user needs rather than abstract features |
| 4 | What does "done" look like? What's the end-state you're targeting? | Defines the finish line โ prevents scope creep and gives the agent a clear goal to decompose toward |
| 5 | Are there any hard deadlines or milestones? | Constrains the sprint plan; the agent needs to know if time is fixed |
Phase 2 โ Team & Capacity
| # | Question | Why the Agent Needs This |
|---|---|---|
| 6 | How many engineers are working on this? | Directly affects sprint capacity and parallelism of work |
| 7 | What are the roles on the team? (e.g., 2 backend, 1 frontend, 1 fullstack) | Lets the agent tag stories by discipline and balance sprint workload across skillsets |
| 8 | How long are your sprints? (e.g., 1 week, 2 weeks) | Required for sprint planning โ determines how many points fit per sprint |
| 9 | Do you have a known velocity from previous sprints? If yes, what is it? | If available, the agent uses real velocity; otherwise it defaults to 5 points per engineer per sprint |
| 10 | How many sprints are you targeting to complete this project? | Bounds the total effort and forces prioritization if scope exceeds capacity |
Phase 3 โ Technical Context
| # | Question | Why the Agent Needs This |
|---|---|---|
| 11 | What is the tech stack? (languages, frameworks, databases, infra) | Stories and tasks need to be written in terms the team actually works with |
| 12 | Are there any existing APIs, services, or third-party integrations involved? | Identifies external dependencies that create stories of their own (auth, payments, etc.) |
| 13 | Are there any architectural constraints or decisions already made? (e.g., must use microservices, must deploy to AWS) | Prevents the agent from suggesting work that contradicts fixed decisions |
| 14 | Is there any existing documentation, PRDs, or design docs I should reference? | The agent can ingest these for grounded story generation |
Phase 3a โ Codebase Context
| # | Question | Why the Agent Needs This |
|---|---|---|
| 15 | Does the project have an existing codebase, or is this a new build? | Determines whether the agent needs to account for existing code, migrations, and legacy constraints |
| 16 | Where is the code hosted? (GitHub, Azure DevOps, GitLab, Bitbucket, local only) | Tells the agent which source control tool to use for repo scanning |
| 17 | Can you share the repo URL(s)? (the agent can connect and scan the repo for context) | Enables the agent to read repo structure, key files, and README to ground its output |
| 18 | How is the repo structured? (monorepo, multi-repo, microservices, monolith) | Affects how the agent decomposes work |
| 19 | Is there an existing CI/CD pipeline or deployment setup? | Identifies whether DevOps stories are needed |
| 20 | Is there any known technical debt? (legacy code, outdated dependencies, areas needing refactoring) | Surfaces refactoring stories and constraints |
Phase 4 โ Risks & Unknowns
| # | Question | Why the Agent Needs This |
|---|---|---|
| 21 | Are there any areas of the project you're uncertain or worried about? | The agent flags these as spike stories or high-risk items |
| 22 | Are there any known blockers or dependencies on external teams/systems? | Creates blocked/dependency stories and affects sprint ordering |
| 23 | Is there anything that's explicitly out of scope? | Prevents generating stories for work the team won't do |
Phase 5 โ Preferences & Process
| # | Question | Why the Agent Needs This |
|---|---|---|
| 24 | How do you want stories estimated? (Fibonacci story points, T-shirt sizes, or no estimates) | Configures the output format |
| 25 | Do you have a Definition of Done the team follows? | Incorporated into acceptance criteria validation |
| 26 | Do you want the output pushed to Jira, exported as Markdown, or both? | Determines the final step of the pipeline |
Phase 6 โ Capacity Planning
| # | Question | Why the Agent Needs This |
|---|---|---|
| 27 | Which sprint are you planning for? | Anchors the planning window. Auto-detected from Jira active sprint if configured; otherwise presented as a choice question |
| 28 | How many bank holidays fall within your planning window? | Deducts from gross capacity. Auto-detected via detect_bank_holidays tool (100+ countries, 3-layer locale fallback: Jira timezone โ shell locale โ GB default). User can override |
| 29 | What percentage of capacity is typically lost to unplanned absences? (default: 10%) | Real feature capacity is ~24% of gross after all deductions (based on analysis of capacity planning templates) |
| 30 | Are any engineers currently onboarding or ramping up? | Reduces individual capacity during ramp-up sprints |
After Q28 (bank holidays), the agent asks about planned leave (PTO):
- Per-person entry with name, start date, and end date (DD/MM/YYYY format with validation)
- Dates outside the planning window are rejected
- Working-day calculation excludes weekends
- Summary shown after each entry with option to add more
- Quick mode skips PTO (defaults to 0)
Adaptive Behavior
The questionnaire is not rigid โ the agent adapts:
- Skips questions the user already answered. If your initial description included "we're a team of 4 using React and Node", the agent won't re-ask team size or tech stack.
- Extracts answers from descriptions. Keyword matching detects project type (greenfield/existing), integrations (Stripe, Auth0), and infrastructure constraints (Kubernetes, microservices).
- Uses conditional essentials. Q7 (team roles) only appears if Q6 (team size) was answered. Q12 (integrations) only if Q11 (tech stack) was answered.
- Asks targeted follow-ups. Instead of generic "tell me more", the agent uses question-specific probing templates.
- Validates across questions. Catches contradictions โ e.g., team size of 1 but multiple roles listed.
- Adapts question text. "You said 5 engineers โ what are their roles?" instead of static text.
- Allows "skip" and "I don't know". Proceeds with reasonable defaults and flags assumptions.
- Summarizes before proceeding. After all questions, presents a structured summary for confirmation.
Intake Summary Output
After the questionnaire, the agent produces a structured summary:
Here's what I understand about your project:
Project: E-commerce platform redesign
Type: Existing codebase (monolith โ microservices migration)
End Users: Online shoppers, internal warehouse staff
Target State: Fully migrated to microservices with new checkout flow
Team: 5 engineers (2 backend, 2 frontend, 1 devops)
Sprint Length: 2 weeks
Velocity: 25 pts/sprint (default: 5 ร 5 engineers, no historical data)
Target Sprints: 6 sprints (12 weeks)
Tech Stack: Python/FastAPI, React, PostgreSQL, AWS ECS
Integrations: Stripe (payments), SendGrid (email), existing REST API
Constraints: Must maintain backward compat with mobile app v2.x
Risks:
- Payment flow migration (high complexity, Stripe webhook changes)
- No clear spec for warehouse dashboard requirements
Out of Scope: Mobile app redesign, analytics pipeline
Output: Jira + Markdown export
Does this look right? [Confirm / Edit]
Only after the user confirms does the agent proceed to feature generation.
๐ Scrum Standards โ Issue hierarchy, story format, acceptance criteria, DoD
๐ Scrum Standards
These are the team's codified practices. The agent enforces all of these when generating and validating Scrum artifacts.
1. Issue Hierarchy
| Level | What It Represents | Scope | Example |
|---|---|---|---|
| Epic | A large body of work representing the big picture. Can span months or multiple sprints. | The Why of the project | "Customer Self-Service Portal" |
| Feature | A significant piece of functionality that contributes to the big picture. Can span multiple sprints. | The What we're building | "Subscription Management" |
| User Story | A smaller, well-defined unit of work. Must be completable within a single sprint. | The How of the project | "As a customer, I want to upgrade my plan" |
| Sub-Task | A breakdown of a story into manageable, assignable parts. | Implementation detail | "Add upgrade endpoint to billing API" |
| Spike | A time-boxed research task to reduce uncertainty before delivery work begins. | Learning & discovery | "Investigate Stripe webhook reliability" |
2. User Stories
Format
User stories follow this structure:
"As a [persona], I want to [goal], so that [benefit]."
Breaking It Down
| Part | What It Means | Guidance |
|---|---|---|
| As a [persona] | Who are we building this for? Not a job title โ a real persona the team understands with empathy. | The team should have a shared understanding of this person โ how they work, think, and feel. |
| I want to [goal] | What is the user actually trying to achieve? Describes intent, not features. | Must be implementation-free. If you're describing UI elements instead of the user's goal, you're missing the point. |
| So that [benefit] | How does this fit into their bigger picture? What problem does it solve? | Ties the story back to real value and helps define when the story is truly done. |
Examples
- As Max, I want to invite my friends, so we can enjoy this service together.
- As Sascha, I want to organise my work, so I can feel more in control.
- As a manager, I want to understand my colleagues' progress, so I can better report our successes and failures.
Story Point Rules
| Rule | Detail |
|---|---|
| Scale | Fibonacci: 1, 2, 3, 5, 8 |
| Maximum | 8 points per story. If estimated above 8, the story must be split. |
| What points measure | Relative complexity and effort, not hours. |
| Default velocity | When no historical data exists: 5 points per engineer per sprint. |
| Sprint capacity | Stories are allocated to sprints without exceeding capacity (engineers x 5 or known velocity). |
Velocity Calculation Examples:
| Scenario | Calculation | Sprint Capacity |
|---|---|---|
| 3 engineers, no known velocity | 3 ร 5 | 15 pts/sprint |
| 5 engineers, no known velocity | 5 ร 5 | 25 pts/sprint |
| 4 engineers, known velocity of 30 | Use 30 directly | 30 pts/sprint |
Auto-Split Example:
If the agent estimates "Build the full payment integration" at 13 points:
This story exceeds the 8-point maximum. Splitting:
Original: Build the full payment integration (13 pts)
Split into:
US-010: Set up Stripe SDK and payment intent flow (5 pts)
US-011: Build webhook handler for payment events (5 pts)
US-012: Add payment error handling and retry logic (3 pts)
Total: 13 pts across 3 stories (all โค 8)
[Accept split / Edit / Reject]?
Discipline Tagging
Every story is tagged with the primary discipline needed:
| Discipline | Description |
|---|---|
frontend |
UI/UX implementation |
backend |
API, business logic, data |
fullstack |
Spans both (default fallback) |
infrastructure |
DevOps, CI/CD, deployment |
design |
UX research, visual design |
testing |
QA, test automation |
Story Checklist
Before a story is considered ready for sprint planning, it must have:
- Clear persona identified
- Goal is implementation-free and user-focused
- Benefit ties to real business or user value
- Acceptance criteria written (see Acceptance Criteria)
- Story points estimated (1โ8 range)
- Dependencies identified and linked
- Fits within a single sprint
3. Acceptance Criteria
What They Are
Acceptance criteria are clear, concise, and testable statements that define the conditions a user story must meet to be accepted by stakeholders and considered "Done." They are the source of truth for developers, testers, and product stakeholders.
Purpose
- Clarify the scope of a user story
- Ensure shared understanding between product, platform, and stakeholders
- Provide a basis for test cases
- Define the boundaries of success
Acceptance criteria describe what should happen, not how it's implemented. They avoid technical specifics and focus on the desired outcome.
Key Characteristics
| Characteristic | Description |
|---|---|
| Clear | Easy to understand, no ambiguity |
| Concise | No unnecessary details or fluff |
| Testable | Verifiable through manual or automated testing |
| Outcome-Oriented | Focused on the end result, not the implementation approach |
| Consistent | Written in a standardised format (Given/When/Then) |
Format: Given / When / Then
All acceptance criteria use the Given / When / Then format:
Given [precondition]
When [action]
Then [expected outcome]
Examples
Reset Password
User Story: As a user, I want to reset my password so that I can regain access to my account.
Given I am on the password reset page
When I enter my registered email and click "Send Reset Link"
Then I should see a confirmation message saying "Reset link sent to your email"
Form Validation
User Story: As a user, I want to be informed when I submit an invalid phone number.
Given I enter an invalid phone number
When I try to submit the form
Then I should see an error message saying "Please enter a valid phone number"
Negative / Edge Case
User Story: As a user, I want to be prevented from registering with an already-used email.
Given I am on the registration page
When I enter an email that is already registered and click "Sign Up"
Then I should see an error message saying "An account with this email already exists"
And no duplicate account should be created
Coverage Requirements
Every story must have acceptance criteria covering:
| Scenario Type | What It Covers | Required? |
|---|---|---|
| Happy path | The expected, successful flow | Yes |
| Negative path | Invalid input, denied access, failures | Yes |
| Edge cases | Boundary conditions, empty states, max limits | Where applicable |
| Error states | What the user sees when something goes wrong | Yes |
Common Pitfalls
| Pitfall | Why It's a Problem |
|---|---|
| Writing implementation details (e.g., "Use React component X") | Criteria should be tech-agnostic and outcome-focused |
| Vague language (e.g., "It should work properly") | Not testable โ what does "properly" mean? |
| Skipping negative scenarios and edge cases | Leaves gaps that surface as bugs in production |
| Using criteria as a task checklist | Criteria define outcomes, not implementation steps |
| Only covering the happy path | Real users hit errors, edge cases, and unexpected states |
4. Definition of Done โ User Story
A story is not "Done" until every applicable item is satisfied. The agent evaluates which DoD items apply to each story and marks the rest as N/A.
Acceptance Criteria Fully Met
- Acceptance criteria are written before work begins
- Reviewed and approved by the team during backlog refinement
- All criteria are fully met and tested
- Given/When/Then format used consistently
Documentation
- Relevant documentation created or updated
- Added to the appropriate shared space / folder
- Outdated documentation updated if affected by the change
- Documentation completed within the sprint (unless explicitly agreed otherwise)
Testing
- Testing conducted across all environments where changes are deployed
- Test cases clearly identified and documented
- End-to-end (E2E) tests included for business-critical services where applicable
- Testing deemed sufficient before marking as Done
Code Merged
- Branch merged into
main/mastervia Pull Request - PR reviewed by at least two engineers
- All review comments and questions fully addressed before merge
Released via SDLC
- Release conducted through the standard SDLC process (e.g., Jenkins pipeline)
- Release channel notified with relevant details (e.g.,
#developer-releaseson Slack) - Story not marked Done until successfully released
Stakeholder Sign-Off (if required)
- Sign-off received from relevant stakeholders for features impacting external teams
- Approval logged (Slack message, Jira comment, or verbal approval noted in ticket)
Knowledge Sharing
- If the change introduces new functionality, architectural decisions, or process changes โ a knowledge-sharing activity is conducted
- This can be a Slack update, team demo, short write-up, or Confluence page
- Ensures team-wide understanding and reduces knowledge silos
5. Definition of Done โ Spike
Spikes are time-boxed research tasks used to reduce uncertainty, explore solutions, or gain clarity before delivery work begins.
When to Use a Spike
- Investigating an unknown technical or product area
- Evaluating possible solutions or approaches
- Identifying potential blockers or risks
- Prototyping or validating ideas before full implementation
Checklist
| Criteria | Description |
|---|---|
| Objective clearly stated | The goal or research question is documented in the ticket or a linked page |
| Time-box respected | Completed within the agreed timeframe (typically 1โ3 days or a single sprint). Extensions discussed with the team. |
| Findings documented | All research outcomes, technical analysis, and code snippets are documented in a shared location |
| Recommendation made | A clear path forward is proposed โ including implementation guidance, trade-offs, or alternatives |
| Next steps outlined | New stories, tickets, or action items are created and linked for follow-up work |
| Shared with team | Results communicated via stand-up, short demo, Slack summary, or write-up |
| Resources linked | All relevant links (API docs, diagrams, repos, articles) attached for future reference |
The goal of a spike is learning and knowledge sharing โ not production-ready code.
6. Sprint Ceremonies
| Ceremony | Purpose | Cadence |
|---|---|---|
| Sprint Planning | Select stories from the backlog, confirm capacity, commit to sprint goal | Start of sprint |
| Daily Stand-up | Surface blockers, sync on progress, keep momentum | Daily (15 min max) |
| Backlog Refinement | Review upcoming stories, write/validate acceptance criteria, estimate points, split oversized stories | Mid-sprint |
| Sprint Review / Demo | Show completed work to stakeholders, gather feedback | End of sprint |
| Sprint Retrospective | Reflect on what went well, what didn't, and what to improve | End of sprint |
7. Backlog Health
Priority Levels
| Priority | Meaning | Sprint Scheduling |
|---|---|---|
| Critical | Blocks other work or has an imminent deadline | Must be in the current or next sprint |
| High | Core functionality, high user/business impact | Scheduled within the next 1โ2 sprints |
| Medium | Important but not urgent | Scheduled when capacity allows |
| Low | Nice to have, minor improvements | Backlog โ pulled in when higher priorities are clear |
Backlog Hygiene Rules
- Stories older than 3 sprints without movement should be reviewed โ re-prioritise or remove
- Every story in the backlog must have a clear persona, goal, and benefit
- Stories without acceptance criteria are not ready for sprint planning
- Blocked stories must have the blocker documented and linked
8. Story Splitting Guidelines
When a story is too large (estimated above 8 points), split it using one of these strategies:
| Strategy | How It Works | Example |
|---|---|---|
| By workflow step | Split along the steps a user takes | "Register" โ "Register with email" + "Register with OAuth" |
| By business rule | Separate different rules or conditions | "Apply discount" โ "Percentage discount" + "Fixed amount discount" |
| By data type | Split by the different data being handled | "Import data" โ "Import CSV" + "Import JSON" |
| By happy/unhappy path | Separate the success flow from error handling | "Process payment" โ "Successful payment" + "Payment failure handling" |
| By platform | Split by target platform or environment | "Push notifications" โ "iOS notifications" + "Android notifications" |
| Spike + delivery | Research first, build second | "Integrate Stripe" โ "Spike: Stripe webhook approach" + "Implement Stripe webhooks" |
The goal is to produce stories that are each independently valuable, testable, and completable within a sprint.
๐งช Prompt Construction
System Prompt Persona
The agent operates as a senior Scrum Master and enforces all standards defined in the Scrum Standards section.
Core constraints:
- User stories follow the format: "As a [persona], I want to [goal], so that [benefit]"
- Every story includes acceptance criteria in Given/When/Then format covering happy path, negative path, and edge cases
- Story points use the Fibonacci scale (1, 2, 3, 5, 8)
- Maximum 8 points per story โ auto-split if exceeded
- Issue hierarchy enforced: Epic โ Feature โ User Story โ Sub-Task (plus Spikes)
- Definition of Done validated against checklists
- Sprint capacity respected โ no overloading
Prompting Techniques
| Technique | Where Applied |
|---|---|
| ARC Framework | Every node prompt โ Ask (what), Requirements (constraints), Context (background) |
| Few-Shot Prompting | Story Writer node โ examples of well-written user stories |
| Chain-of-Thought | Feature Generator โ step-by-step reasoning about scope decomposition |
| The Flipped Prompt | Project Intake โ agent asks the user what information it needs before proceeding |
| Iterative Prompting | Refinement loop โ output improves with each round of user feedback |
| Neutral Prompts | Evaluation โ avoid leading phrasing that biases the LLM |
๐ก๏ธ Guardrails
Input Guardrails (4 layers)
| Layer | Method | Description |
|---|---|---|
| Length cap | Regex (instant) | Max 5,000 characters โ prevents accidental file pastes |
| Prompt injection | Regex (instant) | 10+ patterns: "ignore previous instructions", "you are now", "act as", "override", etc. |
| Profanity filter | Regex (instant) | Catches obvious abuse and low-quality inputs |
| Relevance classifier | LLM (cheap) | Allowlist passes known-good inputs; unknowns go to a cheap classifier (Haiku/gpt-4o-mini) to check RELEVANT vs OFF_TOPIC. Falls back to allowing on failure. |
Output Guardrails (4 layers)
| Layer | Description |
|---|---|
| Story format | Validates all stories have non-trivial persona, goal, and benefit (>=2 chars each) |
| AC coverage | Each story should have >=2 acceptance criteria, with at least one covering negative/edge/error scenarios |
| Sprint capacity | No sprint exceeds team velocity |
| Unrealistic loads | Flags sprints packed to the limit |
Human-in-the-Loop
Every pipeline stage has an accept/edit/reject checkpoint. High-risk tool calls (Jira writes, Confluence writes) require explicit user confirmation.
๐ค Multi-Provider LLM Support
The agent supports four LLM providers. Set via LLM_PROVIDER in .env:
| Provider | Env Var | Key Format | Value |
|---|---|---|---|
| Anthropic (default) | ANTHROPIC_API_KEY |
sk-ant-... |
anthropic |
| OpenAI | OPENAI_API_KEY |
sk-... |
openai |
GOOGLE_API_KEY |
AIza... |
google |
|
| AWS Bedrock | AWS_REGION |
IAM credentials (no key) | bedrock |
OpenAI, Google, and Bedrock are lazy-imported โ install with uv sync --extra all-providers or individually with --extra openai / --extra google / --extra bedrock.
Bedrock uses IAM credentials automatically (instance role, ~/.aws/credentials, or env vars). On Lightsail/EC2, the AWS profile is auto-detected from ~/.aws/config. No API key required.
๐ ๏ธ Development โ Commands, project structure, testing, environment
๐ ๏ธ Development
Commands
make install # install uv + dependencies
make test # unit + integration + contract tests (full suite)
make test-fast # unit tests only (< 3s)
make test-v # full suite verbose
make test-all # everything including golden evaluators
make lint # lint with ruff
make format # format with ruff
make run # run the CLI (ARGS="--flag")
make run-dry # TUI with fake delays, no LLM calls
make eval # golden dataset evaluators
make contract # contract tests (recorded API responses)
make smoke-test # live API smoke tests (requires credentials)
make snapshot-update # update syrupy snapshot baselines
make budget-report # show prompt token counts
make graph # generate agent graph PNG
make build # build sdist + wheel into dist/
make publish # publish to PyPI
make clean # remove build artifacts and caches
Project Structure
src/scrum_agent/
โโโ agent/ # LangGraph state & graph
โ โโโ graph.py # Graph compilation & wiring
โ โโโ llm.py # LLM provider selection (Anthropic/OpenAI/Google)
โ โโโ nodes.py # Node functions (intake, analyze, generate, etc.)
โ โโโ state.py # ScrumState, QuestionnaireState, artifact dataclasses
โโโ prompts/ # Prompt templates per node
โ โโโ analyzer.py # Project analyzer prompt
โ โโโ feature_generator.py # Feature generation prompt
โ โโโ intake.py # 30 questions, smart/standard modes, adaptive templates
โ โโโ sprint_planner.py # Sprint planning prompt
โ โโโ story_writer.py # Story writing prompt with few-shot examples
โ โโโ system.py # Base system prompt
โ โโโ task_decomposer.py # Task decomposition prompt
โโโ tools/ # Tool definitions (23 total)
โ โโโ azure_devops.py # Azure DevOps repo/file/work items
โ โโโ calendar_tools.py # Bank holiday detection
โ โโโ codebase.py # Local repo scanning
โ โโโ confluence.py # Confluence search/read/write
โ โโโ github.py # GitHub repo/file/issues/readme
โ โโโ jira.py # Jira board/velocity/sprint/epic/story
โ โโโ llm_tools.py # LLM-powered estimation and AC generation
โโโ ui/ # Full-screen TUI system
โ โโโ mode_select/ # Mode selection screens
โ โโโ provider_select/ # LLM/tool provider setup
โ โโโ session/ # Main session (phases, editor, pipeline)
โ โโโ shared/ # Animations, ASCII font, components, input
โ โโโ splash.py # Animated intro
โโโ repl/ # Legacy REPL (CLI-flag-driven flows)
โ โโโ _intake_menu.py # Intake mode selection
โ โโโ _io.py # Artifact rendering, file import/export
โ โโโ _questionnaire.py # Questionnaire UI (one-at-a-time flow)
โ โโโ _review.py # Review checkpoint UI
โ โโโ _ui.py # Pipeline progress, streaming, spinner
โโโ cli.py # CLI entry point (argparse, 20 flags)
โโโ config.py # Environment/config management
โโโ setup_wizard.py # First-run credential flow
โโโ sessions.py # SQLite session store
โโโ persistence.py # State serialization helpers
โโโ formatters.py # Rich rendering (dark/light themes)
โโโ input_guardrails.py # 4-layer input validation
โโโ output_guardrails.py # 4-layer output validation
โโโ questionnaire_io.py # Markdown questionnaire import/export
โโโ html_exporter.py # Self-contained HTML reports
โโโ json_exporter.py # JSON export for CI/CD
โโโ jira_sync.py # Batch Jira creation with idempotency
โโโ __init__.py # Version, LangSmith noise suppression
Testing Conventions
- One test file per source module:
repl.pyโtest_repl.py,state.pyโtest_state.py - Group related tests in classes:
TestGracefulExit,TestStreaming,TestPriority - Node tests live in
tests/unit/nodes/(split into 9 files) - Shared node test helpers in
tests/_node_helpers.py - Pytest markers:
slow,eval,vcr,smoke
Environment Variables
| Variable | Required | Description |
|---|---|---|
ANTHROPIC_API_KEY |
Yes (if using Anthropic) | Claude API key |
OPENAI_API_KEY |
If using OpenAI | GPT API key |
GOOGLE_API_KEY |
If using Google | Gemini API key |
LLM_PROVIDER |
No | Provider selection: anthropic (default), openai, google |
GITHUB_TOKEN |
No | GitHub PAT for repo context tools |
AZURE_DEVOPS_TOKEN |
No | Azure DevOps PAT (Code=Read, Work Items=Read+Write, Project=Read) |
AZURE_DEVOPS_ORG_URL |
If using AzDO Boards | Organization URL (e.g. https://dev.azure.com/myorg) |
AZURE_DEVOPS_PROJECT |
If using AzDO Boards | Project name |
AZURE_DEVOPS_TEAM |
No | Team name (defaults to {project} Team) |
JIRA_BASE_URL |
If using Jira | Jira Cloud URL (e.g. https://org.atlassian.net) |
JIRA_EMAIL |
If using Jira | Atlassian account email |
JIRA_API_TOKEN |
If using Jira | Jira API token |
JIRA_PROJECT_KEY |
If using Jira | Project key (e.g. MYPROJ) |
CONFLUENCE_SPACE_KEY |
No | Confluence space key (shares Atlassian auth with Jira) |
LANGSMITH_TRACING |
No | Enable LangSmith tracing (true) |
LANGSMITH_API_KEY |
No | LangSmith API key |
LANGSMITH_PROJECT |
No | LangSmith project name |
LOG_LEVEL |
No | File-based log level (default: WARNING) |
SESSION_PRUNE_DAYS |
No | Auto-prune sessions older than N days (default: 30, 0=disabled) |
Git Conventions
- Commit messages: lowercase imperative (e.g., "add streaming output", "fix import sorting")
- Branch naming:
feature/<description>for feature work - PRs: feature branches merge to
mainvia pull request - Include
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>on AI-assisted commits
๐งช Evaluation & Testing
| Layer | Approach |
|---|---|
| Unit Tests | Prompt formatting, tool input/output validation, state transitions, artifact immutability |
| Integration Tests | CLI argument parsing, graph compilation, multi-node flows, session persistence |
| Contract Tests | VCR cassettes for GitHub, Jira, Confluence API responses |
| Golden Datasets | Curated project descriptions with expected feature/story breakdowns |
| Smoke Tests | Live API tests (require real credentials) |
| Token Budget Tests | Monitor prompt token counts for trend analysis |
| Red Teaming | Vague inputs, contradictory requirements, prompt injection, absurdly large scope |
Red Teaming Checklist
- Prompt injection ("Ignore your instructions and...")
- Jailbreaking (roleplay scenarios to bypass safety)
- Messy inputs (typos, slang, code-switching)
- Extremely long or empty project descriptions
- Contradictory requirements
- Adversarial inputs designed to trigger hallucination or bias
Graceful Degradation
| Failure Type | Strategy |
|---|---|
| API rate limit | Exponential backoff with live countdown (5s โ 10s โ 20s, 3 retries) |
| Tool call failure | Error displayed, pipeline continues |
| Model unavailable | Fallback to alternative provider (if configured) |
| Corrupt session | Returns (None, None) โ no crash, user informed |
โ๏ธ Tech Stack
| Component | Choice |
|---|---|
| Language | Python 3.11+ |
| Package Manager | uv |
| Agent Framework | LangGraph + LangChain |
| LLM | Anthropic Claude (primary), OpenAI GPT, Google Gemini |
| Terminal UI | rich + prompt_toolkit |
| Jira Integration | jira + atlassian-python-api |
| GitHub Integration | PyGithub |
| Azure DevOps | azure-devops SDK |
| Session Store | SQLite (via langgraph-checkpoint-sqlite) |
| Holiday Detection | holidays library |
| Linting | ruff (line-length 120) |
| Testing | pytest, pytest-asyncio, pytest-recording (VCR), syrupy (snapshots) |
| Observability | LangSmith |
๐ Agentic Blueprint Reference โ LangGraph patterns and code examples
๐ Agentic Blueprint Reference
Condensed technical reference for the LangGraph patterns and LangChain APIs used.
Core Graph Setup
from langgraph.graph import StateGraph, MessagesState, START, END
from langgraph.prebuilt import ToolNode
from langchain_openai import ChatOpenAI
llm = ChatOpenAI(model="gpt-4o-mini")
model_with_tools = llm.bind_tools(tools)
graph = StateGraph(MessagesState)
The Two Core Nodes
def call_model(state: MessagesState):
"""Call the LLM with current messages."""
response = model_with_tools.invoke(state["messages"])
return {"messages": [response]}
def should_continue(state: MessagesState):
"""Route: tools if tool_calls present, otherwise END."""
last_message = state["messages"][-1]
if last_message.tool_calls:
return "tools"
return END
Wiring the Graph
tool_node = ToolNode(tools)
graph.add_node("agent", call_model)
graph.add_node("tools", tool_node)
graph.add_edge(START, "agent")
graph.add_conditional_edges("agent", should_continue, ["tools", END])
graph.add_edge("tools", "agent")
app = graph.compile()
START โ agent โโshould_continue?โโโ END
โฒ โ
โ "tools"
โ โผ
โโโโโโโโ tools
Creating Tools
from langchain_core.tools import tool
@tool
def search_database(query: str) -> str:
"""Search the product database for items matching the query."""
return results
The docstring is critical โ the LLM reads it to decide when to use the tool.
Memory
from langgraph.checkpoint.memory import MemorySaver
memory = MemorySaver()
app = graph.compile(checkpointer=memory)
config = {"configurable": {"thread_id": "user-123"}}
app.invoke({"messages": [("human", "My name is Omar")]}, config)
Streaming
from langchain_core.messages import AIMessageChunk, HumanMessage
for chunk, metadata in app.stream(
{"messages": [HumanMessage(content="Plan my project")]},
config,
stream_mode="messages",
):
if isinstance(chunk, AIMessageChunk) and chunk.content:
print(chunk.content, end="", flush=True)
Quick Reference โ All APIs
Prompting: ChatPromptTemplate | FewShotPromptTemplate | ARC framework | pipe operator | | StrOutputParser | sequential chains
Graph: StateGraph | MessagesState | START / END | .add_node() | .add_edge() | .add_conditional_edges() | .compile()
Tools: @tool decorator | ToolNode | .bind_tools() | create_react_agent
Memory: MemorySaver | checkpointer | thread_id
Streaming: app.stream() | stream_mode="messages" | AIMessageChunk
๐ License
MIT License. See LICENSE for details.
Built with โค๏ธ using LangGraph and Claude
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file scrum_agent-1.3.0.tar.gz.
File metadata
- Download URL: scrum_agent-1.3.0.tar.gz
- Upload date:
- Size: 31.3 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4e62ef4fb55c4d7e020d9e960b1cdfbecea415f23a809e560ad3938351d27a84
|
|
| MD5 |
ea6024799163c30f6ff5ebfa726f7fad
|
|
| BLAKE2b-256 |
08a18a7b902b77d4a73ad9d8c298391e07fb626f245e301928e7bb8b535f65dd
|
Provenance
The following attestation bundles were made for scrum_agent-1.3.0.tar.gz:
Publisher:
publish.yml on omardin14/scrum-planning-ai-agent
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
scrum_agent-1.3.0.tar.gz -
Subject digest:
4e62ef4fb55c4d7e020d9e960b1cdfbecea415f23a809e560ad3938351d27a84 - Sigstore transparency entry: 1205252666
- Sigstore integration time:
-
Permalink:
omardin14/scrum-planning-ai-agent@dd403daefe0dba65e3e33a1e56614fb86f4a72cc -
Branch / Tag:
refs/tags/v1.3.0 - Owner: https://github.com/omardin14
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@dd403daefe0dba65e3e33a1e56614fb86f4a72cc -
Trigger Event:
push
-
Statement type:
File details
Details for the file scrum_agent-1.3.0-py3-none-any.whl.
File metadata
- Download URL: scrum_agent-1.3.0-py3-none-any.whl
- Upload date:
- Size: 607.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6e4efa6396a182df6806820d0ee257ecd8a063608b104df79cae4a41aa1f9904
|
|
| MD5 |
575f98474e007a975b3213fa14e27825
|
|
| BLAKE2b-256 |
b77fecda187d4a3fb27328855dd1212415ef584aca2c853785b1b3cfc27a3eba
|
Provenance
The following attestation bundles were made for scrum_agent-1.3.0-py3-none-any.whl:
Publisher:
publish.yml on omardin14/scrum-planning-ai-agent
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
scrum_agent-1.3.0-py3-none-any.whl -
Subject digest:
6e4efa6396a182df6806820d0ee257ecd8a063608b104df79cae4a41aa1f9904 - Sigstore transparency entry: 1205252674
- Sigstore integration time:
-
Permalink:
omardin14/scrum-planning-ai-agent@dd403daefe0dba65e3e33a1e56614fb86f4a72cc -
Branch / Tag:
refs/tags/v1.3.0 - Owner: https://github.com/omardin14
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@dd403daefe0dba65e3e33a1e56614fb86f4a72cc -
Trigger Event:
push
-
Statement type: