Architect โ taxonomy-driven skill recommendation engine for AI agent builders
Project description
Skills Tree
๐ This Week's Highlights โ June 15, 2026
๐ฅ Most Active Skills
- Readme โ 1 PR
- Xquik Api โ 1 PR
The AI Agent Skill OS โ Build Smarter Agents, Faster
360 skills across 17 categories. Versioned, benchmarked, and openly evolving.
Stop rediscovering. Start building on what the community has already proven.50 skills are battle-tested today. 308 are stubs waiting for a real example, real I/O, and real failure modes โ see
meta/QUALITY-REPORT.mdfor the full list. PRs that turn a stub into a production-ready entry are the highest-impact contribution you can make.
๐ Browse Live UI ยท ๐บ๏ธ Systems ยท ๐๏ธ Blueprints ยท ๐ Benchmarks ยท ๐ฌ Labs ยท ๐ค Contribute ยท ๐บ Roadmap
๐ฆ Share Skills Tree on X / Twitter โ
๐ Read in your language: ๐ฌ๐ง English ยท ๐ธ๐ฆ ุงูุนุฑุจูุฉ ยท ๐จ๐ณ ไธญๆ ยท ๐ช๐ธ Espaรฑol ยท ๐ฉ๐ช Deutsch ยท ๐ซ๐ท Franรงais ยท ๐ฎ๐ณ เคนเคฟเคจเฅเคฆเฅ ยท ๐ฏ๐ต ๆฅๆฌ่ช ยท ๐ฐ๐ท ํ๊ตญ์ด ยท ๐ง๐ท Portuguรชs ยท ๐ท๐บ ะ ัััะบะธะน
The Problem
Every AI agent builder rediscovers the same skills from scratch.
Someone learns RAG the hard way. Someone else figures out memory injection at 2am. A third person spends a week benchmarking ReAct vs LATS โ and never shares the results. A fourth discovers the same failure modes you already hit last month.
That collective knowledge is disappearing into Slack threads, private repos, and Twitter bookmarks.
Skills Tree fixes that.
What This Is
Skills Tree is the shared operating system for AI agent capabilities.
A living, versioned, community-powered index of everything an agent can do โ at its best, documented with working code, real benchmarks, failure modes, and evolution history.
We don't pretend every entry is finished. Battle-tested skills (badged ๐ข verified) are production-ready and copy-paste safe. Yellow / unscanned skills are the community's TODO list โ open files, real problem space, and the clearest signal of where contributions are most useful.
It's not a list. It's infrastructure being built in public.
๐ Start Here โ Battle-Tested Skills
If you're new, read these first. Each one ships with runnable code, typed I/O, failure modes, and a model-comparison table.
Agent reasoning loops
- ReAct โ Thought โ Action โ Observation, the foundation of tool-using agents
- Chain of Thought โ explicit step-by-step reasoning + self-consistency
- Tree of Thought โ branched reasoning with scoring + beam search
- Reflection / Reflexion โ critique โ revise loop on top of any output
- Self-Consistency โ sample N chains, majority-vote
- Planning โ typed, DAG-validated plans your executor can run
- Task Decomposition โ break a goal into atomic, runnable subtasks
Retrieval & memory
- RAG โ chunk โ embed โ retrieve โ cite, end-to-end with confidence + threshold
- Vector Store Retrieval โ typed top-k cosine search with metadata filtering
- Embedding Generation โ batched, content-hash-cached, Matryoshka-truncatable
- Memory Injection โ top-K user memories per turn
- Short-Term Memory โ token-budgeted rolling window (the foundation for everything else)
Calling LLMs in production
- Function / Tool Calling โ the primitive that turns an LLM into an agent
- OpenAI API โ chat, structured outputs, tools, embeddings, streaming, retry
- Anthropic API โ Claude with tool loop, prompt caching, streaming
Working with text
- Translation โ placeholder-safe MT with glossary + tone
- Paraphrasing โ simplify / formalize / diversify
- OCR โ VLM + classical OCR with confidence-based human-review routing
Code
- Code Generation โ spec โ AST-validated source with self-repair on failure
- Bug Fixing โ agentic loop: read โ patch โ test โ repeat until green
- Code Review โ automated critique with severity tiers
Web
- Web Search โ Tavily/Serper/Brave with recency + host allowlist + TTL cache
- Web Scraping โ trafilatura + BS4 fallback, metadata, redirect-safe
Security
- Input Sanitization โ 4-layer defense: structural + boundary + content + isolation
Action execution
- File Write โ atomic, crash-safe file writes for agents
- HTTP Request โ production HTTP with idempotency, retry-on-idempotent-only, header redaction
- Dependency Auditor โ vulnerability + license + freshness audit
The full battle-tested set is auto-listed in
meta/QUALITY-REPORT.md. The same report names every stub that needs upgrading โ those are the highest-impact PRs you can submit.
What's Inside
skills-tree/
โ
โโโ skills/ โ 360 atomic skill files (50 battle-tested, 308 stubs awaiting upgrade)
โ run `python3 tools/check_skill_quality.py` for the live count
โโโ systems/ โ Multi-skill workflows (research agent, code reviewer...)
โโโ blueprints/ โ Copy-paste production architectures
โโโ benchmarks/ โ Head-to-head, reproducible skill comparisons
โโโ labs/ โ Experimental & bleeding-edge capabilities
โ
โโโ docs/ โ Interactive web UI (GitHub Pages)
โโโ i18n/ โ Localized READMEs (Arabic, Chinese, Spanish, German, French, Hindi, Japanese, Korean, Portuguese, Russian)
โโโ meta/ โ Schema, glossary, frameworks, roadmap, changelog
โโโ requirements.txt โ Pinned Python deps for CI workflows
๐๏ธ The 17 Skill Categories
| # | Category | Skills | What It Covers |
|---|---|---|---|
| 01 | ๐๏ธ Perception | 36 | Text, images, PDFs, code, sensors, databases, screens |
| 02 | ๐ง Reasoning | 39 | Planning, deduction, abduction, causal chains, commonsense |
| 03 | ๐๏ธ Memory | 19 | Working, episodic, semantic, vector, injection, forgetting |
| 04 | โก Action Execution | 21 | File I/O, HTTP, email, shell, database writes |
| 05 | ๐ป Code | 28 | Write, run, debug, review, refactor, test, deploy |
| 06 | ๐ฌ Communication | 15 | Summarize, translate, draft, argue, adapt tone |
| 07 | ๐ง Tool Use | 32 | APIs โ GitHub, Slack, Stripe, OpenAI, MCP, A2A |
| 08 | ๐ญ Multimodal | 14 | Images, audio, video, VQA, 3D, charts |
| 09 | ๐ค Agentic Patterns | 23 | ReAct, CoT, ToT, MCTS, LATS, RAG, Debate |
| 10 | ๐ฅ๏ธ Computer Use | 20 | Click, type, scroll, OCR, terminal, VM, a11y tree |
| 11 | ๐ Web | 17 | Search, scrape, crawl, login, fill forms, parse RSS |
| 12 | ๐ Data | 18 | ETL, SQL, embeddings, time series, anomaly detection |
| 13 | ๐จ Creative | 14 | Copywriting, image prompts, SVG, music, scripts |
| 14 | ๐ Security | 13 | Sandboxing, secret scanning, audit logs, rollback |
| 15 | ๐ผ Orchestration | 22 | Multi-agent, state machines, retry, consensus |
| 16 | ๐บ Domain-Specific | 28 | Medical, legal, finance, DevOps, education, science |
| 17 | ๐ ๏ธ Infrastructure | 1 | Dependency auditing & supply-chain tooling (early) |
Counts above reflect skill files on disk and are auto-synced by
tools/update_readme_counts.py(run nightly viaupdate-skill-count.yml). If you spot a drift, open an issue.
A Skill in 60 Seconds
Every skill file is self-contained and production-ready:
# Memory Injection
Category: memory | Level: intermediate | Stability: stable | Version: v2
## Description
Dynamically inject relevant past memories into an agent's system prompt
before each turn โ giving the model user context without filling the window.
## Example
```python
client.messages.create(
system=f"{base_system}\n\n## Memory\n{top_k_memories}",
messages=[{"role": "user", "content": user_message}]
)
```
## Benchmarks โ benchmarks/memory/injection-strategies.md
## Related โ working-memory.md ยท rag.md ยท vector-store-retrieval.md
## Changelog โ v1 (2025-03) ยท v2 (2026-04, added retrieval scoring)
Every skill includes:
- โ What it does and why it matters
- โ Typed inputs/outputs
- โ
Runnable Python code (
claude-opus-4-5/gpt-4o) - โ Frameworks table (LangChain, LangGraph, CrewAI, mem0...)
- โ Failure modes and edge cases
- โ Related skills cross-links
- โ Version history
Skill Versioning โ How Evolution Works
Skills are not static files. They evolve as the community learns:
v1 โ Initial entry: description + minimal example
v2 โ Enriched: better example + failure modes + related skills
v3 โ Battle-tested: benchmarks + model comparison + production notes
To upgrade a skill:
- Bump the version in frontmatter
- Add a changelog entry explaining what improved
- Open a PR titled
improve: skill-name โ v1 โ v2
The best versions surface naturally โ through PR merge frequency and inclusion in Systems + Blueprints.
๐บ๏ธ Systems โ Multi-Skill Workflows
See how skills combine into real, working agent pipelines:
| System | Skills Used | Use Case |
|---|---|---|
| Research Agent | Web search + RAG + Summarize + Cite | Deep research automation |
| Coding Agent | Code reading + Write + Debug + Test | End-to-end code generation |
| Code Reviewer | Code reading + Reasoning + Comment gen | Automated PR reviews |
| Data Pipeline Agent | DB reading + ETL + Anomaly detection | Automated data ops |
| Customer Support Bot | Memory injection + Intent + Response gen | Personalized support |
| Computer Use Agent | Screen reading + OCR + Click + Type | Full GUI automation |
| Data Analyst | SQL + Charts + Summarize + Insight gen | Automated data analysis |
| Voice Agent | Audio transcription + NLU + TTS | Real-time voice interaction |
๐๏ธ Blueprints โ Production Architectures
Copy-paste architectures for the most common agent patterns:
| Blueprint | Description |
|---|---|
| RAG Stack | Embed โ store โ retrieve โ generate, fully wired |
| Multi-Agent Workflow | Sequential orchestration with handoffs |
| Multi-Agent Mesh | N specialists + orchestrator, parallel execution |
| Computer Use Browser | Browser automation via Playwright + vision |
| Human-in-the-Loop | Approval gates, escalation, audit trails |
| Self-Healing Agent | Error detection, retry logic, rollback |
| Memory-First Agent | Profile + episodic + vector memory combined |
๐ Benchmarks โ Real Numbers, Reproducible
We test so you don't have to:
| Benchmark | Winner | Margin | Link |
|---|---|---|---|
| ReAct vs LATS (HotpotQA) | LATS | +8.3% accuracy | โ |
| RAG retrieval strategies | HyDE | +12% recall | โ |
| Memory injection methods | Top-K semantic | Best cost/quality ratio | โ |
| Function calling comparison | Claude 3.7 | +6% on tool accuracy | โ |
Every benchmark includes methodology, dataset, and reproducible test scripts.
๐ This Week's Highlights
Auto-updated weekly ยท Full leaderboard โ
๐ฅ Most Active Skills
skills/09-agentic-patterns/react.mdโ 12 community improvements this monthskills/03-memory/memory-injection.mdโ v2 with retrieval scoringskills/02-reasoning/causal.mdโ new benchmark comparison added
โก Battle-Tested (used in 10+ public projects)
ReAct ยท Chain of Thought ยท RAG Pipeline ยท Memory Injection ยท Tool Use
๐ฌ Hot in Labs
labs/reasoning/tree-of-agents.mdโ multi-agent tree searchlabs/memory/episodic-compression.mdโ lossy-but-useful memory compressionlabs/tool-use/adaptive-tool-selection.mdโ dynamic tool filtering for large registries
๐ค How to Contribute
Four types of contributions โ all valued:
| Type | What It Is | PR Title Format |
|---|---|---|
| New Skill | A capability not yet indexed | feat: add [skill] to [category] |
| Skill Upgrade | Bump v1โv2 with better content | improve: [skill] โ v1โv2 |
| Benchmark | Head-to-head with real numbers | benchmark: [skill-a] vs [skill-b] |
| System / Blueprint | Multi-skill workflow or architecture | system: add [name] |
git clone https://github.com/SamoTech/skills-tree.git
cp meta/skill-template.md skills/05-code/my-new-skill.md
# Fill in every section โ open a PR
Quality Rules
- โ No generic prompts or vague descriptions
- โ No skills without a working code example
- โ Must solve a real, specific problem
- โ Must be structured and reusable
- โ Must include inputs, outputs, and at least one runnable example
Full guide: CONTRIBUTING.md
Quick Start
# Clone
git clone https://github.com/SamoTech/skills-tree.git
# Find a skill by keyword
grep -r "memory injection" skills/ --include="*.md" -l
# Read a full system end-to-end
cat systems/research-agent.md
# See benchmark results
cat benchmarks/tool-use/function-calling-comparison.md
Who This Is For
๐๏ธ Agent Builders โ Production skill patterns, ready to use today
๐ฌ AI Researchers โ Benchmarks, taxonomy, and full capability coverage
๐ System Architects โ Blueprints for multi-agent production systems
๐ Learners โ Structured path from basic skills โ advanced systems
๐ค Contributors โ A community that improves everything together
๐บ๏ธ Roadmap
See the full plan: meta/ROADMAP.md
Near-term (v2.x):
- Skill dependency graph โ visual map of how skills relate
- Skill Paths โ curated learning tracks (e.g., "Build a Research Agent in 5 skills")
- JSON/YAML export of all skill metadata for programmatic use
- Community skill ratings and upvotes
- Auto-leaderboard: Top Skills This Week, Most Improved, Battle-Tested
Medium-term (v3.0):
- CLI:
skills-tree search "memory injection"โ returns ranked results - LangChain Hub / MCP registry integration
- โ
Localization: Arabic, Chinese, Spanish READMEsโ shipped in v2.1 - Automated changelog generation on PR merge
Long-term vision:
- Skills Tree becomes the canonical reference for AI agent capabilities
- Every major agent framework links here as the skill index
- 1000+ skills, all battle-tested, all benchmarked
Vision
AI agents are becoming teammates, not tools.
Skills Tree is the shared foundation they run on โ a living OS of capabilities that the community builds, tests, and evolves together.
Every skill added here saves every agent builder who comes after you. Every benchmark run here prevents someone else from wasting a week. Every system documented here becomes a launchpad for the next builder.
This is not a repo. It's infrastructure for the AI-native era.
โญ Star this repo ยท ๐ Browse Skills ยท ๐ค Contribute ยท ๐บ Roadmap ยท ๐ Sponsor
The AI Agent Skill OS โ built by the community, for the community.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file skills_tree-1.3.1.tar.gz.
File metadata
- Download URL: skills_tree-1.3.1.tar.gz
- Upload date:
- Size: 693.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4a3016bd16b8bf66fe319b5acafb82c68a7cae250007908dad210ad8a6ed363f
|
|
| MD5 |
91cb5cb48c407dbec518634d585f08f4
|
|
| BLAKE2b-256 |
e24f40a62a6061e137d0d54732ec20fa980554d76ab4ee34657ca549b96c7f84
|
Provenance
The following attestation bundles were made for skills_tree-1.3.1.tar.gz:
Publisher:
zero-touch-release.yml on SamoTech/skills-tree
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
skills_tree-1.3.1.tar.gz -
Subject digest:
4a3016bd16b8bf66fe319b5acafb82c68a7cae250007908dad210ad8a6ed363f - Sigstore transparency entry: 1825294402
- Sigstore integration time:
-
Permalink:
SamoTech/skills-tree@f1226a9670538f556ed0d6aacf6b5219bbdf3550 -
Branch / Tag:
refs/heads/main - Owner: https://github.com/SamoTech
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
zero-touch-release.yml@f1226a9670538f556ed0d6aacf6b5219bbdf3550 -
Trigger Event:
push
-
Statement type:
File details
Details for the file skills_tree-1.3.1-py3-none-any.whl.
File metadata
- Download URL: skills_tree-1.3.1-py3-none-any.whl
- Upload date:
- Size: 798.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3fccc3c5f92e0f877ccddc764b0a915629c60535b56d14fe50688adc95a4e910
|
|
| MD5 |
5c29970a3b1773e0b9ffe5d0d1bd05ec
|
|
| BLAKE2b-256 |
009cd4a371800a7414670106c19b2d60caa3bccc0539e1d2092378c1ceb00784
|
Provenance
The following attestation bundles were made for skills_tree-1.3.1-py3-none-any.whl:
Publisher:
zero-touch-release.yml on SamoTech/skills-tree
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
skills_tree-1.3.1-py3-none-any.whl -
Subject digest:
3fccc3c5f92e0f877ccddc764b0a915629c60535b56d14fe50688adc95a4e910 - Sigstore transparency entry: 1825294513
- Sigstore integration time:
-
Permalink:
SamoTech/skills-tree@f1226a9670538f556ed0d6aacf6b5219bbdf3550 -
Branch / Tag:
refs/heads/main - Owner: https://github.com/SamoTech
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
zero-touch-release.yml@f1226a9670538f556ed0d6aacf6b5219bbdf3550 -
Trigger Event:
push
-
Statement type: