Autonomous AI research and development platform powered by Claude

These details have not been verified by PyPI

Project links

Project description

AI-AtlasForge

An autonomous AI research and development platform with multi-provider LLM support (Claude, Codex, Gemini). Run long-duration missions, accumulate cross-session knowledge, and build software autonomously.

What is AI-AtlasForge?

AI-AtlasForge is not a chatbot wrapper. It's an autonomous research engine that:

Runs multi-day missions without human intervention
Maintains mission continuity across context windows
Accumulates knowledge that persists across sessions
Self-corrects when drifting from objectives
Adversarially tests its own outputs
Multi-provider: Supports Claude, OpenAI Codex, and Google Gemini as LLM backends

Quick Start

Prerequisites

Python 3.10+
Anthropic API key (get one at https://console.anthropic.com/)
Linux environment (tested on Ubuntu 22.04+, Debian 12+)

Platform Notes:

Windows: Use WSL2 (Windows Subsystem for Linux)

macOS: Should work but is untested. Please report issues.

Option 1: Standard Installation

# Clone the repository
git clone https://github.com/DragonShadows1978/AI-AtlasForge.git
cd AI-AtlasForge

# Run the installer
./install.sh

# Configure your API key
export ANTHROPIC_API_KEY='your-key-here'
# Or edit config.yaml / .env

# Verify installation
./verify.sh

Option 2: One-Liner Install

curl -sSL https://raw.githubusercontent.com/DragonShadows1978/AI-AtlasForge/main/quick_install.sh | bash

Option 3: Docker Installation

git clone https://github.com/DragonShadows1978/AI-AtlasForge.git
cd AI-AtlasForge
docker compose up -d
# Dashboard at http://localhost:5050

For detailed installation options, see INSTALL.md or QUICKSTART.md.

Running Your First Mission

Start the Dashboard (optional, for monitoring):

make dashboard
# Or: python3 dashboard_v2.py
# Access at http://localhost:5050

Create a Mission:
- Via Dashboard: Click "Create Mission" and enter your objectives
- Via Sample: Run make sample-mission to load a hello-world mission
- Via JSON: Create state/mission.json manually

Start the Engine:

make run
# Or: python3 atlasforge_conductor.py --mode=rd

Development Commands

Run make help to see all available commands:

make install      # Full installation
make verify       # Verify installation
make dashboard    # Start dashboard
make run          # Start autonomous agent
make docker       # Start with Docker
make sample-mission  # Load sample mission

What's New in v1.8.4

Handoff System Overhaul - Complete rework of the conductor handoff system for improved reliability across mission cycles
Widget Visibility Toggles - Dashboard widgets can now be hidden/shown without disabling backend services
Dashboard Drag & Drop - Drag-and-drop widget reordering with layout presets, undo/redo, and touch support
Context Watcher Improvements - Enhanced token tracking and handoff logic
Systemd Auto-Start - Fixed graphical-session.target dependency on Linux Mint, Dashboard and Tray services now auto-start on boot via default.target

What's New in v1.8.3

Test Harness Improvements - Refactored subprocess mocking in conductor timeout tests, improved phase-aware drift validation, provider-aware ground rules caching
Stability Fixes - Enhanced test coverage for timeout scenarios, improved error handling in stage handlers, Gemini provider integration tests

What's New in v1.8.2

Bug Fixes - Fixed null handling in suggestion analyzer, improved storage fallback in dashboard similarity analysis

What's New in v1.8.1

Dashboard Services Config - Added Atlas Lab service configuration to services registry

What's New in v1.8.0

Google Gemini Support - Full provider integration with subscription-based API access. Gemini missions validated on complex codebases (custom autograd implementations). Code generation, testing, and iteration loops proven functional
Provider-Agnostic Architecture - Three LLM backends (Claude, Codex, Gemini) running through unified orchestration with provider-specific hardening
Enhanced Gemini Integration - Defensive API invocation, clear error parsing, subscription auth support (API key or OAuth)
Mission Validation - Tested Gemini on Project Tensor (custom autograd) - improved code robustness and performance through multi-cycle iteration

What's New in v1.7.0

OpenAI Codex Support - Full multi-provider support: run missions and investigations with Claude or Codex as the LLM backend. Provider-aware ground rules, prompt templates, and transcript handling
Ground Rules Loader - Provider-aware ground rules system with overlay support for Claude/Codex/investigation modes
Enhanced Context Watcher - Major overhaul with improved token tracking, time-based handoff, and Haiku-powered summaries
Experiment Framework - Expanded scientific experiment orchestration with multi-hypothesis testing
Investigation Engine - Enhanced multi-subagent investigation system with provider selection
Dashboard Improvements - New widgets system, improved chat interface, better WebSocket handling
Transcript Archival - New integration for automatic transcript archival
110 files changed, 3500+ lines added across the platform

Architecture

                    +-------------------+
                    |   Mission State   |
                    |  (mission.json)   |
                    +--------+----------+
                             |
              +--------------+--------------+
              |                             |
    +---------v---------+         +--------v--------+
    |    AtlasForge     |         |    Dashboard    |
    | (Execution Engine)|         |   (Monitoring)  |
    +---------+---------+         +-----------------+
              |
    +---------v---------+         +-------------------+
    |  Modular Engine   |<------->|  Context Watcher  |
    | (StageOrchestrator)|        | (Token + Time)    |
    +---------+---------+         +-------------------+
              |
    +---------v-------------------+
    |     Stage Handlers          |
    |                             |
    |  PLANNING -> BUILDING ->    |
    |  TESTING -> ANALYZING ->    |
    |  CYCLE_END -> COMPLETE      |
    +-----------------------------+
              |
    +---------v-------------------+
    |   Integration Manager       |
    |   (Event-Driven Hooks)      |
    +-----------------------------+

Mission Lifecycle

PLANNING - Understand objectives, research codebase, create implementation plan
BUILDING - Implement the solution
TESTING - Validate implementation
ANALYZING - Evaluate results, identify issues
CYCLE_END - Generate reports, prepare continuation
COMPLETE - Mission finished

Missions can iterate through multiple cycles until success criteria are met.

Core Components

atlasforge.py

Main execution loop. Spawns Claude instances, manages state, handles graceful shutdown.

af_engine/ (Modular Engine)

Plugin-based mission execution system:

StageOrchestrator - Core workflow orchestrator (~300 lines)
Stage Handlers - Pluggable handlers for each stage (Planning, Building, Testing, Analyzing, CycleEnd, Complete)
IntegrationManager - Event-driven integration coordination
PromptFactory - Template-based prompt generation

Mission Queue

Queue multiple missions to run sequentially:

Auto-start next mission when current completes
Set cycle budgets per mission
Priority ordering
Dashboard integration for queue management

Context Watcher

Real-time context monitoring to prevent timeout waste:

Token-based detection: Monitors JSONL transcripts for context exhaustion (130K/140K thresholds)
Time-based detection: Proactive handoff at 55 minutes before 1-hour timeout
Haiku-powered summaries: Generates intelligent HANDOFF.md via Claude Haiku
Automatic recovery: Sessions continue from HANDOFF.md on restart

See context_watcher/README.md for detailed documentation.

dashboard_v2.py

Web-based monitoring interface showing mission status, knowledge base, and analytics.

Knowledge Base

SQLite database accumulating learnings across all missions:

Techniques discovered
Insights gained
Gotchas encountered
Reusable code patterns

Adversarial Testing

Separate Claude instances that test implementations:

RedTeam agents with no implementation knowledge
Mutation testing
Property-based testing

GlassBox

Post-mission introspection system:

Transcript parsing
Agent hierarchy reconstruction
Stage timeline visualization

Key Features

Display Layer (Windows)

Visual environment for graphical application testing:

Screenshot capture from virtual display
Web-accessible display via noVNC (localhost:6080)
Web terminal via ttyd (localhost:7681)
Browser support for OAuth flows and web testing
Automatic GPU detection with software fallback

See docs/DISPLAY_LAYER.md for the user guide.

Mission Continuity

Missions survive context window limits through:

Persistent mission.json state
Cycle-based iteration
Continuation prompts that preserve context

Knowledge Accumulation

Every mission adds to the knowledge base. The system improves over time as it learns patterns, gotchas, and techniques.

Autonomous Operation

Designed for unattended execution:

Graceful crash recovery
Stage checkpointing
Automatic cycle progression

Directory Structure

AI-AtlasForge/
+-- atlasforge_conductor.py # Main orchestrator
+-- af_engine/              # Modular engine package
|   +-- orchestrator.py     # StageOrchestrator
|   +-- stages/             # Stage handlers
|   +-- integrations/       # Event-driven integrations
+-- .af_archived/           # Archived legacy files (pre-modular engine backups)
+-- context_watcher/        # Context monitoring module
|   +-- context_watcher.py  # Token + time-based handoff
|   +-- tests/              # Context watcher tests
+-- dashboard_v2.py         # Web dashboard
+-- adversarial_testing/    # Testing framework
+-- atlasforge_enhancements/  # Enhancement modules
+-- workspace/              # Active workspace
|   +-- glassbox/           # Introspection tools
|   +-- artifacts/          # Plans, reports
|   +-- research/           # Notes, findings
|   +-- tests/              # Test scripts
+-- state/                  # Runtime state
|   +-- mission.json        # Current mission
|   +-- claude_state.json   # Execution state
+-- missions/               # Mission workspaces
+-- atlasforge_data/
|   +-- knowledge_base/     # Accumulated learnings
+-- logs/                   # Execution logs

Configuration

AI-AtlasForge uses environment variables for configuration:

Variable	Default	Description
`ATLASFORGE_PORT`	`5050`	Dashboard port
`ATLASFORGE_ROOT`	(script directory)	Base directory
`ATLASFORGE_DEBUG`	`false`	Enable debug logging
`USE_MODULAR_ENGINE`	`true`	Use new modular engine (set to `false` for legacy)

Dashboard Features

The web dashboard provides real-time monitoring:

Mission Status - Current stage, progress, timing
Activity Feed - Live log of agent actions
Knowledge Base - Search and browse learnings
Analytics - Token usage, cost tracking
Mission Queue - Queue and schedule missions
GlassBox - Post-mission analysis

Philosophy

First principles only. No frameworks hiding integration failures. Every component built from scratch for full visibility.

Speed of machine, not human. Designed for autonomous operation. Check in when convenient, not when required.

Knowledge accumulates. Every mission adds to the knowledge base. The system gets better over time.

Trust but verify. Adversarial testing catches what regular testing misses. The same agent that writes code doesn't validate it.

Requirements

Python 3.10+
Node.js 18+ (optional, for dashboard JS modifications)
Anthropic API key
Linux environment (Ubuntu 22.04+, Debian 12+)

Python Dependencies

See requirements.txt or pyproject.toml for full list.

Documentation

QUICKSTART.md - Get started in 5 minutes
INSTALL.md - Detailed installation guide
USAGE.md - How to use AI-AtlasForge
ARCHITECTURE.md - System architecture
DISPLAY_LAYER.md - Display Layer user guide (Windows)
TROUBLESHOOTING.md - Display Layer troubleshooting

Recent Changes

v1.9.1 (2026-02-20)

Dashboard Filter Persistence - All dashboard filters, sorts, and search state now persist across page reloads via versioned localStorage schema
Mission Suggestion Sort/Filter Persistence - Sort field, sort direction, tag filter, and health filter all persist (schema v2 with migration from legacy flat-map)
Analytics Period Persistence - Selected analytics time period persists across sessions
Glassbox UI Persistence - Search query, date range, and selected mission persist in Glassbox viewer
Global Preference Registry - Centralized ALL_PREFERENCE_KEYS list and clearAllPreferences() for one-click reset
Stage Gate Lock File Fix - Hook now bypasses all enforcement when no active Conductor process is detected via lock file; fixes normal Claude Code terminal usage being blocked post-mission
Stage Normalization - Stage names normalized to uppercase when read from lock file; prevents silent bypass on lowercase stage values

v1.9.0 (2026-02-20)

Modular Engine Only - Retired legacy monolithic af_engine.py (3,688 lines); modular af_engine/ package is now the sole engine implementation
Archival Module - Migrated transcript archival functions to af_engine/core/archival.py; removed importlib.util dynamic loading hack
Engine Init Simplified - af_engine/__init__.py reduced from ~150 lines to ~50; USE_MODULAR_ENGINE feature flag removed entirely
Dashboard WebSocket Push - Live stage updates pushed to connected clients when af_engine stage changes; no polling required
Analytics Integration - Dashboard analytics endpoints enriched with engine-native metrics (success rate, execution time, task counts)
Stage Gate Enforcement - Two-layer stage enforcement: CLI --disallowedTools per stage + hook-level path restrictions

v1.8.7 (2026-02-19)

Widget Settings Popup - Mobile panel reordering via widget settings buttons
Collapsed Card Improvements - Stage indicator and health summary remain visible when widgets are collapsed
Dashboard CSS - Refined collapsed card styling and status card layout

v1.8.6 (2026-02-19)

Widget Control Mechanism - Overhauled widget visibility toggle system; widgets can be hidden/shown independently of backend services
Token Sanity Check - New integration that validates token counts before handoff to prevent corrupt context windows
Transcript Archival - Improved automatic transcript archival integration
Orchestrator Updates - Enhanced stage orchestration reliability
Dashboard Queue Scheduler - Improved mission queue scheduling and priority handling
Dashboard Drag-Drop - Refined drag-and-drop widget reordering with better touch support

v1.8.5 (2026-02-18)

CLAUDECODE env fix - Conductor now strips CLAUDECODE env var before spawning Claude subprocesses, preventing "nested session" crash when launched from an active Claude Code session
Multiple mission completions - AtlasLab fork mission, StoryForge missions, and several R&D cycles completed autonomously
Widget visibility toggles - Dashboard widgets can now be hidden without disabling backend
Handoff system overhaul - Major rework of session handoff and continuity system

v1.8.4 (2026-02-15)

Drag-and-drop widget reordering in dashboard
Handoff system overhaul with improved continuity
Widget visibility toggles

v1.7.0 (2026-02-06)

OpenAI Codex Support - Multi-provider LLM backend: run missions and investigations with Claude or Codex. Provider-aware ground rules, prompts, and transcript handling
Ground Rules Loader - Provider-aware ground rules system with overlay support for Claude/Codex/investigation modes
Enhanced Context Watcher - Major overhaul with improved token tracking, time-based handoff, and Haiku-powered summaries
Experiment Framework - Expanded scientific experiment orchestration with multi-hypothesis testing
Investigation Engine - Enhanced multi-subagent investigation system with provider selection
Dashboard Improvements - New widgets system, improved chat interface, better WebSocket handling
PromptFactory Enhancements - Provider-aware caching, AfterImage integration with fallback paths
Conductor Hardening - Improved session management, singleton protocol, crash recovery
Transcript Archival - New integration for automatic transcript archival
Research Agent - Improved web researcher and knowledge synthesizer
110 files changed, 3500+ lines added across the platform

v1.6.9 (2026-02-02)

Fixed GlassBox visualization issues

v1.6.8 (2026-02-01)

Fixed zombie timer bug - stale session cleanup now stops timer threads
Fixed continuation prompt bug - cycle progression now updates problem_statement
Added conductor singleton with takeover protocol (prevents multiple instances)

v1.6.7 (2026-02-01)

Fixed JSON response parsing bug in conductor (handles markdown code blocks)
ContextWatcher stability improvements

v1.6.5 (2026-01-31)

Build checkpoint improvements
Mission state persistence fixes

License

MIT License - see LICENSE for details.

Contributing

Contributions are welcome! Please feel free to submit issues and pull requests.

Related Projects

AI-AfterImage - Episodic memory for AI coding agents. Gives Claude Code persistent memory of code it has written across sessions. Works great with AtlasForge for cross-mission code recall.

Acknowledgments

Built on Claude by Anthropic. Special thanks to the Claude Code team for making autonomous AI development possible.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

2.4.3

Apr 27, 2026

2.4.1

Apr 26, 2026

2.4.0

Apr 25, 2026

2.3.0

Apr 17, 2026

2.2.0

Apr 3, 2026

This version

2.0.2

Mar 10, 2026

2.0.1

Mar 9, 2026

2.0.0

Mar 4, 2026

1.12.0

Mar 1, 2026

1.11.0

Feb 28, 2026

1.10.0

Feb 23, 2026

1.9.1

Feb 20, 2026

1.9.0

Feb 20, 2026

1.8.7

Feb 19, 2026

1.8.6

Feb 19, 2026

1.8.5

Feb 18, 2026

1.8.4

Feb 16, 2026

1.8.3

Feb 10, 2026

1.8.2

Feb 9, 2026

1.8.1

Feb 9, 2026

1.8.0

Feb 8, 2026

1.7.1

Feb 6, 2026

1.7.0

Feb 6, 2026

1.6.9

Feb 3, 2026

1.6.8

Feb 2, 2026

1.6.7

Feb 1, 2026

1.6.5

Jan 31, 2026

1.6.4

Jan 31, 2026

1.6.3

Jan 31, 2026

1.6.2

Jan 30, 2026

1.6.1

Jan 29, 2026

1.6.0

Jan 28, 2026

1.5.1

Jan 26, 2026

1.5.0

Jan 26, 2026

1.4.3

Jan 23, 2026

1.4.2

Jan 23, 2026

1.4.1

Jan 23, 2026

1.4.0

Jan 20, 2026

1.3.7

Jan 19, 2026

1.3.6

Jan 19, 2026

1.3.5

Jan 18, 2026

1.3.4

Jan 18, 2026

1.3.3

Jan 18, 2026

1.3.2

Jan 18, 2026

1.3.1

Jan 18, 2026

1.3.0

Jan 18, 2026

1.2.3

Jan 18, 2026

1.2.2

Jan 18, 2026

1.2.1

Jan 17, 2026

1.2.0

Jan 16, 2026

1.1.0

Jan 15, 2026

1.0.0

Jan 13, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ai_atlasforge-2.0.2.tar.gz (298.1 kB view details)

Uploaded Mar 10, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ai_atlasforge-2.0.2-py3-none-any.whl (303.7 kB view details)

Uploaded Mar 10, 2026 Python 3

File details

Details for the file ai_atlasforge-2.0.2.tar.gz.

File metadata

Download URL: ai_atlasforge-2.0.2.tar.gz
Upload date: Mar 10, 2026
Size: 298.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for ai_atlasforge-2.0.2.tar.gz
Algorithm	Hash digest
SHA256	`7b9a668985a490a5e1a9e503ce0899f7a940d6d4ef94bff6395122323ed695ed`
MD5	`39070ca102ff2f7fed13c1ed83e4a9da`
BLAKE2b-256	`656f56eb589022ec47d0b34c0851bdb31ee8d78712021d8f4ae30a18f78c333b`

See more details on using hashes here.

File details

Details for the file ai_atlasforge-2.0.2-py3-none-any.whl.

File metadata

Download URL: ai_atlasforge-2.0.2-py3-none-any.whl
Upload date: Mar 10, 2026
Size: 303.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for ai_atlasforge-2.0.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9acea88d58a4bc72139de3ed9e868e94b67e6784679803367e8229697fc29ea7`
MD5	`069f20dc5163b482a50b71c9319fbdee`
BLAKE2b-256	`ecc3892c5e4a80eb76f549663eb55af1ac3565d57dfcf0f95366bdb29dc4f235`

See more details on using hashes here.

ai-atlasforge 2.0.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

AI-AtlasForge

What is AI-AtlasForge?

Quick Start

Prerequisites

Option 1: Standard Installation

Option 2: One-Liner Install

Option 3: Docker Installation

Running Your First Mission

Development Commands

What's New in v1.8.4

What's New in v1.8.3

What's New in v1.8.2

What's New in v1.8.1

What's New in v1.8.0

What's New in v1.7.0

Architecture

Mission Lifecycle

Core Components

atlasforge.py

af_engine/ (Modular Engine)

Mission Queue

Context Watcher

dashboard_v2.py

Knowledge Base

Adversarial Testing

GlassBox

Key Features

Display Layer (Windows)

Mission Continuity

Knowledge Accumulation

Autonomous Operation

Directory Structure

Configuration

Dashboard Features

Philosophy

Requirements

Python Dependencies

Documentation

Recent Changes

v1.9.1 (2026-02-20)

v1.9.0 (2026-02-20)

v1.8.7 (2026-02-19)

v1.8.6 (2026-02-19)

v1.8.5 (2026-02-18)

v1.8.4 (2026-02-15)

v1.7.0 (2026-02-06)

v1.6.9 (2026-02-02)

v1.6.8 (2026-02-01)

v1.6.7 (2026-02-01)

v1.6.5 (2026-01-31)

License

Contributing

Related Projects

Acknowledgments

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes