Skip to main content

Agent Operating System — Production-ready multi-model agent framework with TokenCounter, SemanticMemoryRetriever, ConfigPresets, ToolRegistry, WorkflowTemplate, ResponseCache, AgentGraph, StreamingAgent, ConversationMemory, AsyncAgentLoop, SwarmPatterns, CI/CD pipeline, and 30+ production modules

Project description

AgentOS v0.99

Agent Operating System — Production-ready multi-model agent framework.

Overview

AgentOS is a modular, twelve-layered framework for building, orchestrating, and deploying AI agents. It supports OpenAI, Anthropic, Google Gemini, and open-source models through a unified routing layer.

Quick Start

pip install agentos

# Create a new project
agentos init my-agent

# Start the API server
agentos serve --preset production
from agentos import AgentLoop, LoopConfig, ModelRouter

loop = AgentLoop(
    config=LoopConfig(model="gpt-4o", max_iterations=10),
    router=ModelRouter(),
)
result = loop.run("Summarize the key features of AgentOS.")
print(result.output)

Architecture

agentos/
├── agents/         Agent marketplace & skill registry
├── api/            REST API server, middleware, streaming, versioning
├── benchmarks/     Benchmarking & performance testing
├── cache/          LLM cache, response cache, embedding cache
├── cli/            CLI scaffolding & serve commands
├── comm/           Inter-agent communication (blackboard, event bus)
├── config/         Configuration loader, validator, presets
├── core/           Agent loop, state machine, async loop, streaming
├── cost/           Cost tracking, token counting
├── deployment/     Docker & Kubernetes deployment
├── docs/           API documentation generator
├── errors/         Error formatting & handling
├── evaluation/     Scoring & evaluation metrics
├── experiments/    A/B experiment runner
├── feedback/       User feedback collection & learning
├── health/         Health checks & monitoring
├── logging/        Structured logging
├── memory/         Short-term, long-term, working memory, summarizer, retriever
├── models/         Model router, resilience, backends (Gemini)
├── monitoring/     Alerting & metrics
├── multimodal/     Image, audio, document processing
├── observability/  Cost analytics, metrics, tracing
├── orchestration/  DAG orchestrator, agent graph execution
├── plugins/        Plugin system & lifecycle
├── prompts/        Prompt registry & templates
├── protocols/      Agent contracts & MCP
├── queue/          Task queue & rate limiter
├── security/       Guardrails, auditor, sandbox
├── server/         MCP server
├── storage/        Storage backend abstraction
├── subagent/       Sub-agent management
├── swarm/          Swarm coordination & patterns
├── testing/        Test fixtures & mocks
├── tools/          Tool registry, function calling, orchestrator, generator
├── vectorstore/    Vector database abstraction
└── workflows/      Workflow engine & templates

Key Features

v0.99 New

Module Description
TokenCounter Model-aware token counting + cost estimation for OpenAI/Anthropic/Gemini/Llama
SemanticMemoryRetriever Hybrid memory search (semantic + BM25 keyword) across conversation & long-term memory
ConfigPresets 8 ready-to-use config profiles: development, production, testing, budget, creative, deep_research, gemini_fast, gemini_pro

Core Features (v0.95 - v0.98)

Module Description
ToolRegistry Function calling pipeline with JSON Schema validation & batch execution
WorkflowTemplate Declarative workflow templates (YAML/JSON) with 6 step types
ResponseCache TTL cache with LRU eviction & 3 key strategies
AgentGraph DAG execution engine with Mermaid export
StreamingAgent SSE real-time streaming with session management
ConversationMemory 4 window strategies: Sliding, TokenAware, Importance, Hybrid
AsyncAgentLoop Async concurrent execution with p50/p95/p99 latency stats
SwarmPatterns 5 collaboration topologies: Broadcast, Pipeline, Hierarchical, Consensus, RoundRobin

Infrastructure (v0.50 - v0.95)

  • Model Router: Unified routing across OpenAI, Anthropic, Gemini, Llama
  • Guardrails: Content safety, PII sanitization, content hashing
  • Rate Limiter: Token bucket, sliding window, concurrency limiter
  • Circuit Breaker: Resilience patterns with configurable retry
  • Cost Analytics: Budget alerts, cost breakdown, session tracking
  • Health Checks: OpenAI connectivity, vector store, disk space, memory
  • Security Auditor: Full security audit with severity-based findings
  • Docker/K8s: Auto-generate Dockerfile + docker-compose
  • CI/CD: GitHub Actions with multi-OS, 3 Python versions, lint, bandit security scan

Config Presets

from agentos import get_preset, list_presets

# List all presets
for name in list_presets():
    p = get_preset(name)
    print(f"{p.name}: {p.model} (T={p.temperature})")

# Apply a preset
config = {"max_iterations": 15}
from agentos import apply_preset
apply_preset("production", config)  # Overrides with prod defaults
Preset Model Temp Use Case
development gpt-4o-mini 0.8 Local dev, fast iteration
production gpt-4o 0.3 Deployed services
testing gpt-4o-mini 0.0 CI/CD tests
budget gpt-4o-mini 0.5 Cost-sensitive
creative claude-3.5-sonnet 0.95 Creative writing
deep_research claude-3-opus 0.4 Research & analysis
gemini_fast gemini-2.0-flash 0.7 High throughput
gemini_pro gemini-1.5-pro 0.5 2M context window

Token Counting

from agentos import TokenCounter

counter = TokenCounter()
tokens = counter.count("Hello, agent world!", model="gpt-4o")
print(f"Tokens: {counter.format_tokens(tokens)}")

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Explain quantum computing in 3 sentences."},
]
total = counter.count_messages(messages, model="gpt-4o")
cost = counter.estimate_cost(total)
print(f"Cost: {counter.format_cost(cost)}")

Memory Retrieval

from agentos import SemanticMemoryRetriever, RetrievalStrategy, MemoryEntry

retriever = SemanticMemoryRetriever()

# Index memories
retriever.index([
    MemoryEntry(id="1", content="Deployed to production at 3pm UTC", source="long_term"),
    MemoryEntry(id="2", content="User asked about GDPR compliance", source="conversation"),
    MemoryEntry(id="3", content="Database migration scheduled for Friday", source="conversation"),
])

# Hybrid search (semantic + keyword)
results = retriever.retrieve("When is the next deployment?")
for r in results:
    print(f"[{r.score:.2f}] {r.entry.content}")

Requirements

  • Python >= 3.11
  • openai >= 1.0.0
  • httpx >= 0.27.0
  • pyyaml >= 6.0
  • pydantic >= 2.0
  • fastapi >= 0.110.0 (optional, for API server)

License

MIT

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nexus_agentos-1.1.3.tar.gz (170.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nexus_agentos-1.1.3-py3-none-any.whl (207.6 kB view details)

Uploaded Python 3

File details

Details for the file nexus_agentos-1.1.3.tar.gz.

File metadata

  • Download URL: nexus_agentos-1.1.3.tar.gz
  • Upload date:
  • Size: 170.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.6

File hashes

Hashes for nexus_agentos-1.1.3.tar.gz
Algorithm Hash digest
SHA256 e5e3806938d82caf6840d2722b222fa5d224ca090020aa866b64d11d6638869e
MD5 12429f1370c7dc42c22b93c610086e44
BLAKE2b-256 894d59ddaeefa0f8b0facdc56b0c52e45b998d976c84d8948e668dd577f1eaa2

See more details on using hashes here.

File details

Details for the file nexus_agentos-1.1.3-py3-none-any.whl.

File metadata

  • Download URL: nexus_agentos-1.1.3-py3-none-any.whl
  • Upload date:
  • Size: 207.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.6

File hashes

Hashes for nexus_agentos-1.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 fdfb84927254cb996125efb1374f06eb743a528493c641c8c7763f2c4531dd49
MD5 e8c5b3fdb6c26f8c6f9fff872e9643ef
BLAKE2b-256 e1ab87ab8134c07b26242c39d59100d2cb52a7eeb87a9c46308d1c8e38c20d3d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page