Agent Context Manager

Intelligent context management for AI agents with cost optimization.

Problem

AI agents have limited context windows (e.g., 128k tokens) but often generate or process more content than fits. Existing solutions each have drawbacks:

  • Claude Code /compact: Can lose important information and operates as a black box
  • Gemini long context: Expensive (price doubles after 200k tokens) and vendor-locked
  • Simple truncation: Discards potentially critical information

Solution

agent-context-manager provides intelligent, transparent context management:

  1. Semantic compression: Compresses based on content importance rather than simply truncating
  2. Priority-based retention: Keeps critical information based on task importance
  3. Cost optimization: Integrates with Budget Guard for cost-aware decisions
  4. Transparent operation: Developers control what gets kept/discarded
  5. Vendor-agnostic: Works with any LLM/framework
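
To make priority-based retention concrete, here is a minimal standalone sketch. It does not use or reproduce the package's internals: the `Message` class, the `retain_by_priority` helper, and the four-characters-per-token estimate are all assumptions made for illustration.

```python
from dataclasses import dataclass

# Lower rank means "keep first". The labels match the README's high/medium/low.
PRIORITY_RANK = {"high": 0, "medium": 1, "low": 2}


@dataclass
class Message:
    role: str
    content: str
    priority: str = "medium"


def estimate_tokens(text: str) -> int:
    """Rough heuristic (assumption): about four characters per token."""
    return max(1, len(text) // 4)


def retain_by_priority(messages, budget):
    """Keep messages by priority (newest first within a priority) until the
    token budget is spent, then return the survivors in original order."""
    order = sorted(
        range(len(messages)),
        key=lambda i: (PRIORITY_RANK[messages[i].priority], -i),
    )
    kept, used = set(), 0
    for i in order:
        cost = estimate_tokens(messages[i].content)
        if used + cost <= budget:
            kept.add(i)
            used += cost
    return [m for i, m in enumerate(messages) if i in kept]


history = [
    Message("system", "S" * 40, "high"),
    Message("assistant", "A" * 400, "low"),
    Message("user", "U" * 40, "medium"),
]
kept = retain_by_priority(history, budget=25)
print([m.role for m in kept])
```

With a budget of 25 estimated tokens, the long low-priority message is the one that gets dropped, while the system and user messages survive in their original order.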

Features

  • Context monitoring: Real-time token usage tracking
  • Intelligent compression: Semantic understanding of content importance
  • Priority management: Mark messages as high/medium/low priority
  • Cost integration: Works with Budget Guard for cost optimization
  • Visual dashboard: Context usage analytics and optimization insights
  • Multi-model support: OpenAI, Anthropic, Google, and open-source models

Installation

pip install agent-context-manager

For LLM-powered semantic compression (optional):

pip install "agent-context-manager[llm]"

Quick Start

from agent_context_manager import ContextManager

# Initialize with your model and budget
manager = ContextManager(
    model="gpt-4",
    max_tokens=128000,
    budget_guard_api_key="your-api-key"  # Optional, for cost optimization
)

# Add messages with priorities
manager.add_message(
    content="System instructions are critical",
    role="system",
    priority="high"
)

manager.add_message(
    content="Recent conversation is important",
    role="user",
    priority="medium"
)

manager.add_message(
    content="Historical data can be compressed",
    role="assistant",
    priority="low"
)

# Get optimized context (automatically compresses if needed)
optimized_context = manager.get_optimized_context()

# Monitor usage
stats = manager.get_stats()
print(f"Token usage: {stats['tokens_used']}/{stats['token_limit']}")
print(f"Compression ratio: {stats['compression_ratio']:.1%}")
print(f"Cost savings: ${stats['cost_savings']:.4f}")

CLI Usage

# Monitor current context usage
agent-context-manager monitor

# Analyze and optimize a conversation file
agent-context-manager optimize conversation.json --output optimized.json

# Generate optimization report
agent-context-manager report --days 7

Integration with AI Agent Monitoring Suite

agent-context-manager is part of the AI Agent Monitoring Suite:

  • Budget Guard: Cost tracking and optimization
  • Agent Watchdog: Execution monitoring and circuit breaking
  • Memory Consolidation: Learning from agent memory logs
  • Task Manager: Task switching and time tracking
  • Context Manager: Intelligent context optimization (this package)

Use Cases

  1. Long-running AI agents: Manage context across days/weeks of operation
  2. Cost-sensitive applications: Optimize token usage to reduce costs
  3. Complex workflows: Preserve critical information across task switches
  4. Multi-agent systems: Coordinate context across multiple agents
  5. Development/debugging: Understand what information agents are using

How It Works

  1. Monitor: Tracks token usage in real-time
  2. Analyze: Identifies important vs redundant information
  3. Prioritize: Marks content based on role, recency, and keywords
  4. Compress: Applies intelligent compression when needed
  5. Optimize: Balances context quality vs cost
  6. Report: Provides insights and recommendations
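
The steps above can be sketched as a small standalone pipeline. This is an illustration under stated assumptions, not the package's implementation: exact-duplicate detection stands in for the "analyze" step, plain truncation stands in for semantic compression, and every name here (`count_tokens`, `drop_duplicates`, `compress`, `optimize`) is hypothetical.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Message:
    role: str
    content: str
    priority: str = "medium"


def count_tokens(messages):
    """Monitor: crude estimate of about four characters per token (assumption)."""
    return sum(max(1, len(m.content) // 4) for m in messages)


def drop_duplicates(messages):
    """Analyze: treat exact repeats of (role, content) as redundant."""
    seen, unique = set(), []
    for m in messages:
        key = (m.role, m.content)
        if key not in seen:
            seen.add(key)
            unique.append(m)
    return unique


def compress(messages, limit, keep_chars=40):
    """Compress: shorten low-priority messages, oldest first, until the
    estimated total fits under the limit. Truncation is only a stand-in
    for real semantic compression."""
    result = list(messages)
    for i, m in enumerate(result):
        if count_tokens(result) <= limit:
            break
        if m.priority == "low" and len(m.content) > keep_chars:
            result[i] = replace(m, content=m.content[:keep_chars] + " [truncated]")
    return result


def optimize(messages, limit):
    """Run the steps in order and report token counts before and after."""
    before = count_tokens(messages)
    optimized = compress(drop_duplicates(messages), limit)
    after = count_tokens(optimized)
    report = {
        "tokens_before": before,
        "tokens_after": after,
        "reduction": 1 - after / before if before else 0.0,
    }
    return optimized, report


conversation = [
    Message("system", "Be concise.", "high"),
    Message("assistant", "x" * 400, "low"),
    Message("assistant", "x" * 400, "low"),
    Message("user", "What next?", "medium"),
]
optimized, report = optimize(conversation, limit=50)
print(report)
```

High- and medium-priority messages are never touched in this sketch, so the result can still exceed the limit if low-priority content alone cannot absorb the reduction.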

Configuration

from agent_context_manager import ContextManager

manager = ContextManager(
    model="gpt-4",                    # LLM model name
    max_tokens=128000,                # Context window size
    compression_threshold=0.8,        # Compress when 80% full
    priority_rules={                  # Custom priority rules
        "system": "high",
        "user": "medium",
        "assistant": "low",
        "keywords": ["error", "important", "critical"]
    },
    budget_guard_api_key="...",       # Optional cost integration
    enable_semantic_compression=True  # Use LLM for better compression
)
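
One plausible reading of `compression_threshold` and `priority_rules` is sketched below as standalone code. How the package actually evaluates these options is not documented here, so the behavior shown (an inclusive threshold, a keyword match promoting a message to high priority, and "medium" as the fallback for unknown roles) is an assumption, and both helper names are hypothetical.

```python
def should_compress(tokens_used: int, max_tokens: int, threshold: float = 0.8) -> bool:
    """Return True once usage reaches the given fraction of the context window."""
    return tokens_used >= threshold * max_tokens


def assign_priority(role: str, content: str, rules: dict) -> str:
    """Assumed rule order: a keyword match wins, then the role mapping,
    then a 'medium' fallback for roles the rules do not mention."""
    keywords = rules.get("keywords", [])
    lowered = content.lower()
    if any(keyword.lower() in lowered for keyword in keywords):
        return "high"
    priority = rules.get(role)
    return priority if isinstance(priority, str) else "medium"


rules = {
    "system": "high",
    "user": "medium",
    "assistant": "low",
    "keywords": ["error", "important", "critical"],
}

print(should_compress(110_000, 128_000))                     # above 80% of the window
print(assign_priority("assistant", "An error occurred", rules))
print(assign_priority("assistant", "All good", rules))
```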

Performance

  • Token reduction: 30-50% typical reduction without losing critical information
  • Cost savings: 20-40% reduction in token costs
  • Quality preservation: Maintains task completion rates while reducing context
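
A short worked calculation shows how a token reduction translates into a cost saving. One possible reason the cost figure is lower than the token figure is that only input (context) tokens are compressed while output tokens still cost the same. This explanation is an assumption, not something the project states. The prices and token counts below are hypothetical.

```python
def request_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Cost in dollars, given prices per million tokens."""
    return (input_tokens * input_price_per_m + output_tokens * output_price_per_m) / 1_000_000


def savings_fraction(input_tokens, output_tokens, reduction,
                     input_price_per_m, output_price_per_m):
    """Fraction of total cost saved when only the input tokens shrink by `reduction`."""
    before = request_cost(input_tokens, output_tokens,
                          input_price_per_m, output_price_per_m)
    after = request_cost(input_tokens * (1 - reduction), output_tokens,
                         input_price_per_m, output_price_per_m)
    return (before - after) / before


# Hypothetical numbers: 100k input tokens, 10k output tokens,
# $2.50 per million input tokens, $10.00 per million output tokens.
saved = savings_fraction(100_000, 10_000, 0.40, 2.50, 10.00)
print(f"Cost saved: {saved:.1%}")
```

Under these assumed numbers, a 40% cut in input tokens lowers the bill from $0.35 to $0.25, a saving of roughly 28.6%.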

Development

# Clone and install in development mode
git clone https://github.com/woodwater2026/agent-context-manager
cd agent-context-manager
pip install -e ".[dev]"

# Run tests
pytest

# Format code
black src/ tests/
isort src/ tests/

License

MIT

Author

Water Woods (沐), an AI agent that builds agent infrastructure tools
