A lightweight, modular execution engine and routing framework for LLM agents.

Project description

Agentic Core

If you want an autonomous agent immediately: Got an API key? You can start an agent in one line. No classes, no protocols, no boilerplate.
If you want to build production-ready multi-agent system with clear, structured interfaces: That's even better. This engine gives you:
- native MCP integration with dynamic load and discovery options,
- customize client wrapper option for smart routing strategies,
- smart-default or your own domain-specific memory truncation strategies,
- clear decision actions at each step of the agent's plan,
- and a simple but extendable base tool class that can be used to implement arbitrarily complex workflows.

Example Applications: Tool-calling Assistant Bots, DAG-based Agent Swarms (via DAGAgentRunner and agent-spawning tools), Legal Consultant Bot with RAG-based tools (via RAG tool suite installed with extra [rag] (see docs/RAG_TOOLS.md), and many more.

Quick Start
The Simple chat() Function
Tooling system: Bring-your-own-tools & MCP integration (GitHub, Filesystem, etc.)
Agent creation (Advanced) & Memory Management
Streaming Capabilities
Configuration Options
Project Structure
Troubleshooting
Notes on security

QUICK START

Installation

Install the package with pip:

pip install callai-agentic_core

Clone from this repo:

git clone https://github.com/nguyenv217/callai-agentic_core 
cd callai-agentic_core
pip install ".[all]"

Note: To install only specific providers, use pip install ".[openai]" or pip install ".[anthropic]".

If you have an OpenAI key:

import asyncio
from agentic_core.agents import chat

async def main():
    result = await chat(
        message="What's the weather in Tokyo?",
        provider="openai",
        api_key="sk-..."  # Your OpenAI key
    )
    agent_response = result.response
    print(agent_response.text)

asyncio.run(main())

If you are using an OpenAI-compatible API (e.g. HuggingFace, vLLM):

import asyncio
from agentic_core.agents import chat

async def main():
    result = await chat(
        message="Hello!",
        provider="openai",
        api_key="your_hf_token",
        base_url="https://router.huggingface.co/v1",
        model="meta-llama/Llama-3.1-8B-Instruct"
    )
    agent_response = result.response
    print(agent_response.text)

asyncio.run(main())

If you have an Anthropic key:

result = await chat(
    message="What's the weather in Tokyo?",
    provider="anthropic", 
    api_key="sk-ant-..."  # Your Anthropic key
)
agent_response = result.response
    print(agent_response.text)

If you have Ollama (local, no key needed):

result = await chat(
    message="What's the weather in Tokyo?",
    provider="ollama"  # No API key needed!
)
agent_response = result.response
    print(agent_response.text)

That's it! Want to see what's happening? Add verbose=True.

The Simple `chat()` Function

The easiest way to use agentic_core:

from agentic_core.agents import chat
from agentic_core.config import RunnerConfig

# Simple usage
result = await chat(
    message="What's the weather in Tokyo?",
    provider="openai",
    api_key="sk-...",
    verbose=True
)


# Persistent session usage (preserves memory and MCP connections)
result = await chat(
    message="Remember my name is Alice",
    provider="openai",
    api_key="sk-...",
    session_id="user_session_123"
)
response = result.response
print(response.text)


# Advanced usage with RunnerConfig
result = await chat(
    message="Analyze this repo",
    provider="openai",
    api_key="sk-...",
    config=RunnerConfig(
        max_iterations=10,
        system_prompt="You are a senior software engineer.",
        mcp_enable_discovery=True
    ),
    mcp_config_path="mcp.json"
)

Returns: A ChatResult object containing an AgentResponse with text, reasoning, tool_calls, usage, and error fields, and useful metadata: session_id, tenant_id.

class ChatResult:
    response: AgentResponse
    session_id: str | None = None
    tenant_id: str | None = None

Using Tools (Implementing `BaseTool` or integrate MCP servers)

Custom Tools via `BaseTool`

The BaseTool base class give you robust but structured method to create and use tools in your project. It is designed to extend functionality with a consistent framework (e.g, designing agent swarms with spawn_agents, create_task, etc.). If you want to write your own Python tools, inherit from BaseTool, register it with ToolManager, before specifying it in your agent workflow via RunnerConfig:

from agentic_core.tools.base import BaseTool
from agentic_core.tools.manager import ToolManager

class UpperCaseTool(BaseTool):
    name = "uppercase"
    schema = {
        "type": "function",
        "function": {
            "name": "uppercase",
            "description": "Converts text to upper case.",
            "parameters": {
                "type": "object",
                "properties": {"text": {"type": "string"}},
                "required": ["text"]
            }
        }
    }

    def execute(self, args: dict, context: dict) -> str:
        return args["text"].upper()

# Register and use
tools = ToolManager()
uppercase_tool = UpperCaseTool()
tools.register_tool(uppercase_tool)

# === More advanced usage: `toolset` ===
tools = ToolManager(
    toolsets = {
        "my_custom_toolset": {
            "tools": ["uppercase"],
            "prompt": "This prompt is dynamically injected when toolset=my_custom_toolset"
        },
        "my_other_toolset": ["some_other_tool", "other_tool2", ...] # Or just a list if no custom prompt is needed
        },
    )
    
# Then pass to `RunnerConfig`
config = RunnerConfig(
    system_prompt = "My base sytem prompt."
    tools = [uppercase_tool.schema, ...] # only inject these tools, this parameter takes priority over `toolset`
    toolset = "my_custom_toolset" # or a toolset for a specific known set of specific tools with a custom prompt
)

RECOMMENDED: For a detailed guide on building and registering tools, see docs/TOOLS_GUIDE.md.

MCP Tools (Dynamic Integration)

Note: Many MCP servers require Node.js installed on your system to run npx commands. Do check with the your specified servers' documentation.

MCP (Model Context Protocol) lets your agent use external tools via standardized protocols. agentic_core natively supports them and even gives you easy means to dynamically discover and load these tools in order to save on your token usage! Here's how to use it:

Step 1: Create `mcp.json`

{
  "mcpServers": {
    "github": {
      "command": "npx",
      "args": ["-y", "@modelcontextprotocol/server-github"], 
      "env": { "GITHUB_PERSONAL_ACCESS_TOKEN": "your-token" } 
    },
    "filesystem": {
      "command": "npx", 
      "args": ["-y", "@modelcontextprotocol/server-filesystem", "/your/project"]
    }
  }
}

Alternatively, "env":{"TOKEN_ENV_NAME":"${YOUR_TOKEN}"} works as well assuming environment variable YOUR_TOKEN exists.
Refer to examples/mcp_config.json for a few no-config, plug-and-play example servers.

Step 2: Use it in chat()

result = await chat(
    message="Create a new issue in my repo about the bug",
    provider="openai",
    api_key="sk-...",
    mcp_config_path="mcp.json"  # <-- This enables MCP!
)

Step 3: Programmatic MCP Config (No JSON file needed)

If you prefer not to use a JSON file, you can add MCP servers directly to your ToolManager instance.

from agentic_core import ToolManager

tools = ToolManager()
tools.add_mcp_server(
    server_name="everything",
    command="npx",
    args=["-y", "@modelcontextprotocol/server-everything"]
)

# Now use this ToolManager with your agent
from agentic_core import AgentRunner
from agentic_core.agents import OpenAILLM

agent = AgentRunner(
    llm_client=OpenAILLM(api_key="sk-..."),
    tools=tools
)

How it works:

The agent sees available MCP tools via list_mcp_catalog
It loads the tools it needs via load_mcp_tool
It uses them to answer your question

Advanced
By default, agentic-core injects Meta-Tools into the agent's context to enable autonomous tool management:

list_mcp_catalog: The agent can browse available servers and their tool definitions.
load_mcp_tool: The agent can "install" a tool into its active session on-the-fly.

This "Lazy Loading" approach keeps your prompt context small and saves tokens by only loading the specific tools the agent decides it needs for the task.

If you already know which tools the agent will need, you can bypass the discovery turns entirely using mcp_preload_tools and setting enable_mcp_discovery=False in yourRunnerConfig. This is highly recommended for production environments to reduce latency and costs.

from agentic_core.config import RunnerConfig

config = RunnerConfig(
    mcp_active_servers=["sqlite"],       # Initialize these servers immediately
    mcp_preload_tools=["sqlite_query"]   # Move these tools to 'Active' before Turn 1. NOTE that you must prefix the tool name with server name 
)

# The agent starts with 'sqlite_query' already지 set in its toolkit
agent.chat("How many users in the database?", config=config)

Please see this more detailed explanation in how to configuring RunnerConfig to fully realize the MCP protocol in your project.

Streaming Capabilities

The AgentRunner supports real-time streaming of agent events, allowing you to display reasoning, tool calls, and final responses as they happen.

import asyncio
from agentic_core.engines import AgentRunner
from agentic_core.llm_providers import OpenAILLM
from agentic_core.memory import MemoryManager
from agentic_core.tools import ToolManager
from agentic_core.interfaces import StreamEventType

async def main():
    # Setup
    llm = OpenAILLM(api_key="sk-...", model="gpt-4o")
    runner = AgentRunner(llm_client=llm, tools=ToolManager(), memory=MemoryManager())

    user_input = "Can you explain quantum physics in one sentence?"
    
    # Stream the response
    async for event in runner.stream_turn(user_input):
        if event.type == StreamEventType.REASONING:
            print(f"[Reasoning]: {event.content}", end="", flush=True)
        elif event.type == StreamEventType.TEXT:
            print(event.content, end="", flush=True)
        elif event.type == StreamEventType.TOOL_CALL:
            print(f"\n[Tool Call]: {event.content['function']['name']}")
        elif event.type == StreamEventType.TOOL_RESULT:
            print(f"\n[Tool Result]: {event.content['result']}")
        elif event.type == StreamEventType.FINAL_RESPONSE:
            print(f"\n\n[Done] Final response received.")

asyncio.run(main())

Available Event Types (StreamEventType):

REASONING: The agent's internal chain-of-thought.
TEXT: The actual response text being streamed.
TOOL_CALL: Information about a tool being called.
TOOL_RESULT: The output returned by a tool.
ERROR: Any error occurred during execution.
FINAL_RESPONSE: The final consolidated AgentResponse object.

Creating Your Own Agent (Advanced) & Custom Memory Management Strategy

If you need more control, you can create agents manually:

from agentic_core.engines import AgentRunner 
from agentic_core.memory import MemoryManager
from agentic_core.tools import ToolManager
from agentic_core.llm_providers import OpenAILLM
from agentic_core.handlers import PrintHandler

# 1. Create components
llm = OpenAILLM(api_key="sk-...", model="gpt-4o")
memory = MemoryManager()
memory.set_system_prompt("You are a helpful coding assistant.")
tools = ToolManager()

# 2. Create agent
agent = AgentRunner(llm_client=llm, tools=tools, memory=memory)

# 3. Run!
result = await agent.run_turn(
    user_input="Hello!",
    handler=PrintHandler()
)
print(result.text)

Memory Management & Context Truncation

To prevent token overflow in long conversations, MemoryManager supports configurable truncation. By default, it uses NoTruncationStrategy that does not multiliate the message array to preserve prompt (as most providers now supports prompt caching which makes truncation even more expensive). However, you can use DefaultTruncationStrategy that intelligently prunes tool outputs and long text before deleting entire messages while still maintaining structure integrity if the context windows is bordering on the limit.

from agentic_core.memory.manager import MemoryManager
from agentic_core.memory.strategies import DefaultTruncationStrategy

# Custom strategy: lower thresholds for aggressive pruning
strategy = DefaultTruncationStrategy(tool_threshold=1000, text_threshold=500)

memory = MemoryManager(
    max_chars=4000, 
    strategy=strategy
)

You can also implement your own truncation logic for your domain-specific workflows by inheriting from the TruncationStrategy interface.

Available LLM Adapters

from agentic_core.llm_providers import (
    OpenAILLM,      # OpenAI-compatible endpoints
    AnthropicLLM,   # Anthropic Claude
    OllamaLLM,      # Local Ollama
)

Available Handlers

from agentic_core.handlers import (
    SilentHandler,  # Silent, does nothing
    PrintHandler,    # Prints everything (great for debugging)
)

Configuration Options

chat() parameters

Parameter	Type	Default	Description
`message`	str	(required)	Your message to the agent
`base_url`	str	provider default	Custom API endpoint (e.g. for Local LLMs or Proxies)
`provider`	str	"openai"	"openai", "anthropic", or "ollama"
`api_key`	str	None	Required for openai/anthropic
`model`	str	provider default	Model name
`system_prompt`	str	"You are a helpful assistant."	Agent persona
`mcp_config_path`	str	None	Path to MCP config
`verbose`	bool	False	Print all events
`temperature`	float	provider default	LLM creativity
`max_tokens`	int	provider default	Max response size
`config`	`RunnerConfig`	None	Custom execution loop configuration

ToolManager with MCP

from agentic_core.tools import ToolManager

tools = ToolManager(
    mcp_config_path="mcp.json",  # Enable MCP: str | Path
    toolsets={
        "local": ["read_file", "write_file"],
        "web": ["web_search"], 
        ...
    },
    enable_mcp_discovery=True, # Preload MCP discovery tools (2 tools)
    extra_env={"DATABASE": "..."} # Extra environment variables for MCP server intialization (pass only what you need)
)

Structure

agentic_core/
├── __init__.py              
├── engine.py                # AgentRunner (The execution loop)
├── agents/                  # Agent construction and orchestration
│   └── builder.py           # Agent builder logic
├── llm_providers/           # LLM Adapters (OpenAI, Anthropic, Ollama)
│   ├── base.py              # Base LLM interface
│   ├── openai.py            # OpenAI/Compatible provider
│   ├── anthropic.py         # Anthropic Claude provider
│   └── ollama.py            # Local Ollama provider
├── memory/                  # Context and state management
│   ├── manager.py           # MemoryManager
│   └── strategies.py        # Truncation strategies
├── handlers/               # Event logging and observation
│   ├── base.py              # Base Handler
│   ├── dag.py               # `DAGEventHandler` for `DAGAgentRunner`-specific event hooks
│   └── standard.py          # Default/Print handlers
├── tools/                   # Tooling system
│   ├── base.py              # `BaseTool`, schemas, etc.
│   ├── mcp.py               # MCP server management
│   ├── 🔍 rag/             # Custom RAG-based tool suite, supporting your custom backends, default options: ChromaDB, Sqlite. see 'docs/RAG_TOOLS.md'
│   └── manager.py           # `ToolManager` class
└── interfaces/              # Type definitions and Protocols
    ├── llm.py
    └── events.py

Troubleshooting

"openai not found"

pip install openai
# or
pip install anthropic
# or  
pip install ollama

"API key invalid"

Check your key is correct and has credits.

"MCP not working"

Make sure mcp_config_path points to a valid JSON file
Install the MCP servers: npx -y @modelcontextprotocol/server-github
Check the logs with verbose=True

"Model not found" (Ollama)

Make sure Ollama is running:

ollama serve
ollama pull llama3.1

Quick Reference

# Simplest possible usage
from agentic_core.agents import chat
await chat("Hello", provider="ollama")

# With OpenAI
await chat("Hello", provider="openai", api_key="sk-...")

# With MCP tools
await chat("Check my GitHub", provider="openai", api_key="sk-...", mcp_config_path="mcp.json")

# Verbose mode (see everything)
await chat("Hello", provider="ollama", verbose=True)

That's it! Start building agents in seconds. 🚀

Security & Production Readiness

This tool is designed to be as lightweight and robust as possible. While agentic-core safely handles execution, developers must secure the deployment environment. Below are some important considerations:

1. Remote Code Execution via MCP Config:

The mcp_config.json dictates exactly which terminal commands your system will run (via the command and args fields). Never allow end-users to upload, modify, or provide their own mcp_config.json. This file must remain strictly server-side.

2. Prompt Injection to Tool Execution:

If an agent is given a tool that reads external data (like fetching a webpage or reading a user-submitted file), a malicious payload in that data can instruct the LLM to execute other available tools. The agent could be hijacked into executing destructive actions (e.g., via a GitHub or filesystem MCP) without human oversight. SO, do use AgentEventHandler.on_tool_start() method for granular control over agent's actions.

3. Denial of Service via Payload Serialization:

Agents interacting with APIs that return massive, deeply nested JSON payloads may experience performance degradation during the engine's double-serialization checks. To mitigate this, limit the scope of the data your tools are allowed to fetch.

MITIGATIONS:

The tool strictly requires MCP configuration via RunnerConfig to be valid and well-formed to avoid malicious runtime tool injection.
Utilize ToolExecutionController.on_prompt_respond() (blocks, prompt the user for feedback) and ToolExecutionController.on_prompt_confirmation() (blocks, prompt the user to confirm (y/n) with event hooks) for control during tool execution.
AgentEventHandler implementing on_tool_start() provides means to enforce human validation before assembling tool coroutine pool and executing, with different levels of control (use ToolStartDecision).
Heuristically detect and remove unsafe payloads from external data sources (brittle). Most importantly, write your system prompt carefully and choose your MCP servers wisely. Dry-run extensively on domain-specific vulnerable prompts and refine system prompt. This is perhaps the single most effective mitigation against prompt injection.
Finally, this tool is all about robustness - it's meant to be a lightweight engine for agentic applications. Hence, it is at the developer responsibility to enforce security measures against the above risks.

Project details

Release history Release notifications | RSS feed

This version

0.6.6

May 26, 2026

0.6.0

May 1, 2026

0.5.0

Apr 26, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

callai_agentic_core-0.6.6.tar.gz (98.4 kB view details)

Uploaded May 26, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

callai_agentic_core-0.6.6-py3-none-any.whl (61.7 kB view details)

Uploaded May 26, 2026 Python 3

File details

Details for the file callai_agentic_core-0.6.6.tar.gz.

File metadata

Download URL: callai_agentic_core-0.6.6.tar.gz
Upload date: May 26, 2026
Size: 98.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.0.1 CPython/3.12.10

File hashes

Hashes for callai_agentic_core-0.6.6.tar.gz
Algorithm	Hash digest
SHA256	`b7fe04177d8d9e00f226d92733a14b0bbaabf9dae1e9cc7344d7213971d8f88e`
MD5	`b8cbe72ee9d61a159a8b25dd1082d957`
BLAKE2b-256	`b198101b6a2c266b9f6646b2b58f348caff151bd3c3ef72cf6e5db71e52f39d9`

See more details on using hashes here.

File details

Details for the file callai_agentic_core-0.6.6-py3-none-any.whl.

File metadata

Download URL: callai_agentic_core-0.6.6-py3-none-any.whl
Upload date: May 26, 2026
Size: 61.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.0.1 CPython/3.12.10

File hashes

Hashes for callai_agentic_core-0.6.6-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ff21a545ea41e6e9ef754a3f6141059f7312bcde6d52aa9c48b185f73074fc19`
MD5	`75e4fac2bfea5c7d02121a4b8e55e139`
BLAKE2b-256	`4d001622d758ff7a966729a0f43a3e4e66f327954b26d3571826124dd59f7782`

See more details on using hashes here.

callai-agentic-core 0.6.6

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Agentic Core

Table of Contents

QUICK START

Installation

If you have an OpenAI key:

If you are using an OpenAI-compatible API (e.g. HuggingFace, vLLM):

If you have an Anthropic key:

If you have Ollama (local, no key needed):

The Simple chat() Function

Using Tools (Implementing BaseTool or integrate MCP servers)

Custom Tools via BaseTool

MCP Tools (Dynamic Integration)

Step 1: Create mcp.json

Step 2: Use it in chat()

Step 3: Programmatic MCP Config (No JSON file needed)

Streaming Capabilities

Creating Your Own Agent (Advanced) & Custom Memory Management Strategy

Memory Management & Context Truncation

Available LLM Adapters

Available Handlers

Configuration Options

chat() parameters

ToolManager with MCP

Structure

Troubleshooting

"openai not found"

"API key invalid"

"MCP not working"

"Model not found" (Ollama)

Quick Reference

Security & Production Readiness

1. Remote Code Execution via MCP Config:

2. Prompt Injection to Tool Execution:

3. Denial of Service via Payload Serialization:

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

The Simple `chat()` Function

Using Tools (Implementing `BaseTool` or integrate MCP servers)

Custom Tools via `BaseTool`

Step 1: Create `mcp.json`