CodeAct Agent.

These details have not been verified by PyPI

Project description

Quantalogic CodeAct

Quantalogic CodeAct is a powerful, modular framework for building AI agents that solve complex tasks through iterative reasoning and action. Built on the ReAct (Reasoning and Acting) paradigm, it integrates language models (via litellm), a robust tool ecosystem, and a secure Python execution environment powered by quantalogic-pythonbox. With intuitive interfaces (CLI, interactive shell, and SDK), CodeAct supports a wide range of applications—from mathematical problem-solving to conversational interactions—making it ideal for developers, researchers, and end-users.

Why CodeAct?
What is CodeAct?
How to Use CodeAct
Contributing
References
License

Why CodeAct?

Modern AI agents must tackle intricate, multi-step tasks, adapt to diverse contexts, and provide seamless user experiences. CodeAct meets these demands by offering:

Robust Task Solving: Decomposes complex problems into iterative reasoning and executable actions for precise solutions.
Modularity and Extensibility: Supports plug-and-play components like tools, reasoners, and executors for domain-specific customization.
Accessible Interfaces: Provides CLI, interactive shell, and SDK, catering to both casual users and developers.
Scalability: Handles applications from simple calculations to advanced automation and conversational workflows.

Whether you're automating processes, conducting AI research, or seeking an intelligent assistant, CodeAct delivers flexibility and power.

What is CodeAct?

Quantalogic CodeAct is a framework within the Quantalogic ecosystem that enables the creation of AI agents capable of reasoning and acting using executable Python code as the primary action mechanism. It leverages the ReAct paradigm, integrating language models, extensible tools, and secure execution environments to solve tasks, engage in dialogues, and support custom workflows.

Core Modules

CodeAct’s modular design ensures each component has a clear responsibility, making it easy to extend or customize:

agent.py: Implements the CodeActAgent, orchestrating the ReAct loop.
agent_config.py: Manages agent configuration via YAML files.
reasoner.py: Generates reasoning steps and code using language models.
executor.py: Safely executes Python code and tool calls using PythonBox.
tools_manager.py: Registers and manages tools and toolboxes.
tools/: Includes built-in tools (e.g., AgentTool, RetrieveMessageTool).
conversation_manager.py: Tracks conversation history for context-aware interactions.
working_memory.py: Manages task-specific execution history.
completion_evaluator.py: Evaluates task completion using LLM verification.
events.py: Defines event and result models with Pydantic.
constants.py: Stores project constants (e.g., default model, token limits).
cli.py & cli_commands/: Provides CLI entry points and subcommands.
templates/ & prompts/: Jinja2 templates for LLM prompts and responses.
plugin_manager.py: Enables dynamic loading of plugins and toolboxes.
xml_utils.py: Handles XML formatting for results and actions.
llm_util.py: Manages LLM completion with streaming support.

CodeAct Principle and ReAct Paradigm

CodeAct is inspired by the ReAct paradigm from "ReAct: Synergizing Reasoning and Acting in Language Models" (Yao et al., 2022) and advanced in "Executable Code Actions Elicit Better LLM Agents" (Yang et al., 2024).

ReAct Paradigm

ReAct combines reasoning (generating plans or thoughts) with acting (executing actions) in an iterative loop:

Reason: Analyze the task and plan actions using an LLM.
Act: Execute actions (e.g., code, tool calls) in the environment.
Iterate: Incorporate feedback from actions to refine reasoning until task completion.

This synergy reduces errors, improves adaptability, and enhances interpretability by producing explicit reasoning traces alongside actions.

CodeAct Enhancement

CodeAct builds on ReAct by using executable Python code as the action format, leveraging LLMs’ code-generation capabilities. Key benefits include:

Unified Action Space: Python code allows flexible tool composition, control flow, and error handling.
Secure Execution: Powered by quantalogic-pythonbox, ensuring safe code execution with resource limits.
Empirical Superiority: The CodeAct paper demonstrates up to 20% higher success rates compared to JSON or text-based actions.

Example:

Task: "Calculate 3 + 5"
Reasoning: "I’ll write a Python script to perform the addition."
Action:
<execute>
result = 3 + 5
print(result)
</execute>
Result: "8"

ReAct Agent

The CodeActAgent is the heart of CodeAct, implementing the ReAct loop:

Task Input: Receives a task (e.g., "Calculate the factorial of 5").
Reasoning Phase: Uses a Reasoner to generate a plan or Python code.
Action Phase: Employs an Executor to run the code or invoke tools.
Evaluation: Checks task completion via CompletionEvaluator, iterating if needed.

It maintains context through WorkingMemory and ConversationManager, supporting multi-step tasks and dialogues.

Architecture

CodeAct’s architecture is modular and scalable, as shown below:

graph TD
    classDef user fill:#ffe2e2,stroke:#e3bcbc,stroke-width:2px,color:#333;
    classDef shell fill:#e2f0ff,stroke:#bcd5e3,stroke-width:2px,color:#333;
    classDef cli fill:#e2ffe9,stroke:#bce3c8,stroke-width:2px,color:#333;
    classDef sdk fill:#f2e2ff,stroke:#d3bce3,stroke-width:2px,color:#333;
    classDef agent fill:#fff7e2,stroke:#e3d9bc,stroke-width:2px,color:#333;
    classDef reasoner fill:#e2fff7,stroke:#bce3d9,stroke-width:2px,color:#333;
    classDef executor fill:#e2e7ff,stroke:#bcbce3,stroke-width:2px,color:#333;
    classDef tools fill:#f9e2ff,stroke:#e3bcdc,stroke-width:2px,color:#333;
    classDef pythonbox fill:#e2f7ff,stroke:#bcd5e3,stroke-width:2px,color:#333;

    User[User] -->|Interacts| Shell[Shell Interface]
    User -->|Runs| CLI[CLI Commands]
    User -->|Programs| SDK[Agent SDK]
    Shell -->|Commands| Agent[CodeActAgent]
    CLI -->|Tasks| Agent
    SDK -->|Controls| Agent
    Agent -->|Reasons| Reasoner[Reasoner]
    Agent -->|Executes| Executor[Executor]
    Reasoner -->|Generates Actions| Agent
    Executor -->|Uses| Tools[Tools]
    Tools -->|Returns Results| Executor
    Executor -->|Executes Code| PythonBox["Python Toolbox (PythonBox)"]
    PythonBox -->|Returns Results| Executor
    Executor -->|Updates Context| Agent
    Agent -->|Responds| Shell
    Agent -->|Outputs| CLI
    Agent -->|Returns| SDK

    class User user;
    class Shell shell;
    class CLI cli;
    class SDK sdk;
    class Agent agent;
    class Reasoner reasoner;
    class Executor executor;
    class Tools tools;
    class PythonBox pythonbox;

Agent: Orchestrates the ReAct loop, managing state and history.
Reasoner: Generates code or plans using LLMs (e.g., Gemini, DeepSeek).
Executor: Executes actions securely with PythonBox.
Tools: Modular functions for specialized tasks.
PythonBox: Provides a sandboxed environment for safe code execution.

How to Use CodeAct

Installation

Prerequisites

Python: 3.12 or higher
Poetry: Install via pip install poetry
API Keys: Required for LLMs (e.g., GEMINI_API_KEY for Gemini models)

Installation Steps

Clone the Repository:

git clone https://github.com/quantalogic/quantalogic-codeact.git
cd quantalogic-codeact

Install Dependencies:

poetry install

Alternatively, install via pip:

pip install quantalogic-codeact

Set Environment Variables:
```
export GEMINI_API_KEY="your-api-key"
```
Verify Installation:
```
poetry run quantalogic_codeact --help
```

Quick Start

CodeAct offers two primary interaction modes: an interactive shell for real-time engagement and a CLI for direct task execution.

Interactive Shell

Start the shell:

poetry run quantalogic_codeact shell

Interact using commands like /solve for tasks or /chat for conversations. Example:

[cfg:config.yaml] [Agent] [codeact]> /solve "Calculate 2 + 2"
[Step 1 Result]
- Status: Success
- Value: 4
- Execution Time: 0.12 seconds
- Completed: True
[Final Answer]
4

Command-Line Interface (CLI)

Run tasks directly:

poetry run quantalogic_codeact task "Calculate 3 * 4" --streaming

List available resources:

poetry run quantalogic_codeact list-toolboxes
poetry run quantalogic_codeact list-models

Using the Agent SDK

The Agent SDK enables programmatic control over agents, ideal for developers building custom applications. Below is an example integrating a custom tool and monitoring task progress.

from quantalogic_codeact.codeact.agent import Agent
from quantalogic_codeact.codeact.agent_config import AgentConfig
from quantalogic_toolbox import create_tool, Tool

# Define a custom tool
@create_tool
async def factorial(n: int) -> int:
    """Calculate the factorial of a number."""
    if n < 0:
        raise ValueError("Factorial is not defined for negative numbers")
    result = 1
    for i in range(1, n + 1):
        result *= i
    return result

# Configure agent
config = AgentConfig(
    model="deepseek/deepseek-chat",
    max_iterations=5,
    enabled_toolboxes=["math_tools"],
    tools=[factorial],
    personality={"traits": ["logical", "precise"]}
)
agent = Agent(config=config)

# Monitor progress
def monitor_event(event):
    if event.event_type == "StepStarted":
        print(f"Step {event.step_number} started")
    elif event.event_type == "ActionExecuted":
        print(f"Step {event.step_number} result: {event.result.to_summary()}")
    elif event.event_type == "TaskCompleted":
        print(f"Task completed with answer: {event.final_answer}")

agent.add_observer(monitor_event, ["StepStarted", "ActionExecuted", "TaskCompleted"])

# Solve a task
result = agent.sync_solve("Calculate the factorial of 5")
final_answer = result[-1].get("result", "No result")
print(f"Final Answer: {final_answer}")

# Chat asynchronously
async def run_chat():
    response = await agent.chat("Explain factorials")
    print(f"Explanation: {response}")

import asyncio
asyncio.run(run_chat())

Example Output:

Step 1 started
Step 1 result: - Status: Success
- Task Status: completed
- Result: 120
- Execution Time: 0.15 seconds
Task completed with answer: 120
Final Answer: 120
Explanation: A factorial of a non-negative integer n, denoted n!, is the product of all positive integers less than or equal to n. For example, 5! = 5 * 4 * 3 * 2 * 1 = 120.

ReAct Loop Visualization:

sequenceDiagram
    participant U as User
    participant A as Agent
    participant R as Reasoner
    participant E as Executor
    participant T as Tools
    U->>A: Solve "Factorial of 5"
    A->>R: Generate plan
    R-->>A: Plan: Use factorial tool
    A->>E: Execute factorial(5)
    E->>T: Call factorial tool
    T-->>E: Result: 120
    E-->>A: Result XML: 120
    A-->>U: Final Answer: 120

Commands

Shell Commands

Below is a comprehensive list of shell commands:

Command	Description	Example Usage
`/help [command]`	Show help for commands or a specific command	`/help solve`
`/chat <message>`	Send a chat message to the agent	`/chat How are you?`
`/solve <task>`	Solve a task	`/solve Calculate 2 + 2`
`/mode [chat	codeact]`	Switch between chat and task-solving modes
`/stream [on	off]`	Toggle streaming output
`/exit`	Exit the shell	`/exit`
`/history [n]`	Show last `n` messages (default: all)	`/history 5`
`/clear`	Clear conversation history	`/clear`
`/agent <name>`	Switch or show agent details	`/agent MathBot`
`/set <field> <value>`	Set a config field and create new agent	`/set model deepseek/deepseek-chat`
`/set temperature <value>`	Set or show LLM temperature (0 to 1)	`/set temperature 0.7`
`/config show`	Display current configuration	`/config show`
`/config save <file>`	Save config to a file	`/config save myconfig.yaml`
`/config load <file>`	Load config from a file	`/config load myconfig.yaml`
`/toolbox install <name>`	Install a toolbox	`/toolbox install math_tools`
`/toolbox uninstall <name>`	Uninstall a toolbox	`/toolbox uninstall math_tools`
`/toolbox enable <name>`	Enable a toolbox	`/toolbox enable math_tools`
`/toolbox disable <name>`	Disable a toolbox	`/toolbox disable math_tools`
`/toolbox installed`	Show installed toolboxes	`/toolbox installed`
`/toolbox tools <name>`	List tools in a toolbox	`/toolbox tools math_tools`
`/toolbox doc <name> <tool>`	Show tool documentation	`/toolbox doc math_tools integrate`
`/listmodels`	List available models	`/listmodels`
`/version`	Show package version	`/version`
`/tutorial`	Display a tutorial for new users	`/tutorial`
`/inputmode [single	multi]`	Toggle single-line or multiline input
`/contrast [on	off]`	Toggle high-contrast mode for accessibility
`/setmodel <model>`	Set model and switch to a new agent	`/setmodel deepseek/deepseek-chat`
`/loglevel [level]`	Set log level (DEBUG	INFO
`/save <filename>`	Save conversation history to a file	`/save history.json`
`/load <filename>`	Load conversation history from a file	`/load history.json`
`/compose`	Compose input in an external editor	`/compose`
`/edit [INDEX_OR_ID]`	Edit a previous user message	`/edit 3`

CLI Commands

Usage: quantalogic_codeact [OPTIONS] COMMAND [ARGS]...

Options:
  --config, -c PATH        Path to the configuration file to use
  --loglevel, -l TEXT      Override log level: DEBUG|INFO|WARNING|ERROR|CRITICAL
  --install-completion     Install shell completion
  --show-completion        Show shell completion script
  --help                   Show this message and exit

Commands:
  shell                  Start the interactive shell
  task                   Run a task with event monitoring
  create-toolbox         Create a new toolbox project
  config-load            Load a configuration file
  list-models            List available LLM models
  list-toolboxes         List installed toolboxes
  list-reasoners         List available reasoners
  list-executors         List available executors
  tool-info              Display tool information
  install-toolbox        Install a toolbox
  uninstall-toolbox      Uninstall a toolbox
  config                 Manage configuration (subcommands: show, reset)
  toolbox                Manage toolboxes (subcommands: install, uninstall, tools, doc)

Tip: Use --config to specify a custom configuration file:

quantalogic_codeact task "Solve 2 + 2" -c ./myconfig.yaml

Examples

Shell

[cfg:config.yaml] [Agent] [codeact]> /chat Tell me a joke
Why did the computer go to art school? Because it wanted to learn how to draw a better "byte"!

CLI

poetry run quantalogic_codeact task "Calculate the square root of 16" --model gemini/gemini-2.0-flash

Output:

[Final Answer]
4

SDK

from quantalogic_codeact.codeact.agent import Agent
agent = Agent()
result = agent.sync_solve("What is 3 * 5?")
print(result[-1]["result"])  # Output: 15

Configuration

CodeAct uses a YAML configuration file, typically at ~/.quantalogic/config.yaml. Example:

model: "gemini/gemini-2.0-flash"
max_iterations: 5
max_history_tokens: 2000
enabled_toolboxes:
  - math_tools
reasoner:
  name: "default"
  config:
    temperature: 0.7
executor:
  name: "default"
personality:
  traits:
    - witty
    - helpful
tools_config:
  - name: math_tools
    enabled: true
    config:
      precision: "high"

Manage configurations in the shell with /config save or /config load, or edit the file directly.

Toolbox System

Toolboxes extend CodeAct’s functionality with modular, reusable tools. Built-in toolboxes include math_tools for calculations. Create custom toolboxes with:

poetry run quantalogic_codeact create-toolbox my_toolbox

This generates a project structure with a tools.py file for defining tools. Example:

@create_tool
async def echo_tool(message: str) -> str:
    """Echoes the input message."""
    return f"Echo: {message}"

For detailed guidance, see Toolbox Documentation, covering toolbox creation, tool registration, and management.

Memory Systems

CodeAct employs two memory systems:

WorkingMemory: Tracks task-specific steps, thoughts, actions, and results.
ConversationManager: Stores user-agent interaction history for context-aware dialogues.

These systems enable multi-step reasoning and persistent conversations. See Memory Systems Documentation for implementation details and examples.

Troubleshooting

API Key Issues: Verify GEMINI_API_KEY or other LLM keys are set.
Dependency Errors: Run poetry install to ensure all packages are installed.
Timeout Errors: Increase --timeout in CLI or timeout in SDK config.
Tool Failures: Use /toolbox doc <toolbox> <tool> to check tool arguments.
Logging: Enable debug mode with /loglevel DEBUG for detailed logs.

Contributing

Contributions are welcome! Please follow CONTRIBUTING.md for guidelines on code style, testing, and workflows.

References

Yao, S., et al. (2022). "ReAct: Synergizing Reasoning and Acting in Language Models." arXiv:2210.03629.
Yang, J., et al. (2024). "Executable Code Actions Elicit Better LLM Agents." arXiv:2402.01030.
Quantalogic PythonBox: github.com/quantalogic/quantalogic-pythonbox.

License

Quantalogic CodeAct is licensed under the Apache License, Version 2.0. See LICENSE for details.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.100.0

Apr 25, 2025

0.94.0

Apr 23, 2025

0.93.0

Apr 22, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

quantalogic_codeact-0.100.0.tar.gz (86.9 kB view details)

Uploaded Apr 25, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

quantalogic_codeact-0.100.0-py3-none-any.whl (124.9 kB view details)

Uploaded Apr 25, 2025 Python 3

File details

Details for the file quantalogic_codeact-0.100.0.tar.gz.

File metadata

Download URL: quantalogic_codeact-0.100.0.tar.gz
Upload date: Apr 25, 2025
Size: 86.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.1.2 CPython/3.12.8 Darwin/24.4.0

File hashes

Hashes for quantalogic_codeact-0.100.0.tar.gz
Algorithm	Hash digest
SHA256	`7d56e147233af44ffa2ab25054bc0425b6573d89beea3d71da0a243a7802e17e`
MD5	`4bba50e4696e7d41e72c36e16a056658`
BLAKE2b-256	`6bcbd6aa6f7dd8e45180a5f68e46290288ea07f67c0796925f4343584348df85`

See more details on using hashes here.

File details

Details for the file quantalogic_codeact-0.100.0-py3-none-any.whl.

File metadata

Download URL: quantalogic_codeact-0.100.0-py3-none-any.whl
Upload date: Apr 25, 2025
Size: 124.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.1.2 CPython/3.12.8 Darwin/24.4.0

File hashes

Hashes for quantalogic_codeact-0.100.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6711df7aa3a61f9aad019be3340edf4d6a237971742f0d7e2034e1a84c14f63b`
MD5	`31fb4e6f625f0eacbd556b7adbfe76a0`
BLAKE2b-256	`7f0285094544c764c686eee42eda5bcda96e316f6aee7fc74b5a33cf6a6d541a`

See more details on using hashes here.

quantalogic-codeact 0.100.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

Quantalogic CodeAct

Table of Contents

Why CodeAct?

What is CodeAct?

Core Modules

CodeAct Principle and ReAct Paradigm

ReAct Paradigm

CodeAct Enhancement

ReAct Agent

Architecture

How to Use CodeAct

Installation

Prerequisites

Installation Steps

Quick Start

Interactive Shell

Command-Line Interface (CLI)

Using the Agent SDK

Commands

Shell Commands

CLI Commands

Examples

Shell

CLI

SDK

Configuration

Toolbox System

Memory Systems

Troubleshooting

Contributing

References

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes