
Promptimus 🧠

A PyTorch-like API for building composable LLM agents with advanced tool calling, memory management, and observability.

Requires Python 3.12+.

✨ Key Features

  • 🧠 PyTorch-like Modules: Composable agent architecture with hierarchical module system
  • 🔧 Tool Calling: ReACT-style and native OpenAI function calling with automatic schema generation
  • 📝 Structured Output: Pydantic schema-based JSON generation with validation
  • 💾 Memory Management: Conversation context with configurable memory limits
  • 🔍 Embeddings: Text embedding generation with batch processing support
  • 🗄️ Vector Stores: ChromaDB integration with async-first vector operations
  • 🤖 RAG (Retrieval-Augmented Generation): Retrieval system with conversation memory
  • 📊 Tracing: Arize Phoenix integration for comprehensive observability
  • 💾 Serialization: TOML-based save/load for prompts and module configurations
  • ⚡ Async First: Built for high-performance asynchronous operations

🚀 Quick Start

Installation

pip install promptimus

Basic Example

import promptimus as pm

# Create an LLM provider
llm = pm.llms.OpenAILike(
    model_name="gpt-4",
    api_key="your-api-key"
)

# Create a simple agent with memory
agent = pm.modules.MemoryModule(
    memory_size=5,
    system_prompt="You are a helpful assistant."
).with_llm(llm)

# Have a conversation
response1 = await agent.forward("Hi, I'm Alice!")
response2 = await agent.forward("What's my name?")
print(response2.content)  # "Your name is Alice!"
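Since forward is a coroutine, the snippet above runs as-is only in an async context (e.g. a notebook). In a plain script you wrap it in an event loop; a minimal sketch using a stand-in agent (the EchoAgent class is hypothetical, used only to keep the example self-contained):

```python
import asyncio


class EchoAgent:
    """Stand-in for a promptimus module; forward is a coroutine."""

    async def forward(self, message: str) -> str:
        return f"echo: {message}"


async def main() -> str:
    agent = EchoAgent()
    # In real code this would be: await agent.forward(...) on a promptimus module.
    return await agent.forward("Hi, I'm Alice!")


result = asyncio.run(main())
```

The same `asyncio.run(...)` pattern applies to every await-based example below.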

🏗️ Architecture

Core Concepts

Modules: Container system for organizing prompts, submodules, and logic

class MyAgent(pm.Module):
    def __init__(self):
        super().__init__()
        self.chat = pm.Prompt("You are a helpful assistant")
        self.memory = []

    async def forward(self, message: str) -> str:
        # Custom logic here
        pass

Prompts: Parameter-like objects for system prompts (analogous to PyTorch's nn.Parameter)

prompt = pm.Prompt("You are a {role} assistant").with_llm(llm)
response = await prompt.forward(role="helpful")

Tools: Function decoration for external capabilities

@pm.modules.Tool.decorate
def calculate(a: float, b: float, operation: str) -> float:
    """Calculate result of operation on two numbers."""
    if operation == "add":
        return a + b
    # ... more operations
    raise ValueError(f"Unsupported operation: {operation}")
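Automatic schema generation for such tools typically works by reading the function's signature, type hints, and docstring. A rough, library-independent sketch of the idea (not promptimus's actual implementation):

```python
import inspect
from typing import get_type_hints

# Map Python annotations to JSON-schema types (illustrative subset).
TYPE_MAP = {float: "number", int: "integer", str: "string", bool: "boolean"}


def build_schema(fn) -> dict:
    """Derive an OpenAI-style function schema from a Python callable."""
    hints = get_type_hints(fn)
    hints.pop("return", None)
    params = {name: {"type": TYPE_MAP[t]} for name, t in hints.items()}
    return {
        "name": fn.__name__,
        "description": inspect.getdoc(fn) or "",
        "parameters": {
            "type": "object",
            "properties": params,
            "required": list(params),
        },
    }


def calculate(a: float, b: float, operation: str) -> float:
    """Calculate result of operation on two numbers."""
    return a + b if operation == "add" else a - b


schema = build_schema(calculate)
```

The resulting dict is the shape that OpenAI-style function calling expects in its tools list.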

Pre-built Modules

Memory Module: Conversation memory with configurable limits

agent = pm.modules.MemoryModule(
    memory_size=10,
    system_prompt="You are a helpful assistant."
).with_llm(llm)
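Conceptually, a bounded conversation memory is a FIFO of recent messages: once the limit is reached, the oldest exchange is dropped. A sketch of that trimming behaviour with collections.deque (illustrative, not the library's implementation):

```python
from collections import deque


class BoundedMemory:
    """Keep only the most recent memory_size exchanges (user + assistant pairs)."""

    def __init__(self, memory_size: int):
        # Each exchange contributes two messages, so cap the deque at 2 * memory_size;
        # deque with maxlen discards the oldest entries automatically.
        self.messages = deque(maxlen=2 * memory_size)

    def add(self, role: str, content: str) -> None:
        self.messages.append({"role": role, "content": content})


memory = BoundedMemory(memory_size=2)
for i in range(5):
    memory.add("user", f"question {i}")
    memory.add("assistant", f"answer {i}")
# Only the last two exchanges survive.
```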

Retrieval Module: Vector database operations for embeddings

retrieval = pm.modules.RetrievalModule(n_results=5)
retrieval.with_embedder(embedder).with_vector_store(vector_store)

# Insert documents
await retrieval.insert(documents)

# Search for relevant content
results = await retrieval.forward("query about AI")
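The retrieval step itself reduces to nearest-neighbour search over embeddings. A toy sketch with cosine similarity over hand-rolled 3-dimensional vectors (no real embedder or vector store involved):

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)


# Toy "embeddings" for three documents.
store = {
    "doc_ml": [0.9, 0.1, 0.0],
    "doc_cooking": [0.0, 0.2, 0.9],
    "doc_dl": [0.8, 0.3, 0.1],
}


def search(query_vec, n_results=2):
    """Return the n_results document ids most similar to the query vector."""
    ranked = sorted(store, key=lambda doc: cosine(store[doc], query_vec), reverse=True)
    return ranked[:n_results]


results = search([1.0, 0.0, 0.0])
```

A real vector store such as ChromaDB does the same ranking, just with approximate indexes and persisted embeddings.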

RAG Module: Retrieval-Augmented Generation with conversation memory

import chromadb
from chromadb_store import ChromaVectorStore

# Setup components
embedder = pm.embedders.OpenAILikeEmbedder(model_name="text-embedding-3-small")
client = chromadb.EphemeralClient()
vector_store = ChromaVectorStore(client, "my_docs")

# Create RAG agent
rag_agent = pm.modules.RAGModule(
    n_results=3,
    memory_size=5
).with_llm(llm).with_embedder(embedder).with_vector_store(vector_store)

# Add documents
await rag_agent.retrieval.insert([
    "Machine learning is a subset of AI...",
    "Deep learning uses neural networks...",
    # ... more documents
])

# Query with context
response = await rag_agent.forward("What is machine learning?")

Structural Output: Pydantic schema-based JSON generation

from pydantic import BaseModel

class Person(BaseModel):
    name: str
    age: int
    occupation: str

module = pm.modules.StructuralOutput(Person).with_llm(llm)
result = await module.forward("Extract info about John, a 30-year-old engineer")
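Under the hood, this pattern amounts to embedding the model's JSON schema in the prompt and validating the reply against it. A sketch with a hard-coded LLM reply instead of an API call (illustrative only; field and variable names are assumptions):

```python
from pydantic import BaseModel


class Person(BaseModel):
    name: str
    age: int
    occupation: str


# The schema that would be embedded in the system prompt to steer generation.
schema = Person.model_json_schema()

# Pretend this JSON string came back from the LLM.
raw_reply = '{"name": "John", "age": 30, "occupation": "engineer"}'

# Validation: raises pydantic.ValidationError if the reply doesn't match the schema.
person = Person.model_validate_json(raw_reply)
```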

Tool Calling Agents: Agents that can use tools autonomously

agent = pm.modules.ToolCallingAgent([
    calculate,
    # ... more tools
]).with_llm(llm)

result = await agent.forward("What is 15 + 27?")
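The loop behind such an agent can be sketched as: ask the model, execute any tool it requests, feed the observation back, and repeat until it produces a final answer. A stubbed, synchronous version where a scripted fake_llm stands in for a real provider (all names here are illustrative):

```python
import json


def calculate(a: float, b: float, operation: str) -> float:
    return a + b if operation == "add" else a - b


TOOLS = {"calculate": calculate}

# Scripted replies standing in for a real LLM: first a tool call, then an answer.
replies = iter([
    {"tool": "calculate", "args": {"a": 15, "b": 27, "operation": "add"}},
    {"answer": "15 + 27 is 42."},
])


def fake_llm(history):
    return next(replies)


def run_agent(question: str) -> str:
    history = [{"role": "user", "content": question}]
    while True:
        reply = fake_llm(history)
        if "answer" in reply:
            return reply["answer"]
        # Execute the requested tool and append the observation for the next turn.
        result = TOOLS[reply["tool"]](**reply["args"])
        history.append({"role": "tool", "content": json.dumps(result)})


answer = run_agent("What is 15 + 27?")
```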

🔧 Advanced Features

Serialization

Save and load module configurations:

agent.save("my_agent.toml")
loaded_agent = pm.modules.MemoryModule().load("my_agent.toml")

Config Composition

Split complex TOML configs into composable files using OmegaConf-inspired resolver syntax:

# main.toml — lean orchestrator config
analysis_format = """
<query>{query}</query>
"""

# Reference submodule configs from installed packages
query_decomposer = "${pkg:my_lib.prompts.query_decomposer.toml}"

# Or from relative file paths
series_analyzer = "${file:./series_analyzer.toml}"

References are resolved at load time and inlined on save — save() always produces a single flat TOML.
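The resolution step can be sketched as a substitution pass that replaces each ${file:...} / ${pkg:...} reference with the referenced content before the TOML is parsed. An illustrative sketch, not the library's code (the loader is faked here):

```python
import re

# Matches quoted "${file:...}" or "${pkg:...}" reference values.
RESOLVER = re.compile(r'"\$\{(file|pkg):([^}]+)\}"')


def resolve(toml_text: str, loader) -> str:
    """Inline each reference using loader(kind, target); returns flat TOML text."""

    def replace(match):
        kind, target = match.group(1), match.group(2)
        return loader(kind, target)

    return RESOLVER.sub(replace, toml_text)


# Fake loader standing in for reading a file or an installed package resource.
def fake_loader(kind, target):
    return '"You decompose queries."'


source = 'query_decomposer = "${file:./query_decomposer.toml}"'
flat = resolve(source, fake_loader)
```

This is also why save() can always emit a single flat TOML: after resolution, no references remain.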

Tracing with Phoenix

import phoenix as px
from phoenix_tracer import trace

px.launch_app()
trace(agent, "my_agent", project_name="my_project")

Vector Stores

import chromadb
from chromadb_store import ChromaVectorStore

# Setup ChromaDB vector store
client = chromadb.PersistentClient(path="./chroma_db")
vector_store = ChromaVectorStore(client, "my_collection")

# Create embedder
embedder = pm.embedders.OpenAILikeEmbedder(
    model_name="text-embedding-3-small"
)

# Build RAG system
rag = pm.modules.RAGModule(
    n_results=5,
    memory_size=10
).with_llm(llm).with_embedder(embedder).with_vector_store(vector_store)

# Add documents
await rag.retrieval.insert_batch([
    "Document 1 content...",
    "Document 2 content...",
])

# Query with retrieval-augmented generation
response = await rag.forward("What information do you have about X?")

Custom Embedders

embedder = pm.embedders.OpenAILikeEmbedder(
    model_name="text-embedding-3-small"
)

embeddings = await embedder.aembed_batch([
    "Hello world",
    "How are you?"
])

📖 Documentation

Tutorials

Explore our comprehensive notebook tutorials:

  1. LLM Providers & Embedders - Getting started with providers
  2. Prompts & Modules - Core architecture concepts
  3. Pre-built Modules - Ready-to-use components including RAG
  4. Custom Agents - Tool calling and advanced agents
  5. Tracing - Observability with Phoenix

API Reference

  • pm.Module: Base class for all modules
  • pm.Prompt: System prompt management
  • pm.llms.*: LLM provider implementations
  • pm.embedders.*: Embedding provider implementations
  • pm.vector_store.*: Vector store protocols and implementations
  • pm.modules.*: Pre-built module components
    • MemoryModule: Conversation memory management
    • RAGModule: Retrieval-Augmented Generation
    • RetrievalModule: Vector database operations
    • StructuralOutput: Schema-based JSON generation
    • ToolCallingAgent: Tool-augmented agents

🛠️ Installation Options

Basic Installation

pip install promptimus

With Optional Dependencies

# Phoenix tracing support
pip install promptimus[phoenix]

# ChromaDB vector store for RAG
pip install promptimus[chromadb]

# All optional dependencies
pip install promptimus[all]

Development Setup

git clone https://github.com/AIladin/promptimus.git
cd promptimus
pip install -e .[dev]

🙏 Acknowledgments

  • Inspired by PyTorch's modular architecture
  • Built on top of modern Python async patterns
  • Integrated with Arize Phoenix for tracing
  • Compatible with OpenAI and OpenAI-compatible APIs
  • Vector store support powered by ChromaDB

Ready to build your next LLM agent? Check out our tutorials to get started! 🚀
