A memory-augmented framework for LLMs

These details have not been verified by PyPI

Project links

Project description

🦙 Llamate

Llamate is a memory-augmented agent framework for Large Language Models (LLMs) that provides persistent, retrievable memory for AI conversations.

What is Llamate?

Llamate solves a fundamental limitation of current LLMs: their inability to remember past conversations beyond a single context window. It creates a vector database of memories that can be semantically searched and retrieved during conversations, allowing LLMs to maintain continuity and context over extended interactions.

How It Works

Memory Storage: Llamate stores important pieces of conversation as vector embeddings in a database (either FAISS or PostgreSQL).
Semantic Retrieval: When new queries come in, Llamate searches for semantically relevant past memories.
Memory Filtering: The system automatically filters out the current query from search results to prevent echo effects.
Context Enhancement: Retrieved memories are injected into the conversation context, allowing the LLM to access and utilize past information.
User Identification: Each user gets a unique memory space, ensuring personalized conversation history.

Key Features

Multiple Backend Support: Works with FAISS (file-based) or PostgreSQL (with pgvector)
Persistence: Memories remain available between sessions and application restarts
Simple API: Easy-to-use Python interface that works with any LLM
CLI Interface: Command-line tool for quick testing and interaction
Production Ready: Designed for both development and production environments

Quick Start

Local Development

The following steps guide you through setting up Llamate for local development:

1. Install Package

pip install llamate

2. OpenAI API Requirements

Llamate requires access to the following OpenAI models in your account:

Embedding models (at least one of):
- text-embedding-3-small (default, 1536 dimensions) - Faster, smaller embeddings, cost-effective
- text-embedding-3-large (3072 dimensions) - Higher accuracy, larger embeddings
gpt-4 - Recommended for high-quality responses

Make sure these models are enabled in your OpenAI account.

Environment Variables

Llamate is configured primarily through environment variables, making it easy to integrate with any backend deployment. The following environment variables are supported:

Variable	Default	Description
`LLAMATE_OPENAI_API_KEY`	None (Required)	Your OpenAI API key
`LLAMATE_VECTOR_BACKEND`	`postgres`	Vector store backend (`postgres` or `faiss`)
`LLAMATE_DATABASE_URL`	`postgresql://llamate:llamate@localhost:5432/llamate`	PostgreSQL connection string (when using postgres backend)
`LLAMATE_EMBEDDING_MODEL`	`text-embedding-3-small`	Embedding model to use (`text-embedding-3-small` or `text-embedding-3-large`)

Example configuration for production deployment:

# Required
LLAMATE_OPENAI_API_KEY=sk-your-api-key

# Optional overrides
LLAMATE_VECTOR_BACKEND=postgres
LLAMATE_DATABASE_URL=postgresql://user:password@your-db-host:5432/dbname
LLAMATE_EMBEDDING_MODEL=text-embedding-3-large

Note: While you can use llamate --init for local development to generate a .env file, in production environments you should configure these variables directly in your deployment platform.

3. Start PostgreSQL Container

docker run --name llamate-postgres -e POSTGRES_USER=llamate -e POSTGRES_PASSWORD=llamate -e POSTGRES_DB=llamate -p 5432:5432 -d ankane/pgvector

4. Initialize Llamate

llamate --init
# Select 'postgres' as your vector store backend
# Enter connection string: postgresql://llamate:llamate@localhost:5432/llamate

5. Test Llamate locally

from llamate import MemoryAgent, get_vectorstore_from_env

# Set user ID
user_id = "test_user"

# Initialize components
vectorstore = get_vectorstore_from_env(user_id=user_id)
agent = MemoryAgent(user_id=user_id, vectorstore=vectorstore)

# Add memories
agent.chat("The capital of France is Paris.")
agent.chat("The Eiffel Tower is 324 meters tall.")
agent.chat("Python is a programming language created by Guido van Rossum.")

# Test retrieval
response = agent.chat("Tell me about Paris.")
print("Response:", response)

Production Integration

For production applications, you'll typically integrate Llamate directly into your backend services:

from llamate import MemoryAgent, get_vectorstore_from_env
import os

# In production, set environment variables directly in your deployment platform
# os.environ["LLAMATE_OPENAI_API_KEY"] = "your-key-here" # Set in platform instead
# os.environ["LLAMATE_DATABASE_URL"] = "connection-string" # Set in platform instead

def create_llamate_agent(user_id):
    """Factory function to create a memory-augmented agent for a specific user"""
    vectorstore = get_vectorstore_from_env(user_id=user_id)
    return MemoryAgent(user_id=user_id, vectorstore=vectorstore)

# Example API endpoint
def handle_chat_request(user_id, user_message):
    agent = create_llamate_agent(user_id)
    return agent.chat(user_message)

6. View Data in PostgreSQL

Connect to the database:

docker exec -it llamate-postgres psql -U llamate -d llamate

List tables to find your memory table (it will use your user_id):

\dt

View table structure:

\d memory_test_user

Display memory records (omitting the large vector field):

SELECT id, text FROM memory_test_user;

Count records:

SELECT COUNT(*) FROM memory_test_user;

Query specific memories (using text search):

SELECT id, text FROM memory_test_user WHERE text LIKE '%Paris%';

Delete test memories (if needed):

DELETE FROM memory_test_user WHERE text LIKE '%test%';

Exit the PostgreSQL shell:

\q

Features

Persistent memory for AI using vector embeddings
Multiple vector store backends (FAISS and PostgreSQL)
Easy integration into existing applications
Simple CLI for testing and demonstration

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.1.16

Jul 13, 2025

0.1.15

Jul 13, 2025

0.1.14

Jul 13, 2025

0.1.13

Jul 13, 2025

0.1.12

Jul 13, 2025

0.1.11

Jul 13, 2025

0.1.10

Jul 13, 2025

0.1.9

Jul 13, 2025

This version

0.1.8

Jul 7, 2025

0.1.7

Jul 7, 2025

0.1.6

Jul 7, 2025

0.1.5

Jul 6, 2025

0.1.4

Jul 6, 2025

0.1.1

Jul 6, 2025

0.1.0

Jul 6, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llamate-0.1.8.tar.gz (12.6 kB view details)

Uploaded Jul 7, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

llamate-0.1.8-py3-none-any.whl (12.2 kB view details)

Uploaded Jul 7, 2025 Python 3

File details

Details for the file llamate-0.1.8.tar.gz.

File metadata

Download URL: llamate-0.1.8.tar.gz
Upload date: Jul 7, 2025
Size: 12.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for llamate-0.1.8.tar.gz
Algorithm	Hash digest
SHA256	`fb4534f985a37c0a2cf0a078a473e2135a5bcf87a6901e26c5994d0c671db0f3`
MD5	`6e7ebc29a1bfab6dde88768f7d4ed68c`
BLAKE2b-256	`bdd08de7cf2d19bf31ffb397fe542364bba09a92d962c0f36459ccc1914ce93f`

See more details on using hashes here.

File details

Details for the file llamate-0.1.8-py3-none-any.whl.

File metadata

Download URL: llamate-0.1.8-py3-none-any.whl
Upload date: Jul 7, 2025
Size: 12.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for llamate-0.1.8-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c2f9783672423c9316c2c8ac1fa0d42626511290416bc451ffafef43e149da31`
MD5	`481dac7c24d8a20c7be578ce7573b1df`
BLAKE2b-256	`1c54efd7a4fa7a9e9a22e1ee79256ee3596da11db44031eba415b0f0dcd90ae9`

See more details on using hashes here.

llamate 0.1.8

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

🦙 Llamate

What is Llamate?

How It Works

Key Features

Quick Start

Local Development

1. Install Package

2. OpenAI API Requirements

Environment Variables

3. Start PostgreSQL Container

4. Initialize Llamate

5. Test Llamate locally

Production Integration

6. View Data in PostgreSQL

Features

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes