Skip to main content

Memor: Reproducible Structured Memory for LLMs

Project description

Memor Logo

Memor: Reproducible Structured Memory for LLMs


PyPI version built with Python3 GitHub repo size Discord Channel

Overview

With Memor, users can store their LLM conversation history using an intuitive and structured data format. It abstracts user prompts and model responses into a "Session", a sequence of message exchanges. In addition to the content, it includes details like decoding temperature and token count of each message. Therefore users could create comprehensive and reproducible logs of their interactions. Because of the model-agnostic design, users can begin a conversation with one LLM and switch to another keeping the context the same. For example, they might use a retrieval-augmented model (like RAG) to gather relevant context for a math problem, and then switch to a model better suited for reasoning to solve the problem based on the retrieved information presented by Memor.

Memor also lets users select, filter, and then share the specific parts of the past conversations across different models. This means users are not only able to reproduce and review previous chats through structured logs, but can also flexibly transfer the content of their conversations between LLMs. In a nutshell, Memor makes it easy and effective to manage and reuse conversations with large language models.

PyPI Counter
Github Stars
Branch main dev
CI
Code Quality CodeFactor

Installation

PyPI

Source code

Usage

Memor provides Prompt, Response, and Session as abstractions by which you can save your conversation history much structured. You can set a Session object before starting a conversation, make a Prompt object from your prompt and a Response object from LLM's response. Then adding them to the created Session can keep the conversation history.

from memor import Session, Prompt, Response
from memor import RenderFormat
from mistralai import Mistral

client = Mistral(api_key="YOUR_MISTRAL_API")
session = Session()
while True:
    user_input = input(">> You: ")
    prompt = Prompt(message=user_input)
    session.add_message(prompt) # Add user input to session
    response = client.chat.complete(
        model="mistral-large-latest",
        messages=session.render(RenderFormat.OPENAI)  # Render the whole session history
    )
    print("<< MistralAI:", response.choices[0].message.content)
    response = Response(message=response.choices[0].message.content)
    session.add_message(response) # Add model response to session

Your conversations would carry the past interactions and LLM remembers your session's information:

>> You: Imagine you have 3 apples. You eat one of them. How many apples remain?
<< MistralAI: If you start with 3 apples and you eat one of them, you will have 2 apples remaining.
>> You: How about starting from 2 apples?
<< MistralAI: If you start with 2 apples and you eat one of them, you will have 1 apple remaining. Here's the simple math:
2 apples - 1 apple = 1 apple

In the following, we detail different abstraction levels Memor provides for the conversation artifacts.

Prompt

The Prompt class is a core abstraction in Memor, representing a user prompt. The prompt can be associated with one or more responses from an LLM, with the first one being the most confident usually. It encapsulates not just the prompt text but also metadata, a template for rendering into the API endpoint, and serialization capabilities that enable saving and reusing prompts.

from memor import Prompt, Response, PresetPromptTemplate
prompt = Prompt(
    message="Hello, how are you?",
    responses=[
        Response(message="I'm fine."),
        Response(message="I'm not fine."),
    ],
    template=PresetPromptTemplate.BASIC.PROMPT_RESPONSE_STANDARD
)
prompt.render()
# Prompt: Hello, how are you?
# Response: I'm fine.

Parameters

Name Type Description
message str The core prompt message content
responses List[Response] List of associated responses
role Role Role of the message sender (USER, SYSTEM, etc.)
tokens int Token count
template PromptTemplate | PresetPromptTemplate Template used to format the prompt
file_path str Path to load a prompt from a JSON file
init_check bool Whether to verify template rendering during initialization

Methods

Method Description
add_response Add a new response (append or insert)
remove_response Remove the response at specified index
select_response Mark a specific response as selected to be included in memory
update_template Update the rendering template
update_responses Replace all responses
update_message Update the prompt text
update_message_from_xml Update the prompt text from XML
update_role Change the prompt role
update_tokens Set a custom token count
to_json / from_json Serialize or deserialize the prompt data
to_dict Convert the object to a Python dictionary
save / load Save or load prompt from file
render Render the prompt in a specified format
check_render Validate if the current prompt setup can render
estimate_tokens Estimate the token usage for the prompt
get_size Return prompt size in bytes (JSON-encoded)
copy Clone the prompt
regenerate_id Reset the unique identifier of the prompt
contains_xml Check if the prompt contains any XML tags
set_size_warning / reset_size_warning Set or reset size warning

Response

The Response class represents an answer or a completion generated by a model given a prompt. It encapsulates metadata such as score, temperature, model, tokens, inference time, and more. It also provides utilities for JSON serialization, rendering in multiple formats, and import/export functionality.

from memor import Response, Role, LLMModel
response = Response(
    message="Sure! Here's a summary.",
    score=0.94,
    temperature=0.7,
    model=LLMModel.GPT_4,
    inference_time=0.3
)
response.render()
# Sure! Here's a summary.

Parameters

Name Type Description
message str The content of the response
score float Evaluation score representing the response quality
role Role Role of the message sender (USER, SYSTEM, etc.)
temperature float Sampling temperature
top_k int k in top-k sampling method
top_p float p in top-p (nucleus) sampling
tokens int Number of tokens in the response
inference_time float Time spent generating the response (seconds)
model LLMModel | str Model used
gpu str GPU model used
date datetime.datetime Timestamp of the creation
file_path str Path to load a saved response

Methods

Method Description
update_score Update the response score
update_temperature Set the generation temperature
update_top_k Set the top-k value
update_top_p Set the top-p value
update_model Set the model name or enum
update_gpu Set the GPU model identifier
update_inference_time Set the inference time in seconds
update_message Update the response message
update_message_from_xml Update the response message from XML
update_role Update the sender role
update_tokens Set the number of tokens
to_json / from_json Serialize or deserialize to/from JSON
to_dict Convert the object to a Python dictionary
save / load Save or load the response to/from a file
render Render the response in a specific format
check_render Validate if the current response setup can render
estimate_tokens Estimate the token usage for the response
get_size Return response size in bytes (JSON-encoded)
copy Clone the response
regenerate_id Reset the unique identifier of the response
contains_xml Check if the response contains any XML tags
set_size_warning / reset_size_warning Set or reset size warning

Prompt Templates

The PromptTemplate class provides a structured interface for managing, storing, and customizing text prompt templates used in prompt engineering tasks. This class supports template versioning, metadata tracking, file-based persistence, and integration with preset template formats. It is a core component of the memor library, designed to facilitate reproducible and organized prompt workflows for LLMs.

from memor import Prompt, PromptTemplate
template = PromptTemplate(content="{instruction}, {prompt[message]}", custom_map={"instruction": "Hi"})
prompt = Prompt(message="How are you?", template=template)
prompt.render()
'Hi, How are you?'

Parameters

Name Type Description
title str The template name
content str The template content string with placeholders
custom_map Dict[str, str] A dictionary of custom variables used in the template
file_path str Path to a JSON file to load the template from

Methods

Method Description
update_title Update the template title
update_content Update the template content
update_map Update the custom variable map
get_size Return the size (in bytes) of the JSON representation
save / load Save or load the template to/from a file
to_json / from_json Serialize or deserialize to/from JSON
to_dict Convert the template to a plain Python dictionary
copy Return a shallow copy of the template instance

Preset Templates

Memor provides a variety of pre-defined PromptTemplates to control how prompts and responses are rendered. Each template is prefixed by an optional instruction string and includes variations for different formatting styles. Following are different variants of parameters:

  • INSTRUCTION1: "I'm providing you with a history of a previous conversation. Please consider this context when responding to my new question."
  • INSTRUCTION2: "Here is the context from a prior conversation. Please learn from this information and use it to provide a thoughtful and context-aware response to my next questions."
  • INSTRUCTION3: "I am sharing a record of a previous discussion. Use this information to provide a consistent and relevant answer to my next query."
Template Title Description
PROMPT Only includes the prompt message
RESPONSE Only includes the response message
RESPONSE0 to RESPONSE3 Include specific responses from a list of multiple responses
PROMPT_WITH_LABEL Prompt with a "Prompt: " prefix
RESPONSE_WITH_LABEL Response with a "Response: " prefix
RESPONSE0_WITH_LABEL to RESPONSE3_WITH_LABEL Labeled response for the i-th response
PROMPT_RESPONSE_STANDARD Includes both labeled prompt and response on a single line
PROMPT_RESPONSE_FULL A detailed multi-line representation including role, date, model, etc

You can access them using:

from memor import PresetPromptTemplate
template = PresetPromptTemplate.INSTRUCTION1.PROMPT_RESPONSE_STANDARD

Session

The Session class represents a conversation session composed of Prompt and Response messages. It supports creation, modification, saving, loading, searching, rendering, and token estimation — offering a structured way to manage LLM interaction histories. Each session tracks metadata such as title, creation/modification time, render count, and message activation (masking) status.

from memor import Session, Prompt, Response
session = Session(title="Q&A Session", messages=[
            Prompt(message="What is the capital of France?"),
            Response(message="The capital of France is Paris.")
            ])
session.add_message(Prompt(message="What is the population of Paris?"))
print(session.render())
# What is the capital of France?
# The capital of France is Paris.
# What is the population of Paris?

results = session.search("Paris")
print("Found at indices:", results)
# Found at indices: [1, 2]

tokens = session.estimate_tokens()
print("Estimated tokens:", tokens)
# Estimated tokens: 35

Parameters

Parameter Type Description
title str The title of the session
messages List[Prompt or Response] The list of initial messages
init_check bool Whether to check rendering at initialization
file_path str The Path to a saved session file

Methods

Method Description
add_message Add a Prompt or Response to the session
remove_message Remove a message by index or ID
remove_message_by_index Remove a message by numeric index
remove_message_by_id Remove a message by its unique ID
update_title Update the title of the session
update_messages Replace all messages and optionally update their status list
update_messages_status Update the message status without changing the content
clear_messages Remove all messages from the session
get_message Retrieve a message by index, slice, or ID
get_message_by_index Get a message by integer index or slice
get_message_by_id Get a message by its unique ID
enable_message Mark the message at the given index as active
disable_message Mark the message as inactive (masked)
mask_message Alias for disable_message()
unmask_message Alias for enable_message()
search Search for a string or regex pattern in the messages
save / load Save or load the session to/from a file
to_json / from_json Serialize or deserialize the session to/from JSON
to_dict Return a Python dict representation of the session
to_dataframe/from_dataframe Serialize or deserialize the session to/from Pandas DataFrame
render Render the session in the specified format
check_render Return True if the session renders without error
get_size Return session size in bytes (JSON-encoded)
copy Return a shallow copy of the session
estimate_tokens Estimate the token count of the session content
set_size_warning / reset_size_warning Set or reset size warning
reset_render_counter Reset render counter

Examples

You can find more real-world usage of Memor in the examples directory. This directory includes concise and practical Python scripts that demonstrate key features of Memor library.

Issues & bug reports

Just fill an issue and describe it. We'll check it ASAP! or send an email to memor@openscilab.com.

  • Please complete the issue template

You can also join our discord server

Discord Channel

Show your support

Star this repo

Give a ⭐️ if this project helped you!

Donate to our project

If you do like our project and we hope that you do, can you please support us? Our project is not and is never going to be working for profit. We need the money just so we can continue doing what we do ;-) .

Memor Donation

Changelog

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog and this project adheres to Semantic Versioning.

Unreleased

1.1 - 2025-12-23

Added

  • Session class to_dataframe method
  • Session class from_dataframe method
  • Session class reset_render_counter method
  • xml_usage.py example
  • dataframe_usage example

Changed

  • LLMModel enum updated
  • Test system modified
  • README.md updated

1.0 - 2025-11-17

Added

  • Response class xml_tree property
  • Prompt class xml_tree property
  • Response class update_message_from_xml method
  • Prompt class update_message_from_xml method

Changed

  • README.md updated
  • LLMModel enum updated
  • verify parameter added to contains_xml method
  • Warning bug fixed
  • Test system modified

0.9 - 2025-10-22

Added

  • Prompt class contains_xml method
  • Response class contains_xml method
  • Prompt class size warning
  • Response class size warning
  • Session class size warning
  • Response class warnings property
  • Prompt class warnings property
  • Session class warnings property

Changed

  • README.md updated
  • LLMModel enum updated
  • None value update bug fixed
  • show_warning parameter added to render method
  • Python 3.14 added to test.yml
  • Test system modified
  • Typing modified

0.8 - 2025-07-21

Added

  • Logo
  • Response class top_k property
  • Response class top_p property
  • Response class gpu property

Changed

  • AI_STUDIO render format modified
  • LLMModel enum updated
  • Model load bug fixed
  • Test system modified
  • README.md updated
  • _validate_pos_float now validates int values
  • _validate_probability now validates int values
  • None value validation bug fixed

0.7 - 2025-06-25

Added

  • Message abstract class
  • Session class _validate_extract_json method
  • Response class _validate_extract_json method
  • Prompt class _validate_extract_json method
  • PromptTemplate class _validate_extract_json method
  • Session class search method
  • Session class get_size method
  • Response class get_size method
  • Prompt class get_size method
  • PromptTemplate class get_size method
  • Session class size attribute
  • Response class size attribute
  • Prompt class size attribute
  • PromptTemplate class size attribute
  • examples directory

Changed

  • Validation bug fixed in update_messages method in Session class
  • Validation bug fixed in from_json method in PromptTemplate, Response, Prompt, and Session classes
  • AI_STUDIO render format modified
  • Session class messages status bug fixed
  • Test system modified
  • README.md updated

0.6 - 2025-05-05

Added

  • Response class id property
  • Prompt class id property
  • Response class regenerate_id method
  • Prompt class regenerate_id method
  • Session class render_counter method
  • Session class remove_message_by_index and remove_message_by_id methods
  • Session class get_message_by_index, get_message_by_id and get_message methods
  • LLMModel enum
  • AI_STUDIO render format

Changed

  • Test system modified
  • Modification handling centralized via _mark_modified method
  • Session class remove_message method modified

0.5 - 2025-04-16

Added

  • Session class check_render method
  • Session class clear_messages method
  • Prompt class check_render method
  • Session class estimate_tokens method
  • Prompt class estimate_tokens method
  • Response class estimate_tokens method
  • universal_tokens_estimator function
  • openai_tokens_estimator_gpt_3_5 function
  • openai_tokens_estimator_gpt_4 function

Changed

  • init_check parameter added to Prompt class
  • init_check parameter added to Session class
  • Test system modified
  • Python 3.6 support dropped
  • README.md updated

0.4 - 2025-03-17

Added

  • Session class __contains__ method
  • Session class __getitem__ method
  • Session class mask_message method
  • Session class unmask_message method
  • Session class masks attribute
  • Response class __len__ method
  • Prompt class __len__ method

Changed

  • inference_time parameter added to Response class
  • README.md updated
  • Test system modified
  • Python typing features added to all modules
  • Prompt class default values updated
  • Response class default values updated

0.3 - 2025-03-08

Added

  • Session class __len__ method
  • Session class __iter__ method
  • Session class __add__ and __radd__ methods

Changed

  • tokens parameter added to Prompt class
  • tokens parameter added to Response class
  • tokens parameter added to preset templates
  • Prompt class modified
  • Response class modified
  • PromptTemplate class modified

0.2 - 2025-03-01

Added

  • Session class

Changed

  • Prompt class modified
  • Response class modified
  • PromptTemplate class modified
  • README.md updated
  • Test system modified

0.1 - 2025-02-12

Added

  • Prompt class
  • Response class
  • PromptTemplate class
  • PresetPromptTemplate class

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

memor-1.1.tar.gz (60.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

memor-1.1-py3-none-any.whl (39.9 kB view details)

Uploaded Python 3

File details

Details for the file memor-1.1.tar.gz.

File metadata

  • Download URL: memor-1.1.tar.gz
  • Upload date:
  • Size: 60.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for memor-1.1.tar.gz
Algorithm Hash digest
SHA256 23895616bb0ad7f4f11357f9983aa09571c198d18917cf3d0c183ee8a7892b4c
MD5 e5d77219a964e9eed394ad7cf73999a4
BLAKE2b-256 226fb788566a7b63dea209a70705269f64fc4d0429f8a3676f344aae45421b28

See more details on using hashes here.

File details

Details for the file memor-1.1-py3-none-any.whl.

File metadata

  • Download URL: memor-1.1-py3-none-any.whl
  • Upload date:
  • Size: 39.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for memor-1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 0cb062de7c3cc26103a5fdced804d17a1479f69745d6c706965c5a19784b0998
MD5 fa7ec832b1741ab5ab21d14f12a84ff4
BLAKE2b-256 095ba2c66782503f09b899ee6eeedff160c90c95824a117b4266f8cd30e889d4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page