An approach for managing short-term memory in chatbots.

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

Introduction

We present here an approach for managing short-term memory in chatbots, using a combination of storage techniques and automatic summarization to optimize conversational context. The introduced method relies on a dynamic memory structure that limits data size while preserving essential information through intelligent summaries. This approach not only improves the fluidity of interactions but also ensures contextual continuity during long dialogue sessions. Additionally, the use of asynchronous techniques ensures that memory management operations do not interfere with the chatbot's responsiveness.

How to Use the `shortterm-memory` Package

This section explains how to use the shortterm-memory package to manage a chatbot's memory.

Installation

pip install torch transformers

pip install shortterm-memory

pip show shortterm-memory

Usage

from shortterm_memory.ChatbotMemory import ChatbotMemory

Usage Example

from shortterm_memory.ChatbotMemory import ChatbotMemory

# Initialize the chatbot memory
chat_memory = ChatbotMemory()

# Update the memory with a new exchange
user_input = "Hello, how are you?"
bot_response = "I am fine, thank you! And you?"
chat_memory.update_memory(user_input, bot_response)

# Retrieve the conversation history
historique = chat_memory.get_memory()
print(historique)

Available Features

- `update_memory(user_input: str, bot_response: str)`: Updates the conversation history with a new question-response pair.
- `get_memory()`: Returns the complete conversation history as a list.
- `memory_counter(conv_hist: list) -> int`: Counts the total number of words in the conversation history.
- `compressed_memory(conv_hist: list) -> list`: Compresses the conversation history using a summarization model.

Error Handling

Ensure that user inputs and bot responses are valid strings. If the history becomes too large, the package automatically compresses older conversations to save memory.
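One way to enforce the string requirement is a small guard that runs before `update_memory` is called. The sketch below is illustrative only: `validate_exchange` is a hypothetical helper, not part of the package, and treating empty strings as invalid is an assumption made here.

```python
def validate_exchange(user_input, bot_response):
    """Return the stripped pair, or raise if a value is not a non-empty string.

    Hypothetical helper for illustration; not part of shortterm-memory.
    """
    for name, value in (("user_input", user_input), ("bot_response", bot_response)):
        if not isinstance(value, str):
            raise TypeError(f"{name} must be a str, got {type(value).__name__}")
        if not value.strip():
            # Assumption: an empty or whitespace-only message is not worth storing.
            raise ValueError(f"{name} must not be empty")
    return user_input.strip(), bot_response.strip()


clean_input, clean_response = validate_exchange("  Hello  ", "Hi there")
```

The cleaned pair can then be passed to `chat_memory.update_memory(clean_input, clean_response)`.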

Mathematical Modeling of Conversation Management

In this section, we mathematically formalize conversation memory management in the chatbot. The memory is structured as a list of pairs representing exchanges between the user and the bot.

Conversation Memory Structure

The conversation memory can be defined as an ordered list of pairs $(u_i, d_i)$, where $u_i$ represents the user input and $d_i$ the bot response for the $i$-th exchange. This list is denoted by $\mathcal{C}$:

$$ \mathcal{C} = [(u_1, d_1), (u_2, d_2), \ldots, (u_n, d_n)] $$

where $n$ is the total number of exchanges in the current history.

Memory Update

When a new exchange occurs, a new pair $(u_{n+1}, d_{n+1})$ is appended to the memory. If $\mathcal{C}$ already holds the predefined maximum number of exchanges $M_{\text{max}}$, the oldest exchange $(u_1, d_1)$ is removed to make room:

$$ \mathcal{C} = \begin{cases} \mathcal{C} \cup \{(u_{n+1}, d_{n+1})\}, & \text{if } |\mathcal{C}| < M_{\text{max}} \\ (\mathcal{C} \setminus \{(u_1, d_1)\}) \cup \{(u_{n+1}, d_{n+1})\}, & \text{if } |\mathcal{C}| = M_{\text{max}} \end{cases} $$
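The update rule can be sketched with a bounded `collections.deque`, which discards its oldest element once the maximum length is reached. This is a minimal illustration, not the package's implementation: `M_MAX`, `make_memory`, and `update` are hypothetical names, and the limit of 3 is chosen only to keep the example short.

```python
from collections import deque

M_MAX = 3  # illustrative limit; the package's actual value is not stated here


def make_memory(max_exchanges=M_MAX):
    """Create an empty bounded history of (user_input, bot_response) pairs."""
    return deque(maxlen=max_exchanges)


def update(memory, user_input, bot_response):
    """Append a pair; a full deque drops its oldest pair automatically."""
    memory.append((user_input, bot_response))
    return memory


memory = make_memory()
for i in range(1, 5):
    update(memory, f"u{i}", f"d{i}")

print(list(memory))  # [('u2', 'd2'), ('u3', 'd3'), ('u4', 'd4')]
```

After the fourth exchange the pair `('u1', 'd1')` has been evicted, which matches the second case of the formula.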

Word Count

To manage memory space and decide when compression is necessary, we calculate the total number of words $W(\mathcal{C})$ in memory:

$$ W(\mathcal{C}) = \sum_{(u_i, d_i) \in \mathcal{C}} (|u_i| + |d_i|) $$

where $|u_i|$ and $|d_i|$ are respectively the number of words in $u_i$ and $d_i$.
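As a rough illustration, $W(\mathcal{C})$ can be computed by splitting each message on whitespace. The function name `word_count` is hypothetical, and whitespace splitting is an assumption; the package's `memory_counter` may count words differently.

```python
def word_count(history):
    """Total number of whitespace-separated words across all (u_i, d_i) pairs."""
    return sum(len(u.split()) + len(d.split()) for u, d in history)


history = [
    ("Hello, how are you?", "I am fine, thank you!"),
    ("What is the weather?", "It is sunny."),
]

print(word_count(history))  # 4 + 5 + 4 + 3 = 16
```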

Memory Compression

When $W(\mathcal{C})$ exceeds a threshold $W_{\text{max}}$, the memory is compressed to maintain the relevance of the context. This compression is performed by a summarization model $\mathcal{S}$, such as BART:

$$ \mathcal{C}_{\text{compressed}} = \mathcal{S}(\mathcal{C}) $$

where $\mathcal{C}_{\text{compressed}}$ is the compressed version of the memory, reducing the total number of words while preserving the essence of past interactions.
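The threshold check and compression step might look like the sketch below. It is written under several assumptions: the names `compress_if_needed` and `first_words` are hypothetical, the summarizer is passed in as a plain callable, the whole history is collapsed into a single summary pair, and `first_words` is only a stand-in that truncates text so the example runs without downloading a model.

```python
def word_count(history):
    """Total number of whitespace-separated words in the history."""
    return sum(len(u.split()) + len(d.split()) for u, d in history)


def compress_if_needed(history, w_max, summarize):
    """Return the history unchanged, or one summary pair if it exceeds w_max words."""
    if word_count(history) <= w_max:
        return history
    transcript = " ".join(f"User: {u} Bot: {d}" for u, d in history)
    return [("[summary]", summarize(transcript))]


def first_words(text, limit=8):
    """Stand-in summarizer that keeps the first `limit` words. Not a real model."""
    return " ".join(text.split()[:limit])


history = [
    ("What is your return policy?", "You can return items within thirty days of purchase."),
    ("Do I need a receipt?", "Yes, please keep the original receipt."),
]

unchanged = compress_if_needed(history, w_max=100, summarize=first_words)
compressed = compress_if_needed(history, w_max=10, summarize=first_words)
print(word_count(history), word_count(compressed))
```

With the Hugging Face `transformers` library installed, a BART-based callable could presumably be substituted for `first_words`, for example by wrapping `pipeline("summarization", model="facebook/bart-large-cnn")` and returning the `summary_text` field of its first result.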

Integration into the Language Model

The language model uses the compressed context to generate relevant responses. The prompt $P$ used by the model is constructed as follows:

$$ P = f(\mathcal{C}_{\text{compressed}}, \text{context}) $$

where $\text{context}$ is additional context retrieved from a RAG pipeline, and $f$ is a concatenation function that prepares the text for the model.
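A concatenation function $f$ of this kind could be sketched as follows. The name `build_prompt`, the section labels, and the `User:`/`Bot:` prefixes are assumptions made for illustration; the actual prompt layout used with the package is not specified here.

```python
def build_prompt(compressed_history, context):
    """Concatenate retrieved context and conversation history into one prompt."""
    lines = []
    if context:
        lines.append("Context:")
        lines.append(context)
        lines.append("")  # blank line between the two sections
    lines.append("Conversation so far:")
    for user_input, bot_response in compressed_history:
        lines.append(f"User: {user_input}")
        lines.append(f"Bot: {bot_response}")
    return "\n".join(lines)


prompt = build_prompt([("Hi", "Hello!")], "Store hours: 9am to 5pm.")
print(prompt)
```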

This approach ensures that the chatbot always has an up-to-date conversational context, enabling more natural and engaging interactions with the user.


Release history

- 1.1.2: Feb 3, 2025
- 1.1.1: Jan 20, 2025 (this version)
- 1.0.9: Jan 20, 2025
- 1.0.6: Sep 13, 2024
- 1.0.5: Sep 13, 2024
- 1.0.4: Aug 22, 2024
- 1.0.3: Aug 21, 2024
- 1.0.2: Aug 21, 2024
- 1.0.1: Aug 21, 2024
- 0.1.0: Aug 21, 2024

Download files

Source Distribution

shortterm_memory-1.1.1.tar.gz (6.2 kB)

Uploaded Jan 20, 2025 Source

Built Distribution

shortterm_memory-1.1.1-py3-none-any.whl (6.5 kB)

Uploaded Jan 20, 2025 Python 3

File details

Details for the file shortterm_memory-1.1.1.tar.gz.

File metadata

Download URL: shortterm_memory-1.1.1.tar.gz
Upload date: Jan 20, 2025
Size: 6.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for shortterm_memory-1.1.1.tar.gz
| Algorithm | Hash digest |
| --- | --- |
| SHA256 | `18a00b3b19adfd004ce05d5659c4defe64c57798ab1ecb1e3ee97fe07ae244ee` |
| MD5 | `d70efdc89b11a7ca6a85e9a2a1883767` |
| BLAKE2b-256 | `388d2042ca429b0daffd69dd4e719da7d49cf399b6e035d7e4a8f3a33773cea4` |


File details

Details for the file shortterm_memory-1.1.1-py3-none-any.whl.

File metadata

Download URL: shortterm_memory-1.1.1-py3-none-any.whl
Upload date: Jan 20, 2025
Size: 6.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for shortterm_memory-1.1.1-py3-none-any.whl
| Algorithm | Hash digest |
| --- | --- |
| SHA256 | `9eddf1bed6066cfb65cb8bbe2c013a1a8890569a51e76830ee8b84dd98880c9b` |
| MD5 | `c56bfeb588f0d44ae7a5f55eaa3969e7` |
| BLAKE2b-256 | `1fb5fcd26c1850fe77ac1c41febfe3c008acf548cdcd116b33f6cb8e49880621` |


shortterm-memory 1.1.1
