The Python interface to the MessageDB Event Store and Message Store

These details have not been verified by PyPI

Project description

message-db-py

Message DB is a fully-featured event store and message store implemented in PostgreSQL for Pub/Sub, Event Sourcing, Messaging, and Evented Microservices applications.

message-db-py is a Python interface to the Message DB event store and message store, designed for easy integration into Python applications.

Installation

Use pip to install:

pip install message-db-py

Setting up Message DB database

Clone the Message DB repository to set up the database:

git clone git@github.com:message-db/message-db.git

More detailed instructions are in the Installation section of Message DB repo.

Running the database installation script creates the database, schema, table, indexes, functions, views, types, a user role, and limit the user's privileges to the message store's public interface.

The installation script is in the database directory of the cloned Message DB repo. Change directory to the message-db directory where you cloned the repo, and run the script:

database/install.sh

Make sure that your default Postgres user has administrative privileges.

Database Name

By default, the database creation tool will create a database named message_store.

If you prefer either a different database name, you can override the name using the DATABASE_NAME environment variable.

DATABASE_NAME=some_other_database database/install.sh

Uninstalling the Database

If you need to drop the database (for example, on a local dev machine):

database/uninstall.sh

If you're upgrading a previous version of the database:

database/update.sh

Docker Image

You can optionally use a Docker image with Message DB pre-installed and ready to go. This is especially helpful to run test cases locally.

The docker image is available in Docker Hub. The source is in Gitlab

Usage

The complete user guide for Message DB is available at http://docs.eventide-project.org/user-guide/message-db/.

Below is documentation for methods exposed through the Python API.

Quickstart

Here's a quick example of how to publish and read messages using Message-DB-py:

from message_db import MessageDB

# Initialize the database connection
store = MessageDB(CONNECTION_URL)

# Write a message
store.write("user_stream", "register", {"name": "John Doe"})

# Read a message
message = store.read_last_message("user_stream")
print(message)

Write messages

The write method is used to append a new message to a specified stream within the message database. This method ensures that the message is written with the appropriate type, data, and metadata, and optionally, at a specific expected version of the stream.

def write(
    self,
    stream_name: str,
    message_type: str,
    data: Dict,
    metadata: Dict | None = None,
    expected_version: int | None = None,
) -> int:
    """Write a message to a stream."""

Parameters:

stream_name (str): The name of the stream to which the message will be written. This identifies the logical series of messages.
message_type (str): The type of message being written. Typically, this reflects the nature of the event or data change the message represents.
data (Dict): The data payload of the message. This should be a dictionary containing the actual information the message carries.
metadata (Dict | None): Optional. Metadata about the message, provided as a dictionary. Metadata can include any additional information that is not part of the main data payload, such as sender information or timestamps. Defaults to None.
expected_version (int | None): Optional. The version of the stream where the client expects to write the message. This is used for concurrency control and ensuring the integrity of the stream's order. Defaults to None.

Returns:

position (int): The position (or version number) of the message in the stream after it has been successfully written.

message_db = MessageDB(connection_pool=my_pool)
stream_name = "user_updates"
message_type = "UserCreated"
data = {"user_id": 123, "username": "example"}
metadata = {"source": "web_app"}

position = message_db.write(stream_name, message_type, data, metadata)

print("Message written at position:", position)

Read messages from a stream or category

The read method retrieves messages from a specified stream or category. This method supports flexible query options through a direct SQL parameter or by determining the SQL based on the stream name and its context (stream vs. category vs. all messages).

def read(
    self,
    stream_name: str,
    sql: str | None = None,
    position: int = 0,
    no_of_messages: int = 1000,
) -> List[Dict[str, Any]]:
    """Read messages from a stream or category.

    Returns a list of messages from the stream or category starting from the given position.
    """

Parameters:

stream_name (str): The identifier for the stream or category from which messages are to be retrieved. Special names like "$all" can be used to fetch messages across all streams.
sql (str | None, optional): An optional SQL query string that if provided, overrides the default SQL generation based on the stream_name. If None, the SQL is automatically generated based on the stream_name value. Defaults to None.
position (int, optional): The starting position in the stream or category from which to begin reading messages. Defaults to 0.
no_of_messages (int, optional): The maximum number of messages to retrieve. Defaults to 1000.

Returns:

List[Dict[str, Any]]: A list of messages, where each message is represented as a dictionary containing details such as the message ID, stream name, type, position, global position, data, metadata, and timestamp.

message_db = MessageDB(connection_pool=my_pool)
stream_name = "user-updates"
position = 10
no_of_messages = 50

# Reading from a specific stream
messages = message_db.read(stream_name, position=position, no_of_messages=no_of_messages)

# Custom SQL query
custom_sql = "SELECT * FROM get_stream_messages(%(stream_name)s, %(position)s, %(batch_size)s);"
messages = message_db.read(stream_name, sql=custom_sql, position=position, no_of_messages=no_of_messages)

for message in messages:
    print(message)

Read Last Message from stream

The read_last_message method retrieves the most recent message from a specified stream. This method is useful when you need the latest state or event in a stream without querying the entire message history.

def read_last_message(self, stream_name: str) -> Dict[str, Any] | None:
    """Read the last message from a stream."""

Parameters:

stream_name (str): The name of the stream from which the last message is to be retrieved.

Returns:

Dict[str, Any] | None: A dictionary representing the last message in the specified stream. If the stream is empty or the message does not exist, None is returned.

message_db = MessageDB(connection_pool=my_pool)
stream_name = "user_updates"

# Reading the last message from a stream
last_message = message_db.read_last_message(stream_name)

if last_message:
    print("Last message data:", last_message)
else:
    print("No messages found in the stream.")

Utility APIs

Read Stream
Read Category
Write Batch

Read Stream (Utility)

The read_stream method retrieves a sequence of messages from a specified stream within the message database. This method is specifically designed to fetch messages from a well-defined stream based on a starting position and a specified number of messages.

def read_stream(
    self, stream_name: str, position: int = 0, no_of_messages: int = 1000
) -> List[Dict[str, Any]]:
    """Read messages from a stream.

    Returns a list of messages from the stream starting from the given position.
    """

Parameters:

stream_name (str): The name of the stream from which messages are to be retrieved. This name must include a hyphen (-) to be recognized as a valid stream identifier.
position (int, optional): The zero-based index position from which to start reading messages. Defaults to 0, which starts reading from the beginning of the stream.
no_of_messages (int, optional): The maximum number of messages to retrieve from the stream. Defaults to 1000.

Returns:

List[Dict[str, Any]]: A list of dictionaries, each representing a message retrieved from the stream. Each dictionary contains the message details structured in key-value pairs.

Exceptions:

ValueError: Raised if the provided stream_name does not contain a hyphen (-), which is required to validate the name as a stream identifier.

message_db = MessageDB(connection_pool=my_pool)
stream_name = "user-updates-2023"
position = 0
no_of_messages = 100

messages = message_db.read_stream(stream_name, position, no_of_messages)

for message in messages:
    print(message)

Read Category (Utility)

The read_category method retrieves a sequence of messages from a specified category within the message database. It is designed to fetch messages based on a category identifier, starting from a specific position, and up to a defined limit of messages.

def read_category(
    self, category_name: str, position: int = 0, no_of_messages: int = 1000
) -> List[Dict[str, Any]]:
    """Read messages from a category.

    Returns a list of messages from the category starting from the given position.
    """

Parameters:

category_name (str): The name of the category from which messages are to be retrieved. This identifier should not include a hyphen (-) to validate it as a category name.
position (int, optional): The zero-based index position from which to start reading messages within the category. Defaults to 0.
no_of_messages (int, optional): The maximum number of messages to retrieve from the category. Defaults to 1000.

Returns:

List[Dict[str, Any]]: A list of dictionaries, each representing a message. Each dictionary includes details about the message such as the message ID, stream name, type, position, global position, data, metadata, and time of creation.

Exceptions:

ValueError: Raised if the provided category_name contains a hyphen (-), which is not allowed for category identifiers and implies a misunderstanding between streams and categories.

message_db = MessageDB(connection_pool=my_pool)
category_name = "user_updates"
position = 0
no_of_messages = 100

# Reading messages from a category
messages = message_db.read_category(category_name, position, no_of_messages)

for message in messages:
    print(message)

Write Batch (Utility)

The write_batch method is designed to write a series of messages to a specified stream in a batch operation. It ensures atomicity in writing operations, where all messages are written in sequence, and each subsequent message can optionally depend on the position of the last message written. This method is useful when multiple messages need to be written as a part of a single transactional context.

def write_batch(
    self, stream_name, data, expected_version: int | None = None
) -> int:
    """Write a batch of messages to a stream."""

Parameters:

stream_name (str): The name of the stream to which the batch of messages will be written.
data (List[Tuple[str, Dict, Dict | None]]): A list of tuples, where each tuple represents a message. The tuple format is (message_type, data, metadata), with metadata being optional.
expected_version (int | None, optional): The version of the stream where the batch operation expects to start writing. This can be used for concurrency control to ensure messages are written in the expected order. Defaults to None.

Returns:

position (int): The position (or version number) of the last message written in the stream as a result of the batch operation.

message_db = MessageDB(connection_pool=my_pool)
stream_name = "order_events"
data = [
    ("OrderCreated", {"order_id": 123, "product_id": 456}, None),
    ("OrderShipped",
        {"order_id": 123, "shipment_id": 789},
        {"priority": "high"}
    ),
    ("OrderDelivered", {"order_id": 123, "delivery_date": "2024-04-23"}, None)
]

# Writing a batch of messages to a stream
last_position = message_db.write_batch(stream_name, data)

print(f"Last message written at position: {last_position}")

Consumer Groups

Consumer groups enable horizontal scaling by distributing the processing load of a single category among multiple consumers. This allows parallel processing of messages while ensuring that each stream is processed by exactly one consumer in the group.

How Consumer Groups Work

Consumer groups use consistent hashing to assign streams to consumers:

Each stream's cardinal ID (the part after the first hyphen) is hashed to a 64-bit integer
The hash is divided by the group size using modulo division
The result determines which consumer processes that stream
The same stream always maps to the same consumer, ensuring consistency

Using Consumer Groups

To use consumer groups with read_category, specify both the consumer_group_member (zero-based consumer identifier) and consumer_group_size (total number of consumers):

message_db = MessageDB(connection_pool=my_pool)
category_name = "user_updates"

# Consumer 0 in a group of 3
messages_0 = message_db.read_category(
    category_name,
    consumer_group_member=0,
    consumer_group_size=3
)

# Consumer 1 in a group of 3
messages_1 = message_db.read_category(
    category_name,
    consumer_group_member=1,
    consumer_group_size=3
)

# Consumer 2 in a group of 3
messages_2 = message_db.read_category(
    category_name,
    consumer_group_member=2,
    consumer_group_size=3
)

Consumer Group Parameters

consumer_group_member (int | None): Zero-based consumer identifier (0, 1, 2, etc.)
- Must be >= 0
- Must be less than consumer_group_size
consumer_group_size (int | None): Total number of consumers in the group
- Must be > 0

Important: Both parameters must be provided together or both must be None. Providing only one will raise a ValueError.

Complete Example

Here's a complete example of using consumer groups for parallel processing:

from message_db import MessageDB
import threading

# Initialize the database connection
message_db = MessageDB.from_url("postgresql://message_store@localhost:5432/message_store")

# Define a consumer function
def process_messages(consumer_id, group_size):
    while True:
        messages = message_db.read_category(
            "user_updates",
            position=get_last_processed_position(consumer_id),
            no_of_messages=100,
            consumer_group_member=consumer_id,
            consumer_group_size=group_size
        )

        if not messages:
            break

        for message in messages:
            # Process each message
            print(f"Consumer {consumer_id} processing: {message['data']}")

        # Update the last processed position
        update_last_processed_position(consumer_id, messages[-1]['global_position'])

# Run 3 consumers in parallel
group_size = 3
threads = []

for consumer_id in range(group_size):
    thread = threading.Thread(
        target=process_messages,
        args=(consumer_id, group_size)
    )
    thread.start()
    threads.append(thread)

# Wait for all consumers to finish
for thread in threads:
    thread.join()

Benefits of Consumer Groups

Horizontal Scaling: Distribute processing across multiple instances
No Duplication: Each stream is processed by exactly one consumer
Consistency: The same stream always goes to the same consumer
Parallel Processing: Multiple consumers can process different streams simultaneously
Fault Tolerance: If a consumer fails, you can redistribute the group

Best Practices

Group Size: Choose a group size that matches your processing capacity
Position Tracking: Each consumer should track its own position independently
Error Handling: Implement retry logic for failed message processing
Monitoring: Monitor each consumer's progress and lag separately
Deployment: Deploy consumers as separate processes or containers

License

MIT

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.3.4

Mar 17, 2026

0.3.3

Mar 6, 2026

0.3.2

Feb 21, 2026

0.3.1

Feb 21, 2026

0.3.0

Feb 20, 2026

0.2.0

Apr 24, 2024

0.1.4

Apr 24, 2024

0.1.3

Apr 24, 2024

0.1.2

Jan 27, 2022

0.1.1

Jan 20, 2022

0.1.0

Nov 21, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

message_db_py-0.3.4.tar.gz (14.8 kB view details)

Uploaded Mar 17, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

message_db_py-0.3.4-py3-none-any.whl (11.5 kB view details)

Uploaded Mar 17, 2026 Python 3

File details

Details for the file message_db_py-0.3.4.tar.gz.

File metadata

Download URL: message_db_py-0.3.4.tar.gz
Upload date: Mar 17, 2026
Size: 14.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.3.1 CPython/3.13.11 Darwin/25.3.0

File hashes

Hashes for message_db_py-0.3.4.tar.gz
Algorithm	Hash digest
SHA256	`b1db0e241db5d1d292fe13b61eb76a347c56edc5c01a792e6f14703947834190`
MD5	`94c0b9a0fd0cbe316dc700cf7bf330d3`
BLAKE2b-256	`43aa36cc296e9c1be337abbd5cacc0a31dbbdba9d61508ac1e351e8d507bdc11`

See more details on using hashes here.

File details

Details for the file message_db_py-0.3.4-py3-none-any.whl.

File metadata

Download URL: message_db_py-0.3.4-py3-none-any.whl
Upload date: Mar 17, 2026
Size: 11.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.3.1 CPython/3.13.11 Darwin/25.3.0

File hashes

Hashes for message_db_py-0.3.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`14210e5b222cdf37b8140e0449f3ab716e966325517b9d23d5c6c5490e07db20`
MD5	`540503d2bb515ad556db1e8eb31c6dac`
BLAKE2b-256	`6d7304959a13fce705fbc72708861a4c2fe335e041ae5f7dd414eb72aa7cf0af`

See more details on using hashes here.

message-db-py 0.3.4

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

message-db-py

Installation

Setting up Message DB database

Database Name

Uninstalling the Database

Docker Image

Usage

Quickstart

Primary APIs

Write messages

Read messages from a stream or category

Read Last Message from stream

Utility APIs

Read Stream (Utility)

Read Category (Utility)

Write Batch (Utility)

Consumer Groups

How Consumer Groups Work

Using Consumer Groups

Consumer Group Parameters

Complete Example

Benefits of Consumer Groups

Best Practices

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes