A manager to easily handle multiple SQLAlchemy configurations

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

SQLAlchemy bind manager

This package provides an easy way to configure and use SQLAlchemy engines and sessions without depending on frameworks.

It is composed by two main components:

A manager class for SQLAlchemy engine configuration
A repository/unit-of-work pattern implementation for model retrieval and persistence

Installation

pip install sqlalchemy-bind-manager

Components maturity

SQLAlchemy manager: Implementation is mostly finalised, needs testing in production.
Repository / Unit of work: Implementation is mostly finalised, needs testing in production.

SQLAlchemy manager

Initialise the manager providing an instance of SQLAlchemyConfig

from sqlalchemy_bind_manager import SQLAlchemyConfig, SQLAlchemyBindManager

config = SQLAlchemyConfig(
    engine_url="sqlite:///./sqlite.db",
    engine_options=dict(connect_args={"check_same_thread": False}, echo=True),
    session_options=dict(expire_on_commit=False),
)

sa_manager = SQLAlchemyBindManager(config)

🚨 NOTE: Using global variables is not thread-safe, please read the Threading section if your application uses multi-threading.

engine_url and engine_options dictionary accept the same parameters as SQLAlchemy create_engine()

session_options dictionary accepts the same parameters as SQLALchemy sessionmaker()

Once the bind manager is initialised we can retrieve and use the SQLAlchemyBind using the method get_bind()

The SQLAlchemyBind class has the following attributes:

bind_async: A boolean property, True when the bind uses an async dialect (Note: async is not yet fully supported, see the section about asynchronous binds)
engine: The initialised SQLALchemy Engine
model_declarative_base: A base class that can be used to create declarative models
registry_mapper: The registry associated with the engine. It can be used with Alembic or to achieve imperative mapping
session_class: The class built by sessionmaker(), either Session or AsyncSession

The SQLAlchemyBindManager provides some helper methods to quickly access some of the bind properties without using the SQLAlchemyBind:

get_session: returns a Session object
get_mapper: returns the mapper associated with the bind

Example:

bind = sa_manager.get_bind()


class DeclarativeModel(bind.model_declarative_base):
    pass


class ImperativeModel:
    id: int


imperative_table = Table(
    "imperative",
    bind.registry_mapper.metadata,
    Column("id", Integer, primary_key=True),
    Column("name", String, primary_key=True),
)

bind.registry_mapper.map_imperatively(ImperativeModel, imperative_table)

# or using the get_mapper() helper method
sa_manager.get_mapper().map_imperatively(ImperativeModel, imperative_table)

# Persist an object
o = ImperativeModel()  # also o = DeclarativeModel()
o.name = "John"
with sa_manager.get_bind().session_class()() as session:
    session.add(o)
    session.commit()

# or using the get_session() helper method for better readability
with sa_manager.get_session() as session:
    session.add(o)
    session.commit()

Multiple database binds

SQLAlchemyBindManager accepts also multiple databases configuration, provided as a dictionary. The dictionary keys are used as a reference name for each bind.

from sqlalchemy_bind_manager import SQLAlchemyConfig, SQLAlchemyBindManager

config = {
    "default": SQLAlchemyConfig(
        engine_url="sqlite:///./sqlite.db",
        engine_options=dict(connect_args={"check_same_thread": False}, echo=True),
        session_options=dict(expire_on_commit=False),
    ),
    "secondary": SQLAlchemyConfig(
        engine_url="sqlite:///./secondary.db",
        engine_options=dict(connect_args={"check_same_thread": False}, echo=True),
        session_options=dict(expire_on_commit=False),
    ),
}

sa_manager = SQLAlchemyBindManager(config)

All the SQLAlchemyBindManager helper methods accept the bind_name optional parameter:

get_session(bind_name="default"): returns a Session or AsyncSession object
get_mapper(bind_name="default"): returns the mapper associated with the bind

Threading

Global variables in python are shared among threads. Neither SQLAlchemyBindManager class or the repositories implementation are thread safe on their own.

If your application uses multi-threads, the same SQLAlchemyBindManager object will be shared between different threads, together with the internal Session object, and changes on models could propagate among them, causing undesired operations.

Make sure you check Python Threading documentation, especially the Thread-local Data chapter to avoid issues.

Asynchronous database binds

Is it possible to supply configurations for asyncio supported engines.

config = SQLAlchemyAsyncConfig(
    engine_url="postgresql+asyncpg://scott:tiger@localhost/test",
)

This will make sure we have an AsyncEngine and an AsyncSession are initialised.

async with sa_manager.get_session() as session:
    session.add(o)
    session.commit()

Note that async implementation has several differences from the sync one, make sure to check SQLAlchemy asyncio documentation

Repository / Unit of work

The SQLAlchemyRepository and SQLAlchemyAsyncRepository class can be used simply by extending them.

from sqlalchemy_bind_manager import SQLAlchemyRepository


class MyModel(model_declarative_base):
    pass


class ModelRepository(SQLAlchemyRepository[MyModel]):
    _model = MyModel


repo_instance = ModelRepository(sqlalchemy_bind_manager.get_bind())

The classes provide some common use methods:

get: Retrieve a model by identifier
save: Persist a model
save_many: Persist multiple models in a single transaction
delete: Delete a model
find: Search for a list of models (basically an adapter for SELECT queries)

Session lifecycle in repositories

SQLAlchemy documentation recommends we create Session object at the beginning of a logical operation where database access is potentially anticipated.

Doing this too soon might cause unexpected effects, like unexpected updates being committed, if the initialised session is shared among different repositories.

A Repository represents a generic interface to persist data object to a storage, not necessarily using SQLAlchemy. It makes sense that the lifecycle of a Session follows the one of the Repository (If we create a Repository, we're going to do a DB operation, otherwise we don't need a Session)

This ensures the Session we use is isolated, and the same for all the operations we do with the same repository.

The session is automatically closed and reopen with each Repository operation, this make sure these operation are independent from each other.

These choices cause some consequences:

The operations that modify the database will reload the models from the DB, causing an additional SELECT query to be issued.
We can't use SQLAlchemy lazy loading, so we'll need to make sure relationship are always loaded eagerly, using either:
- Setup your model/table relationships to always use always eager loading
- Implement ad-hoc methods to deal with relationships as necessary

Also AsyncSession has the same limitation on lazy loading so it makes sense that the two repository implementations behave consistently.

Use the Unit Of Work to share a session among multiple repositories

It is possible we need to run several operations in a single database get_session. While a single repository provide by itself an isolated session for single operations, we have to use a different approach for multiple operations.

We can use the UnitOfWork or the AsyncUnitOfWork class to provide a shared session to be used for repository operations, assumed the same bind is used for all the repositories. (Two phase transactions are not currently supported).

class MyRepo(SQLAlchemyRepository):
    _model = MyModel
class MyOtherRepo(SQLAlchemyRepository):
    _model = MyOtherModel

bind = sa_manager.get_bind()
uow = UnitOfWork(bind, (MyRepo, MyOtherRepo))

with uow.transaction():
    uow.MyRepo.save(some_model)
    uow.MyOtherRepo.save(some_other_model)

# Optionally disable the commit/rollback handling
with uow.transaction(read_only=True):
    model1 = uow.MyRepo.get(1)
    model2 = uow.MyOtherRepo.get(2)

Both the UnitOfWork classes create an internal scoped_session or async_scoped_session, behaving in the same way at the repositories do. This provides the freedom to tune the session lifecycle based on our application requirements (e.g. one unit of work per http request, per domain, etc.)

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.10.0

Mar 23, 2024

0.9.1

Nov 21, 2023

0.9.0

Aug 19, 2023

0.8.0

Aug 15, 2023

0.7.0

Jul 22, 2023

0.6.0

Jul 8, 2023

0.5.0

Jul 5, 2023

0.4.0

Jul 3, 2023

0.3.2

Jul 3, 2023

0.3.1

Jun 24, 2023

0.3.0

Jun 18, 2023

0.2.1

Apr 25, 2023

0.2.0

Apr 25, 2023

This version

0.1.0

Feb 24, 2023

0.0.1

Feb 9, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sqlalchemy_bind_manager-0.1.0.tar.gz (15.2 kB view hashes)

Uploaded Feb 24, 2023 Source

Built Distribution

sqlalchemy_bind_manager-0.1.0-py3-none-any.whl (14.6 kB view hashes)

Uploaded Feb 24, 2023 Python 3

Hashes for sqlalchemy_bind_manager-0.1.0.tar.gz

Hashes for sqlalchemy_bind_manager-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`d49d97420cebb52d0cbf0e7a99362d3a6ec790c8b357d92eacf2577285e6e28d`
MD5	`afdcf41b8bba8b018c2039cbdff4ed26`
BLAKE2b-256	`2002d627771533361d86e609ee7f2a4024b01e5a9d1885c782270f1334eadcd7`

Hashes for sqlalchemy_bind_manager-0.1.0-py3-none-any.whl

Hashes for sqlalchemy_bind_manager-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d37f0f62f288715377ae4e92482fdd4dc8a2f9911cc0abff0be116e6dca65b58`
MD5	`3b0b1d3a806942f0d665ee46f52dba6a`
BLAKE2b-256	`59806a5349378bd36990b12f5bb9d35ce44f9fa554aadc6153b7f561021013bd`