Skip to main content

Database trigger and input/output bindings for Azure Functions Python v2, powered by SQLAlchemy

Project description

Azure Functions DB

PyPI Python Version CI Release codecov pre-commit Docs License: MIT

SQLAlchemy-powered database integration helpers for Azure Functions Python v2 — binding-style decorators for input/output/client injection and a poll-based pseudo trigger that works with any database that ships a SQLAlchemy dialect.

Not a native Azure Functions binding extension. This package does not register native Azure Functions bindings with the Functions host. The @db.input / @db.output / @db.trigger decorators are Python function wrappers that resolve data, inject writers, or poll for changes around your handler. For runtime-native Azure SQL bindings, use the official extension (see below).


Part of the Azure Functions Python DX Toolkit → Bring FastAPI-like developer experience to Azure Functions

Why this exists

Microsoft already ships official Azure SQL bindings — including a SQL trigger backed by SQL Change Tracking — for Azure SQL Database and SQL Server. If those cover your scenario, prefer them.

This package fills a different gap. Python teams on Azure Functions still have to hand-roll integration when they need:

  • No generic SQLAlchemy trigger — the official SQL bindings target Azure SQL / SQL Server only.
  • No unified multi-dialect binding layer — PostgreSQL, MySQL, SQLite, Oracle, DuckDB, CockroachDB, and other SQLAlchemy dialects each require custom code.
  • No SQLAlchemy-native reader/writer injection pattern for the Python v2 programming model.
  • No first-class polling primitive — checkpoint, lease, batching, idempotency, and at-least-once delivery built on top of the existing timer trigger.

When to use this

Use azure-functions-db when:

  • You need PostgreSQL, MySQL, SQLite, Oracle, DuckDB, CockroachDB, or another SQLAlchemy dialect.
  • You want Python decorator-based input / output / client injection that maps cleanly onto v2 handlers.
  • You want timer-driven polling with checkpoint and lease control instead of a host-managed trigger extension.
  • You want local-first, testable DB integration without depending on a native Functions extension.

Use the official Azure SQL bindings when:

  • You target Azure SQL Database or SQL Server.
  • You want a runtime-native SQL trigger / input binding / output binding registered with the Functions host.
  • You rely on SQL Change Tracking and extension-managed scaling for your trigger source.

What it does

  • Poll-based pseudo trigger — change detection via cursor-column polling, with checkpoint, lease, batching, and at-least-once delivery on top of an Azure Functions timer trigger.
  • Any SQLAlchemy database — PostgreSQL, MySQL, SQL Server out of the box; Oracle, CockroachDB, DuckDB, and any other dialect with one extra pip install.
  • Single pip install — one package with optional extras for the common database drivers.
  • Input-style data injection@db.input resolves query results and passes them to your handler.
  • Output-style writer injection@db.output injects a DbOut writer; you call .set(...) to write explicitly.
  • Client injection@db.inject_reader / @db.inject_writer provide imperative DbReader / DbWriter clients for multi-statement reads or transactional writes.

Compared with official Azure SQL bindings

Capability Official Azure SQL bindings azure-functions-db
Azure SQL / SQL Server Native, runtime-managed Supported via SQLAlchemy + pyodbc
PostgreSQL Not supported Built-in extra ([postgres])
MySQL Not supported Built-in extra ([mysql])
SQLite Not supported Supported via SQLAlchemy
Oracle / DuckDB / CockroachDB Not supported BYOD — install dialect, pass SQLAlchemy URL
Custom non-SQL source Not supported Implement SourceAdapter for db.trigger(...)
Runtime-native binding registration Yes (via Functions host extension) No — Python decorator wrapper
Trigger mechanism SQL Change Tracking Cursor-column polling on a timer trigger
Scaling integration Functions extension scale controller Driven by your timer trigger schedule
Delivery guarantee Per official docs At-least-once (handlers must be idempotent)
Checkpoint storage Managed by extension Azure Blob Storage (BlobCheckpointStore)

Choose your integration path

Path When to use What to do
Built-in extras PostgreSQL, MySQL, or SQL Server pip install azure-functions-db[postgres] and go
Bring your own SQLAlchemy database Oracle, CockroachDB, DuckDB, or any other RDBMS with a SQLAlchemy dialect Install the driver, use the SQLAlchemy connection URL
Custom trigger source (triggers only) Non-SQL sources (MongoDB, Kafka, REST APIs) Implement the SourceAdapter Protocol for db.trigger()

Bring your own database

The bindings and SqlAlchemySource are designed to work with any database that has a SQLAlchemy dialect. The built-in extras just bundle common drivers for convenience.

Three steps:

  1. Install the driver — e.g. pip install oracledb for Oracle
  2. Use the SQLAlchemy URL — e.g. url="oracle+oracledb://user:pass@host/db"
  3. Pass engine options if needed — use engine_kwargs for driver-specific settings
from azure_functions_db import DbBindings

db = DbBindings()

@db.input("rows", url="oracle+oracledb://user:pass@host:1521/mydb",
          query="SELECT * FROM orders WHERE status = :status",
          params={"status": "pending"})
def read_oracle_orders(rows: list[dict]) -> None:
    for row in rows:
        print(row)

The same applies to triggers — SqlAlchemySource accepts any SQLAlchemy URL:

from azure_functions_db import SqlAlchemySource

source = SqlAlchemySource(
    url="oracle+oracledb://user:pass@host:1521/mydb",
    table="orders",
    cursor_column="updated_at",
    pk_columns=["id"],
)

Note: The built-in extras (PostgreSQL, MySQL, SQL Server) are the tested path. Other dialects work through SQLAlchemy compatibility but are not explicitly tested by this project. Exact connection URL syntax varies by driver — check your driver's documentation.

See examples/byod_oracle/ for a complete runnable Function App and examples/usage_byod.py for a minimal standalone script.

Custom trigger source

If your data source has no SQLAlchemy dialect, implement the SourceAdapter protocol and pass it directly to db.trigger(source=...). This applies only to the trigger feature. See the Adapter SDK for the full contract.

Shared Core

azure-functions-db-python now exposes shared infrastructure for upcoming bindings. Use DbConfig for normalized connection settings and EngineProvider when multiple components should reuse the same lazily created SQLAlchemy engine.

Installation

# Core package (pick your database)
pip install azure-functions-db[postgres]
pip install azure-functions-db[mysql]
pip install azure-functions-db[mssql]

# Multiple databases
pip install azure-functions-db[postgres,mysql]

# All drivers
pip install azure-functions-db[all]

Your Function App dependencies should include:

azure-functions
azure-functions-db[postgres]

Quick Start

Which decorator to use?

Need Decorator Mode
Read data into handler input Input-style data injection
Write data to DB output Output-style writer injection
Complex reads (multiple queries) inject_reader Imperative client injection
Complex writes (transactions) inject_writer Imperative client injection
React to DB changes trigger Poll-based pseudo trigger

All decorators are Python function wrappers. They are not registered as native Azure Functions bindings with the host.

Input-style data injection

@db.input injects the actual query result into your handler — no client needed.

Row lookup mode — fetch a single row by primary key:

from azure_functions_db import DbBindings

db = DbBindings()

# Static primary key
@db.input("user", url="%DB_URL%", table="users", pk={"id": 42})
def load_user(user: dict | None) -> None:
    if user:
        print(user["name"])

# Dynamic primary key — resolved from handler kwargs
@db.input("user", url="%DB_URL%", table="users",
             pk=lambda req: {"id": req.params["id"]})
def get_user(req, user: dict | None) -> None:
    print(user)

Query mode — fetch multiple rows with SQL:

# Multiple rows by SQL query
@db.input("users", url="%DB_URL%",
             query="SELECT * FROM users WHERE active = :active",
             params={"active": True})
def list_active_users(users: list[dict]) -> None:
    for user in users:
        print(user["email"])

Output-style writer injection

@db.output injects a DbOut instance into your handler — call .set() to write explicitly.

from azure_functions_db import DbBindings, DbOut

db = DbBindings()

# Insert — call .set() with a dict for single row, list[dict] for batch
@db.output("out", url="%DB_URL%", table="orders")
def create_order(out: DbOut) -> str:
    out.set({"id": 1, "status": "pending", "total": 99.99})
    return "Created"

# Upsert — set action and conflict_columns
@db.output("out", url="%DB_URL%", table="orders",
              action="upsert", conflict_columns=["id"])
def upsert_orders(out: DbOut) -> str:
    out.set([
        {"id": 1, "status": "shipped", "total": 99.99},
        {"id": 2, "status": "pending", "total": 49.99},
    ])
    return "Upserted"

The handler's return value is independent of the write — use it for HTTP responses or anything else:

import azure.functions as func
from azure_functions_db import DbBindings, DbOut

db = DbBindings()

@db.output("out", url="%DB_URL%", table="orders")
def create_order(req: func.HttpRequest, out: DbOut) -> func.HttpResponse:
    out.set({"id": 1, "status": "pending"})
    return func.HttpResponse("Created", status_code=201)

Supported upsert dialects: PostgreSQL, SQLite, MySQL.

Client injection (imperative escape hatches)

For complex operations (multiple queries, transactions, update/delete), use inject_reader / inject_writer to receive a client instance:

from azure_functions_db import DbBindings, DbReader, DbWriter

db = DbBindings()

@db.inject_reader("reader", url="%DB_URL%", table="users")
def complex_read(reader: DbReader) -> None:
    user = reader.get(pk={"id": 42})
    orders = reader.query("SELECT * FROM orders WHERE user_id = :uid", params={"uid": 42})

@db.inject_writer("writer", url="%DB_URL%", table="orders")
def complex_write(writer: DbWriter) -> None:
    writer.insert(data={"id": 1, "status": "pending"})
    writer.update(data={"status": "shipped"}, pk={"id": 1})
    writer.delete(pk={"id": 1})

Trigger (poll-based pseudo trigger)

This is a pseudo trigger, not a native Azure Functions trigger. @db.trigger does not register a binding with the Functions host. It must be stacked on top of a real Azure Functions trigger (typically @app.schedule / timer) that fires the polling loop.

Delivery is at-least-once. Duplicates may occur during process crashes, lease transitions, or checkpoint commit failures. Handlers must be idempotent. See Semantics — Duplicate Windows.

import azure.functions as func
from azure.storage.blob import ContainerClient
from azure_functions_db import BlobCheckpointStore, DbBindings, RowChange, SqlAlchemySource

app = func.FunctionApp()
db = DbBindings()

source = SqlAlchemySource(
    url="%ORDERS_DB_URL%",
    table="orders",
    schema="public",
    cursor_column="updated_at",
    pk_columns=["id"],
)

checkpoint_store = BlobCheckpointStore(
    container_client=ContainerClient.from_connection_string(
        conn_str="%AzureWebJobsStorage%",
        container_name="db-state",
    ),
    source_fingerprint=source.source_descriptor.fingerprint,
)

@app.function_name(name="orders_poll")
@app.schedule(schedule="0 */1 * * * *", arg_name="timer", use_monitor=True)
@db.trigger(arg_name="events", source=source, checkpoint_store=checkpoint_store)
def orders_poll(timer: func.TimerRequest, events: list[RowChange]) -> None:
    for event in events:
        # Idempotent processing required: the same `event` may be delivered more than once.
        print(f"Order {event.pk}: {event.op}")

See Python API Spec for the full API reference.

Combined: Trigger + writer injection

Process database changes and write results to another table. Uses EngineProvider for shared connection pooling.

import azure.functions as func
from azure.storage.blob import ContainerClient

from azure_functions_db import (
    BlobCheckpointStore,
    DbBindings,
    DbOut,
    EngineProvider,
    RowChange,
    SqlAlchemySource,
)

app = func.FunctionApp()
db = DbBindings()

engine_provider = EngineProvider()

source = SqlAlchemySource(
    url="%SOURCE_DB_URL%",
    table="orders",
    cursor_column="updated_at",
    pk_columns=["id"],
    engine_provider=engine_provider,
)

checkpoint_store = BlobCheckpointStore(
    container_client=ContainerClient.from_connection_string(
        conn_str="%AzureWebJobsStorage%",
        container_name="db-state",
    ),
    source_fingerprint=source.source_descriptor.fingerprint,
)

@app.function_name(name="orders_poll")
@app.schedule(schedule="0 */1 * * * *", arg_name="timer", use_monitor=True)
@db.trigger(arg_name="events", source=source, checkpoint_store=checkpoint_store)
@db.output(
    "out",
    url="%DEST_DB_URL%",
    table="processed_orders",
    action="upsert",
    conflict_columns=["order_id"],
    engine_provider=engine_provider,
)
def orders_poll(timer: func.TimerRequest, events: list[RowChange], out: DbOut) -> None:
    out.set([
        {
            "order_id": event.pk["id"],
            "customer": event.after["name"],
            "processed_at": str(event.cursor),
        }
        for event in events
        if event.after is not None
    ])

See examples/trigger_with_binding/ for a complete runnable sample.

Built-in Extras

These databases have pre-packaged driver dependencies. Install the matching extra and you're ready to go.

Database Extra Driver
PostgreSQL azure-functions-db[postgres] psycopg
MySQL azure-functions-db[mysql] PyMySQL
SQL Server azure-functions-db[mssql] pyodbc

Any other database with a SQLAlchemy dialect works too — just install the driver yourself. See Choose your integration path.

Scope

  • Azure Functions Python v2 programming model
  • Timer-triggered functions for poll-based change detection
  • SQLAlchemy 2.0+ for database abstraction
  • Checkpoint storage via Azure Blob Storage
  • Read/write bindings via HTTP/Queue/Event triggers

This package does not implement a native Azure Functions trigger extension. It uses a poll-based approach on top of the existing timer trigger.

Observability

azure-functions-db-python exposes structured log helpers plus a lightweight MetricsCollector protocol so you can connect your own metrics backend without adding hard dependencies.

from collections.abc import Mapping

from azure_functions_db import MetricsCollector, PollTrigger


class PrintMetricsCollector:
    def increment(
        self, name: str, value: float = 1, *, labels: Mapping[str, str] | None = None
    ) -> None:
        print("increment", name, value, labels)

    def observe(
        self, name: str, value: float, *, labels: Mapping[str, str] | None = None
    ) -> None:
        print("observe", name, value, labels)

    def set_gauge(
        self, name: str, value: float, *, labels: Mapping[str, str] | None = None
    ) -> None:
        print("gauge", name, value, labels)


trigger = PollTrigger(
    name="orders",
    source=source,
    checkpoint_store=checkpoint_store,
    metrics=PrintMetricsCollector(),
)

Key Design Decisions

  • Pseudo trigger — timer-based polling instead of native C# extension (ADR-001)
  • SQLAlchemy-centric — single ORM layer for all databases (ADR-002)
  • Blob checkpoint — Azure Blob Storage for checkpoint persistence (ADR-003)
  • At-least-once — default delivery guarantee with idempotency support (ADR-004)
  • Unified package — trigger + binding in one package (ADR-005)

Duplicate Handling

This package provides at-least-once delivery for the polling trigger. Duplicates may occur during process crashes, lease transitions, or checkpoint commit failures. Handlers must be idempotent. Recommended patterns:

  • Use the row primary key (event.pk) plus event.cursor as a deduplication key in your sink.
  • Wrap downstream writes in a transaction with a unique constraint that you can swallow.
  • For batch writes, prefer upsert (action="upsert" with conflict_columns=...) over plain insert.

See Semantics — Duplicate Windows for the full guarantee model and the windows in which duplicates can be observed.

Documentation

Ecosystem

Part of the Azure Functions Python DX Toolkit:

Package Role
azure-functions-openapi-python OpenAPI spec generation and Swagger UI
azure-functions-validation-python Request/response validation and serialization
azure-functions-db-python SQLAlchemy-powered DB integration helpers (poll-based pseudo trigger, input/output/client injection)
azure-functions-langgraph-python LangGraph deployment adapter for Azure Functions
azure-functions-scaffold-python Project scaffolding CLI
azure-functions-logging-python Structured logging and observability
azure-functions-doctor-python Pre-deploy diagnostic CLI
azure-functions-durable-graph-python Manifest-first graph runtime with Durable Functions (experimental)
azure-functions-cookbook-python Recipes and examples

Disclaimer

This project is an independent community project and is not affiliated with, endorsed by, or maintained by Microsoft.

Azure and Azure Functions are trademarks of Microsoft Corporation.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

azure_functions_db-0.2.2.tar.gz (280.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

azure_functions_db-0.2.2-py3-none-any.whl (9.0 kB view details)

Uploaded Python 3

File details

Details for the file azure_functions_db-0.2.2.tar.gz.

File metadata

  • Download URL: azure_functions_db-0.2.2.tar.gz
  • Upload date:
  • Size: 280.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for azure_functions_db-0.2.2.tar.gz
Algorithm Hash digest
SHA256 dcf88f64e64c08f8963a7097ebf193870b317ff372e043b32dad116d81ccd0b0
MD5 b0cdd225e67e686a1ff9dc555b5278d8
BLAKE2b-256 b3773e265a70f949b6908508f31d97ddff9b5f41a0161423c36829e5f00064e9

See more details on using hashes here.

Provenance

The following attestation bundles were made for azure_functions_db-0.2.2.tar.gz:

Publisher: publish-pypi.yml on yeongseon/azure-functions-db-python

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file azure_functions_db-0.2.2-py3-none-any.whl.

File metadata

File hashes

Hashes for azure_functions_db-0.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 06034be67b66878dc9a795b33a770a8be014d1c2bd79dd47871ce1e5c140515e
MD5 40e6711eacb6523db2624327444639e6
BLAKE2b-256 a957333258c1fc673c5d9b187e0aa7b4b1c7cd24d7eb95b4b01c016cc7b357cb

See more details on using hashes here.

Provenance

The following attestation bundles were made for azure_functions_db-0.2.2-py3-none-any.whl:

Publisher: publish-pypi.yml on yeongseon/azure-functions-db-python

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page