Typed SQLAlchemy helpers for ParadeDB

These details have not been verified by PyPI

Project links

Project description

sqlalchemy-paradedb

ParadeDB integration for SQLAlchemy: typed helpers for BM25 indexes, search predicates, scoring, snippets, facets, and migration ergonomics.

Requirements & Compatibility

Component	Supported
Python	3.10+
SQLAlchemy	2.0.32+
ParadeDB	0.21.0+ (`pg_search`)
PostgreSQL	17+ (with ParadeDB extension)

Installation

uv add sqlalchemy-paradedb

For local development:

uv sync --extra test --extra dev

Quick Start

Prerequisites

Install pg_search in your Postgres database and connect SQLAlchemy to that database.

Create a BM25 Index

from sqlalchemy import Index
from paradedb.sqlalchemy import indexing

products_bm25_idx = Index(
    "products_bm25_idx",
    indexing.BM25Field(Product.id),
    indexing.BM25Field(
        Product.description,
        tokenizer=indexing.tokenize.unicode(lowercase=True),
    ),
    indexing.BM25Field(
        Product.category,
        tokenizer=indexing.tokenize.literal(),
    ),
    postgresql_using="bm25",
    postgresql_with={"key_field": "id"},
)

For JSON columns named metadata, use metadata_ as the ORM attribute name.

Query with ParadeDB Predicates

from sqlalchemy import select
from sqlalchemy.orm import Session
from paradedb.sqlalchemy import pdb, search

stmt = (
    select(Product.id, Product.description)
    .where(search.match_any(Product.description, "running", "shoes"))
    .order_by(pdb.score(Product.id).desc())
    .limit(10)
)

with Session(engine) as session:
    rows = session.execute(stmt).all()

Rows + Facets in a Single Query

from sqlalchemy import select
from sqlalchemy.orm import Session
from paradedb.sqlalchemy import facets, search

base = (
    select(Product.id, Product.description)
    .where(search.match_all(Product.description, "running"))
    .order_by(Product.id)
    .limit(10)
)

stmt = facets.with_rows(
    base,
    agg=facets.multi(
        facets.value_count(field="id"),
        facets.terms(field="category", size=10),
    ),
    key_field=Product.id,
)

with Session(engine) as session:
    rows = session.execute(stmt).all()
    facet_payload = facets.extract(rows)

Search Patterns

Fuzzy Matching

from paradedb.sqlalchemy import search

search.term(Product.description, "shose", distance=1)
search.match_any(Product.description, "wirless", distance=1, prefix=True)
search.term(Product.description, "rnnuing", distance=1, transpose_cost_one=True)

Use fuzzy options on term, match_any, or match_all; there is no separate search.fuzzy(...) helper.

Phrase Prefix and More-Like-This

from paradedb.sqlalchemy import search

search.phrase_prefix(Product.description, ["running", "sh"])
search.more_like_this(Product.id, document_id=1, fields=["description"])

Proximity Composition

from sqlalchemy import select
from paradedb.sqlalchemy import search

prox = search.prox_array("running").within(1, search.prox_regex("sho.*"), ordered=True)
stmt = select(Product.id).where(search.proximity(Product.description, prox))

Indexing and Tokenizers

Tokenizer config can be expressed as a structured mapping:

from sqlalchemy import Index
from paradedb.sqlalchemy import indexing

products_bm25_idx = Index(
    "products_bm25_idx",
    indexing.BM25Field(Product.id),
    indexing.BM25Field(
        Product.description,
        tokenizer=indexing.tokenize.from_config(
            {
                "tokenizer": "simple",
                "filters": ["lowercase", "stemmer"],
                "stemmer": "english",
                "alias": "description_simple",
            }
        ),
    ),
    indexing.BM25Field(
        Product.description,
        tokenizer=indexing.tokenize.from_config(
            {
                "tokenizer": "ngram",
                "args": [3, 8],
                "named_args": {"prefix_only": True},
                "alias": "description_ngram",
            }
        ),
    ),
    postgresql_using="bm25",
    postgresql_with={"key_field": "id"},
)

Validate that a field is indexed with the expected tokenizer:

from paradedb.sqlalchemy import indexing

indexing.assert_indexed(engine, Product.category, tokenizer="literal")

Inspect BM25 metadata for a mapped table:

from paradedb.sqlalchemy import indexing

meta = indexing.describe(engine, Product.__table__)

Alembic Operations

Import once in migration environment startup so Alembic registers ParadeDB operations:

import paradedb.sqlalchemy.alembic  # noqa: F401

Use custom operations in migrations:

op.create_bm25_index(
    "products_bm25_idx",
    "products",
    ["id", "description"],
    key_field="id",
    table_schema="public",
)
op.reindex_bm25("products_bm25_idx", concurrently=True, schema="public")
op.drop_bm25_index("products_bm25_idx", if_exists=True, schema="public")

op.reindex_bm25(..., concurrently=True) must run outside a transaction (autocommit block).

Diagnostics Helpers

paradedb.sqlalchemy.diagnostics exposes wrapper functions for ParadeDB diagnostics:

from paradedb.sqlalchemy import diagnostics

indexes = diagnostics.paradedb_indexes(engine)
segments = diagnostics.paradedb_index_segments(engine, "products_bm25_idx")
check = diagnostics.paradedb_verify_index(engine, "products_bm25_idx", sample_rate=0.1)
all_checks = diagnostics.paradedb_verify_all_indexes(engine, schema_pattern="public")

Common Errors

`with_rows requires ORDER BY`

from sqlalchemy import select
from paradedb.sqlalchemy import facets

# Missing order_by(...)
base = select(Product.id).limit(10)
facets.with_rows(base, agg=facets.value_count(field="id"), key_field=Product.id)

`with_rows requires LIMIT`

from sqlalchemy import select
from paradedb.sqlalchemy import facets

# Missing limit(...)
base = select(Product.id).order_by(Product.id)
facets.with_rows(base, agg=facets.value_count(field="id"), key_field=Product.id)

`with_rows requires a ParadeDB predicate`

from sqlalchemy import select
from paradedb.sqlalchemy import facets

# ensure_predicate=False disables automatic search.all(...) injection
facets.with_rows(
    select(Product.id).order_by(Product.id).limit(10),
    agg=facets.value_count(field="id"),
    key_field=Product.id,
    ensure_predicate=False,
)

`tokenizer config requires 'tokenizer'`

from paradedb.sqlalchemy import indexing

indexing.tokenize.from_config({"filters": ["lowercase"]})

Examples

Quick Start: setup, run
Faceted Search: setup, run
Autocomplete: setup, run
More Like This: setup, run
Hybrid Search (RRF): setup, run
RAG: setup, run

Testing

Use repository script helpers:

./scripts/run_unit_tests.sh
./scripts/run_integration_tests.sh
./scripts/run_examples.sh

Documentation

Contributing

See CONTRIBUTING.md for setup, linting, tests, and PR workflow.

Support

If you found a bug or need a feature, open a GitHub Issue.

Community and team support:

License

sqlalchemy-paradedb is licensed under the MIT License.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.6.0

Apr 21, 2026

0.5.0

Apr 16, 2026

0.4.0

Apr 13, 2026

0.3.0

Apr 9, 2026

0.2.0

Apr 1, 2026

This version

0.1.0

Mar 25, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sqlalchemy_paradedb-0.1.0.tar.gz (59.8 kB view details)

Uploaded Mar 25, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

sqlalchemy_paradedb-0.1.0-py3-none-any.whl (30.3 kB view details)

Uploaded Mar 25, 2026 Python 3

File details

Details for the file sqlalchemy_paradedb-0.1.0.tar.gz.

File metadata

Download URL: sqlalchemy_paradedb-0.1.0.tar.gz
Upload date: Mar 25, 2026
Size: 59.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.12

File hashes

Hashes for sqlalchemy_paradedb-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`af3ce10eb0df89083c551529bd8a6d23f3376d5948f09983e028feb03997838f`
MD5	`551c52249e12ca7f975be4d7e8f169cd`
BLAKE2b-256	`5730dda155d4516ecdf4dcfc5f1a83ba7b58de243a006ac3a8ee648ae0f502c4`

See more details on using hashes here.

File details

Details for the file sqlalchemy_paradedb-0.1.0-py3-none-any.whl.

File metadata

Download URL: sqlalchemy_paradedb-0.1.0-py3-none-any.whl
Upload date: Mar 25, 2026
Size: 30.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.12

File hashes

Hashes for sqlalchemy_paradedb-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`455bff272e7387167dbb3972fc4e60659740026b14b1f1409ec33e91607c9399`
MD5	`074f4940d0eb535d376f573886b005f9`
BLAKE2b-256	`5815c6997753c38e9a55ee25a74bc1a475037390aa36da814b082c9eb70a6e00`

See more details on using hashes here.

sqlalchemy-paradedb 0.1.0

Navigation

Verified details

Owner

Unverified details

Project links

Meta

Classifiers

Project description

sqlalchemy-paradedb

Requirements & Compatibility

Installation

Quick Start

Prerequisites

Create a BM25 Index

Query with ParadeDB Predicates

Rows + Facets in a Single Query

Search Patterns

Fuzzy Matching

Phrase Prefix and More-Like-This

Proximity Composition

Indexing and Tokenizers

Alembic Operations

Diagnostics Helpers

Common Errors

with_rows requires ORDER BY

with_rows requires LIMIT

with_rows requires a ParadeDB predicate

tokenizer config requires 'tokenizer'

Examples

Testing

Documentation

Contributing

Support

License

Project details

Verified details

Owner

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`with_rows requires ORDER BY`

`with_rows requires LIMIT`

`with_rows requires a ParadeDB predicate`

`tokenizer config requires 'tokenizer'`