
pycel2sql



Convert CEL (Common Expression Language) expressions to SQL WHERE clauses.

Python port of cel2sql (Go).

Installation

pip install pycel2sql

Requires Python 3.12+.

Quick Start

from pycel2sql import convert

# Default dialect is PostgreSQL
sql = convert('name == "alice" && age > 30')
# => name = 'alice' AND age > 30

sql = convert('status == "active" || tags.size() > 0')
# => status = 'active' OR ARRAY_LENGTH(tags, 1) > 0

Dialects

Six SQL dialects are supported:

from pycel2sql import convert
from pycel2sql.dialect import get_dialect

# Using get_dialect() factory
sql = convert('name == "alice"', dialect=get_dialect("postgresql"))
sql = convert('name == "alice"', dialect=get_dialect("mysql"))
sql = convert('name == "alice"', dialect=get_dialect("sqlite"))
sql = convert('name == "alice"', dialect=get_dialect("duckdb"))
sql = convert('name == "alice"', dialect=get_dialect("bigquery"))
sql = convert('name == "alice"', dialect=get_dialect("spark"))

# Or instantiate directly
from pycel2sql import (
    PostgresDialect, MySQLDialect, SQLiteDialect, DuckDBDialect,
    BigQueryDialect, SparkDialect,
)

sql = convert('name == "alice"', dialect=MySQLDialect())

Parameterized Queries

Use convert_parameterized() to produce parameterized SQL with bind placeholders:

from pycel2sql import convert_parameterized, MySQLDialect

result = convert_parameterized('name == "alice" && age > 30')
# result.sql => 'name = $1 AND age > $2'  (PostgreSQL default)
# result.parameters => ['alice', 30]

result = convert_parameterized('name == "alice"', dialect=MySQLDialect())
# result.sql => 'name = ?'
# result.parameters => ['alice']

Placeholder styles per dialect:

Dialect        Placeholder
PostgreSQL     $1, $2, ...
DuckDB         $1, $2, ...
BigQuery       @p1, @p2, ...
MySQL          ? (positional)
SQLite         ? (positional)
Apache Spark   ? (positional)
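The placeholder styles above can be sketched as a small lookup helper. This is a hypothetical illustration, not part of the pycel2sql API:

```python
# Hypothetical helper mirroring the placeholder table above;
# not part of the pycel2sql API.
def placeholder(dialect: str, index: int) -> str:
    """Return the bind placeholder for a 1-based parameter index."""
    styles = {
        "postgresql": "$%d",
        "duckdb": "$%d",
        "bigquery": "@p%d",
        "mysql": "?",
        "sqlite": "?",
        "spark": "?",
    }
    style = styles[dialect]
    return style % index if "%d" in style else style

print(placeholder("postgresql", 2))  # -> $2
print(placeholder("bigquery", 1))    # -> @p1
print(placeholder("mysql", 3))       # -> ?
```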

Conversion Options

json_variables

Declare CEL variable names that correspond to flat JSONB columns. Field access via dot notation or bracket notation emits dialect-specific JSON extraction:

from pycel2sql import convert

# PostgreSQL: dot and bracket notation both produce ->> operators
sql = convert("context.host == 'a'", json_variables={"context"})
# => context->>'host' = 'a'

sql = convert('context["host"] == "a"', json_variables={"context"})
# => context->>'host' = 'a'

# Nested paths: intermediate keys use ->, final key uses ->>
sql = convert("tags.corpus.section == 'x'", json_variables={"tags"})
# => tags->'corpus'->>'section' = 'x'
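The `->` / `->>` layering can be sketched as a small path builder. This is a hypothetical helper for illustration; pycel2sql's actual emitter may differ:

```python
def jsonb_path(column: str, keys: list[str]) -> str:
    """Build a PostgreSQL JSONB text-extraction expression:
    intermediate keys use -> (JSON), the final key uses ->> (text)."""
    if not keys:
        return column
    expr = column
    for key in keys[:-1]:
        expr += f"->'{key}'"
    return expr + f"->>'{keys[-1]}'"

print(jsonb_path("tags", ["corpus", "section"]))  # -> tags->'corpus'->>'section'
print(jsonb_path("context", ["host"]))            # -> context->>'host'
```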

json_variables takes precedence over schema-declared JSON. Comprehension iteration variables shadow json_variables: a colliding name is not treated as JSON inside the comprehension body.

column_aliases

Map CEL identifier names to SQL column names. Useful when database columns use prefixed names while user-facing CEL expressions use clean names:

sql = convert("name == 'a'", column_aliases={"name": "usr_name"})
# => usr_name = 'a'

The alias is validated against the dialect's identifier rules. The original CEL name remains the schema key — alias is output-only.

param_start_index

Shift the placeholder counter for convert_parameterized() when embedding the generated fragment into a larger pre-parameterized query:

result = convert_parameterized(
    "name == 'a' && age > 30",
    param_start_index=5,
)
# result.sql => 'name = $5 AND age > $6'
# result.parameters => ['a', 30]

Values less than 1 are clamped to 1. For positional-? dialects (MySQL, SQLite, Apache Spark), the placeholder text is unchanged and the parameter ordering is preserved.
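A sketch of splicing the shifted fragment into a larger pre-parameterized query. The fragment strings below mirror the example output above rather than calling the library; the surrounding base query is hypothetical:

```python
# The base query already consumes placeholders $1-$4, so a fragment
# converted with param_start_index=5 can be appended without collisions.
base_sql = "SELECT * FROM users WHERE org_id = $1 AND region IN ($2, $3, $4)"
base_params = [42, "us", "eu", "apac"]

fragment_sql = "name = $5 AND age > $6"  # as returned in the example above
fragment_params = ["a", 30]

full_sql = f"{base_sql} AND ({fragment_sql})"
all_params = base_params + fragment_params

print(full_sql)
print(all_params)  # -> [42, 'us', 'eu', 'apac', 'a', 30]
```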

format() per-dialect mapping

CEL's string.format(args) dispatches to dialect-specific SQL:

Dialect        Output
PostgreSQL     FORMAT('...', ...)
BigQuery       FORMAT('...', ...)
SQLite         printf('...', ...)
DuckDB         printf('...', ...)
Apache Spark   format_string('...', ...)
MySQL          raises UnsupportedDialectFeatureError
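The dispatch can be sketched as a simple lookup. This is illustrative only; the real library raises UnsupportedDialectFeatureError rather than ValueError:

```python
# Hypothetical dispatch mirroring the table above; not pycel2sql internals.
_FORMAT_FN = {
    "postgresql": "FORMAT",
    "bigquery": "FORMAT",
    "sqlite": "printf",
    "duckdb": "printf",
    "spark": "format_string",
}

def format_call(dialect: str, template: str, *args: str) -> str:
    try:
        fn = _FORMAT_FN[dialect]
    except KeyError:
        # pycel2sql raises UnsupportedDialectFeatureError here.
        raise ValueError(f"format() not supported for dialect {dialect!r}")
    return f"{fn}('{template}', {', '.join(args)})"

print(format_call("sqlite", "%s-%d", "name", "age"))  # -> printf('%s-%d', name, age)
```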

JSON Fields

Provide schemas to enable JSON field detection:

from pycel2sql import convert, PostgresDialect
from pycel2sql.schema import Schema, FieldSchema

schemas = {
    "usr": Schema([FieldSchema("metadata", is_jsonb=True)])
}

sql = convert(
    'usr.metadata.role == "admin"',
    dialect=PostgresDialect(),
    schemas=schemas,
)
# => usr.metadata->>'role' = 'admin'

Schema Validation

Enable strict validation to catch typos and references to nonexistent fields:

from pycel2sql import convert, InvalidSchemaError
from pycel2sql.schema import Schema, FieldSchema

schemas = {
    "usr": Schema([
        FieldSchema("name"),
        FieldSchema("age", type="integer"),
        FieldSchema("metadata", is_jsonb=True),
    ])
}

# Valid field — works normally
sql = convert('usr.name == "alice"', schemas=schemas, validate_schema=True)

# Unknown field — raises InvalidSchemaError
convert('usr.email == "test"', schemas=schemas, validate_schema=True)
# => InvalidSchemaError: field not found in schema

Validation scope:

  • Validates: table.field references — table must exist in schemas, field must exist in that table's Schema
  • Skips: Nested JSON paths beyond the first field (e.g., usr.metadata.settings.theme only validates that metadata exists, not settings or theme)
  • Skips: Comprehension variables (t in tags.all(t, t > 0))
  • Skips: Bare identifiers without a table prefix (age > 10)

Works with all three public API functions: convert(), convert_parameterized(), and analyze().
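The scope rules above can be sketched as a toy validator. The schema shape here is hypothetical and this is not pycel2sql's implementation:

```python
# Toy validator for the scope rules described above: only table.field
# references are checked; bare identifiers and JSON path segments
# beyond the first field are skipped.
def validate_reference(ref: str, schemas: dict[str, set[str]]) -> bool:
    parts = ref.split(".")
    if len(parts) < 2:
        return True                    # bare identifier: skipped
    table, field = parts[0], parts[1]  # deeper parts (JSON path): skipped
    if table not in schemas:
        return False
    return field in schemas[table]

schemas = {"usr": {"name", "age", "metadata"}}
print(validate_reference("usr.name", schemas))                     # -> True
print(validate_reference("usr.email", schemas))                    # -> False
print(validate_reference("usr.metadata.settings.theme", schemas))  # -> True
print(validate_reference("age", schemas))                          # -> True
```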

Schema Introspection

Auto-discover table schemas from a live database connection instead of building Schema objects manually:

from pycel2sql import convert, introspect
from pycel2sql.dialect.postgres import PostgresDialect
import psycopg

conn = psycopg.connect("postgresql://localhost/mydb")

# Introspect specific tables — detects JSON, JSONB, and array columns
schemas = introspect("postgresql", conn, table_names=["users", "orders"])

sql = convert(
    'users.metadata.role == "admin"',
    dialect=PostgresDialect(),
    schemas=schemas,
)
# => users.metadata->>'role' = 'admin'

Per-dialect functions are also available:

from pycel2sql.introspect import introspect_postgres, introspect_sqlite

# PostgreSQL — detects JSONB, JSON, and ARRAY columns
schemas = introspect_postgres(conn, table_names=["users"], schema_name="public")

# SQLite — explicit json_columns since SQLite has no JSON type
schemas = introspect_sqlite(
    conn,
    table_names=["events"],
    json_columns={"events": ["payload", "tags"]},
)

Per-dialect introspection functions cover five dialects: introspect_postgres, introspect_duckdb, introspect_bigquery, introspect_mysql, and introspect_sqlite. Apache Spark introspection is not provided; construct Schema objects directly.
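For comparison, here is a manual SQLite inspection using the standard library's sqlite3 module, analogous to the column metadata introspect_sqlite gathers (illustrative only; pycel2sql's implementation may differ):

```python
import sqlite3

# Inspect column names and declared types via PRAGMA table_info.
# Note the declared types are TEXT/INTEGER etc.: SQLite has no JSON
# type, which is why introspect_sqlite needs explicit json_columns.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE events (id INTEGER, payload TEXT, tags TEXT)")

def columns(conn: sqlite3.Connection, table: str) -> list[tuple[str, str]]:
    """Return (name, declared_type) pairs for a table's columns."""
    return [(row[1], row[2]) for row in conn.execute(f"PRAGMA table_info({table})")]

print(columns(conn, "events"))
# -> [('id', 'INTEGER'), ('payload', 'TEXT'), ('tags', 'TEXT')]
```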

Supported CEL Features

  • Comparisons: ==, !=, <, <=, >, >=
  • Logic: &&, ||, !
  • Arithmetic: +, -, *, /, %
  • String functions: contains(), startsWith(), endsWith(), matches(), size(), split(), join()
  • Type casting: int(), uint(), double(), string(), bool(), bytes(), timestamp(), duration()
  • Collections: in operator, list literals, size(), index access
  • Timestamps & durations: arithmetic, getFullYear(), getMonth(), getDayOfMonth(), getHours(), getMinutes(), getSeconds()
  • Macros: exists(), all(), exists_one(), map(), filter()
  • Ternary: conditional expressions
  • JSON: field access, has() existence checks, nested paths
  • Regex: matches() with RE2 syntax
  • Structs: struct construction

Index Analysis

Analyze expressions for PostgreSQL index recommendations:

from pycel2sql import analyze

result = analyze('name == "alice" && age > 30')
# result.sql => "name = 'alice' AND age > 30"
# result.recommendations => [IndexRecommendation(...), ...]

Security Limits

Configurable resource limits prevent abuse:

from pycel2sql import convert

sql = convert(
    cel_expr,
    max_depth=100,           # AST recursion depth (default: 100)
    max_output_length=50000, # Max SQL output bytes (default: 50000)
)

Additional built-in limits: comprehension nesting (3 levels), regex pattern length (500 chars), field name length (63 chars), byte array size (10,000).
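A depth guard like the max_depth limit can be sketched as follows (illustrative only, using nested lists to stand in for AST nodes; not pycel2sql's implementation):

```python
def check_depth(node, max_depth: int = 100, _depth: int = 0) -> None:
    """Raise once nesting exceeds max_depth; lists stand in for AST nodes."""
    if _depth > max_depth:
        raise RecursionError(f"expression exceeds max depth {max_depth}")
    if isinstance(node, list):
        for child in node:
            check_depth(child, max_depth, _depth + 1)

check_depth([1, [2, [3]]], max_depth=10)  # shallow: passes silently

deep = 0
for _ in range(50):
    deep = [deep]  # build a 50-level nested list
try:
    check_depth(deep, max_depth=10)
except RecursionError as exc:
    print(exc)  # -> expression exceeds max depth 10
```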

Development

# Setup
uv venv && uv pip install -e ".[dev]"

# Tests
uv run pytest tests/ --ignore=tests/integration -v

# Integration tests (requires Docker/Podman)
uv pip install -e ".[integration]"
uv run pytest tests/integration/ -v

# Lint & type check
uv run ruff check src/ tests/
uv run mypy src/pycel2sql/

License

MIT
