Skip to main content

A SQL-y and well-typed ORM for Python

Project description

Embar

Embar logo

A Python ORM with types


Embar is a new ORM for Python with the following goals:

  • Type safety: your type checker should know what arguments are valid, and what is being returned from any call.
  • Type hints: your LSP should be able to guide you towards the query you want to write.
  • SQL-esque: you should be able to write queries simply by knowing SQL and your data model.
  • You should be able to actually just write SQL when you need to.

These are mostly inspired by Drizzle. The Python ecosystem deserves something with similar DX.

Embar supports three database clients:

The async psycopg3 client is recommended. The others are provided mostly for testing and experimenting locally.

NB:

  • Embar uses Template strings and so only supports Python 3.14.
  • Embar is developed with the ty type-checker in mind. Other type-checkers may behave differently. We're pushing the limits of the Python type system here.
  • Embar is alpha and ready for experimentation but not production use.

Documentation: embar.rdrn.me

Roadmap

  • Improve the story around updates. Requires codegen.
  • Create a drizzle-style db.query.users.findMany({ where: ... }) alternative syntax. Requires codegen.
  • Create a migration diffing engine.

Quickstart

The quickstart uses the non-async sqlite client to make an easy example.

If you want to see a fully worked Postgres example, check out the Postgres Quickstart.

Install

NB: Pydantic is optional, see the docs on using Embar without Pydantic.

uv add embar pydantic

Set up database models

# schema.py
from embar.column.common import Integer, Text, integer, text
from embar.config import EmbarConfig
from embar.table import Table

class User(Table):
    # If you don't provide a table name, it is generated from your class name
    embar_config: EmbarConfig = EmbarConfig(table_name="users")

    id: Integer = integer(primary=True)
    # Columns will also generate their own name if not provided
    email: Text = text("user_email", default="text", not_null=True)

class Message(Table):
    id: Integer = integer()
    # Foreign key constraints are easy to add
    user_id: Integer = integer(fk=lambda: User.id)
    content: Text = text()

Create client and apply migrations

In production, you would (probably) use the embar CLI to generate and run migrations. This example uses the utility function to do it all in code.

# main.py
import sqlite3
from embar.db.sqlite import SqliteDb

conn = sqlite3.connect(":memory:")
db = SqliteDb(conn)
db.migrate([User, Message]).run()

Insert some data

user = User(id=1, email="foo@bar.com")
message = Message(id=1, user_id=user.id, content="Hello!")

db.insert(User).values(user).run()

# you can return your inserted data if you want
msg_inserted = db.insert(Message).values(message).returning().run()
assert msg_inserted[0].content == message.content

Query some data

With join, where and group by.

from typing import Annotated
from pydantic import BaseModel
from embar.query.where import Eq, Like, Or

class UserSel(BaseModel):
    id: Annotated[int, User.id]
    messages: Annotated[list[str], Message.content.many()]

users = (
    db.select(UserSel)
    .from_(User)
    .left_join(Message, Eq(User.id, Message.user_id))
    .where(Or(
        Eq(User.id, 1),
        Like(User.email, "foo%")
    ))
    .group_by(User.id)
    .run()
)
# [ UserSel(id=1, messages=['Hello!']) ]

Query some more data

This time with fully nested child tables, and some raw SQL.

from datetime import datetime
from embar.sql import Sql

class UserHydrated(BaseModel):
    email: Annotated[str, User.email]
    messages: Annotated[list[Message], Message.many()]
    date: Annotated[datetime, Sql(t"CURRENT_TIMESTAMP")]

users = (
    db.select(UserHydrated)
    .from_(User)
    .left_join(Message, Eq(User.id, Message.user_id))
    .group_by(User.id)
    .limit(2)
    .run()
)
# [UserHydrated(
#      email='foo@bar.com',
#      messages=[Message(content='Hello!', id=1, user_id=1)],
#      date: datetime(2025, 10, 26, ...)
# )]

See the SQL

Every query produces exactly one... query. And you can always see what's happening under the hood with the .sql() method:

users_query = (
    db.select(UserHydrated)
    .from_(User)
    .left_join(Message, Eq(User.id, Message.user_id))
    .group_by(User.id)
    .sql()
)
users_query.sql
# SELECT 
#     "users"."user_email" AS "email",
#     json_group_array(json_object(
#         'id', "message"."id",
#         'user_id', "message"."user_id",
#         'content', "message"."content"
#     )) AS "messages",
#     CURRENT_TIMESTAMP AS "date"
# FROM "users"
# LEFT JOIN "message" ON "users"."id" = "message"."user_id"
# GROUP BY "users"."id"

Update a row

Unfortunately this requires another model to be defined, as Python doesn't have a Partial[] type.

from typing import TypedDict

class MessageUpdate(TypedDict, total=False):
    id: int
    user_id: int
    content: str

(
    db.update(Message)
    .set(MessageUpdate(content="Goodbye"))
    .where(Eq(Message.id, 1))
    .run()
)

Delete some rows

And return the deleted data if you like.

deleted = db.delete(Message).returning().run()
assert len(deleted) == 1

Add indexes

from embar.constraint import Index

class MessageIndexed(Table):
    embar_config: EmbarConfig = EmbarConfig(
        constraints=[Index("message_idx").on(lambda: MessageIndexed.user_id)]
    )
    user_id: Integer = integer(fk=lambda: User.id)

Run raw SQL

db.sql(t"DELETE FROM {Message}").run()

Or with a return:

class UserId(BaseModel):
    id: Annotated[int, int]

res = (
    db.sql(t"SELECT * FROM {User}")
    .model(UserId)
    .run()
)
# [UserId(id=1)]

Migrations

Properly diffing migrations is not supported yet, but it's in the pipeline.

In the meantime, you have two options:

Embar CLI (work in progress)

This uses which uses an LLM (and your ANTHROPIC_API_KEY) to generate vibe-diffs. You should inspect these before running them.

You can see a working example at example/.

First create a config file embar.toml in your app root:

dialect = "postgresql"
db_url = "postgresql://pg:pw@localhost:3601/db"
schema_path = "app.schema"
migrations_dir = "migrations"  # optional

Simple DDL output

If you just want to output the current schema as SQL (DDL), run:

embar schema

Migration files

Then to generate migrations, run the following and follow the prompts:

embar migrate

Push changes

Or to push directly to your db, run the following. You will be prompted before each change.

embar push

Or use an external schema management tool

Use the migrate() method shown in the quickstart to dump the current DDL to a .sql file.

Then use a schema management tool to manage updates. Some options are:

Contributing

Install uv.

Then:

uv sync

This project uses poethepoet for tasks/scripts.

You'll need Docker installed to run tests.

Format, lint, type-check, test:

uv run poe fmt
           lint
           check
           test

# or
uv run poe all

Or do this:

# Run this or put it in .zshrc/.bashrc/etc
alias poe="uv run poe"

# Then you can just:
poe test

Other ORMs to consider

There seems to be a gap in the Python ORM market.

  • SQLAlchemy (and, by extension, SQLModel) is probably great if you're familiar with it, but too complicated for people who don't live in it
  • PonyORM has no types
  • PugSQL has no types
  • TortoiseORM is probably appealing if you like Django/ActiveRecord
  • Piccolo is cool but not very type-safe (functions accept Any, return dicts)
  • ormar is not very type-safe and still based on SQLAlchemy

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

embar-0.4.0.tar.gz (151.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

embar-0.4.0-py3-none-any.whl (65.8 kB view details)

Uploaded Python 3

File details

Details for the file embar-0.4.0.tar.gz.

File metadata

  • Download URL: embar-0.4.0.tar.gz
  • Upload date:
  • Size: 151.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for embar-0.4.0.tar.gz
Algorithm Hash digest
SHA256 c8a14cac7d2ebcf25e558c86aa14a2accf47bf2cf9c98d5ede165b33c968beff
MD5 9545fa5328560519defc2323f9d0fcfe
BLAKE2b-256 bd877ec18c6a1c739aa609fcee18012a12baccf184a0c5d47fc25b453840f1aa

See more details on using hashes here.

Provenance

The following attestation bundles were made for embar-0.4.0.tar.gz:

Publisher: release.yml on carderne/embar

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file embar-0.4.0-py3-none-any.whl.

File metadata

  • Download URL: embar-0.4.0-py3-none-any.whl
  • Upload date:
  • Size: 65.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for embar-0.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b2938ba15e0092eb78278b0c44adc86b396fa0d864281ae0ad72d87e14f28174
MD5 23a9abe8b37ace2c53dc001226041cfa
BLAKE2b-256 b0104a887cbbfc7e69a262ca15892afbe77337b8509d3475bd45faf7a880c3f5

See more details on using hashes here.

Provenance

The following attestation bundles were made for embar-0.4.0-py3-none-any.whl:

Publisher: release.yml on carderne/embar

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page