Convert Pydantic models to Polars schemas
Project description
🧩 Poldantic
Convert Pydantic models into Polars schemas — and back again.
Poldantic bridges the world of data validation (Pydantic) and blazing-fast computation (Polars). It's ideal for type-safe ETL pipelines, FastAPI response models, and schema round-tripping between Python classes and DataFrames.
✨ Features
- 🔁 Bidirectional conversion — Pydantic models ⇄ Polars schemas
- 🧠 Smart type handling — nested models, containers (
list,set,tuple), enums,Optional,Annotated, and more - 🔒 Thread-safe settings — context-aware configuration using
contextvarsfor concurrent environments - 🛡️ Custom exceptions — clear error messages for conversion failures and type mismatches
- 🛠 Sensible fallbacks — ambiguous types like
Union[int, str]map topl.Object() - 🧪 Thoroughly tested — 47+ tests covering primitives, containers, structs, enums, and round-trip inference
- 📦 Minimal dependencies — Pydantic v2+, Polars ≥ 0.20, Python 3.8+ — production‑ready
- 🎯 Type-safe — full mypy coverage with strict type checking
📦 Install
pip install poldantic
Requires: Python ≥ 3.8, Pydantic ≥ 2.0, Polars ≥ 0.20.0
🚀 Quick Start
from typing import Optional, List
from pydantic import BaseModel
import polars as pl
from poldantic import to_polars_schema, to_pydantic_model
# Define a Pydantic model
class User(BaseModel):
id: int
name: str
tags: Optional[List[str]] = None
# Convert to Polars schema
schema = to_polars_schema(User)
# {'id': Int64, 'name': String, 'tags': List(String)}
# Create a DataFrame
df = pl.DataFrame([
{"id": 1, "name": "Alice", "tags": ["python", "rust"]},
{"id": 2, "name": "Bob", "tags": None}
], schema=schema)
# Convert Polars schema back to Pydantic
polars_schema = df.schema
GeneratedModel = to_pydantic_model(polars_schema, "GeneratedModel")
instance = GeneratedModel(id=3, name="Charlie", tags=["go"])
📖 Usage Guide
💡 See also: Interactive Jupyter notebooks in the
examples/directory:
01_basic_usage.ipynb- Core functionality and round-trip conversions02_advanced_types.ipynb- Nested models, enums, and complex types03_settings_configuration.ipynb- Customizing behavior with settings04_fastapi_integration.ipynb- Building APIs with FastAPI05_etl_pipeline.ipynb- Real-world ETL pipeline example
🔄 Pydantic ➜ Polars
from pydantic import BaseModel
from poldantic.infer_polars import to_polars_schema
from typing import Optional, List
class Person(BaseModel):
name: str
tags: Optional[List[str]]
schema = to_polars_schema(Person)
print(schema)
# {'name': String, 'tags': List(String)}
Initialize a DataFrame with the schema:
import polars as pl
data = [{"name": "Alice", "tags": ["x"]}, {"name": "Bob", "tags": None}]
df = pl.DataFrame(data, schema=schema)
🔄 Polars ➜ Pydantic
import polars as pl
from poldantic.infer_pydantic import to_pydantic_model
schema = {
"name": pl.String,
"tags": pl.List(pl.String())
}
Model = to_pydantic_model(schema) # fields are Optional[...] by default
print(Model(name="Alice", tags=["x", "y"]))
# name='Alice' tags=['x', 'y']
Pass
force_optional=Falseto require fields on the generated model:StrictModel = to_pydantic_model(schema, "StrictModel", force_optional=False)
🧬 Nested Models
from pydantic import BaseModel
from poldantic.infer_polars import to_polars_schema
class Address(BaseModel):
street: str
zip: int
class Customer(BaseModel):
id: int
address: Address
print(to_polars_schema(Customer))
# {'id': Int64, 'address': Struct({'street': String, 'zip': Int64})}
⚡ FastAPI Integration
from fastapi import FastAPI
from pydantic import BaseModel
import polars as pl
from poldantic.infer_polars import to_polars_schema
from poldantic.infer_pydantic import to_pydantic_model
class User(BaseModel):
id: int
name: str
schema = to_polars_schema(User)
UserOut = to_pydantic_model(schema, "UserOut", force_optional=False)
app = FastAPI()
@app.get("/users", response_model=list[UserOut])
def list_users():
df = pl.DataFrame([{"id": 1, "name": "Ada"}, {"id": 2, "name": "Alan"}], schema=schema)
return df.to_dicts()
⚙️ Settings
Both directions expose thread-safe, context-aware settings you can customize.
Pydantic ➜ Polars (poldantic.infer_polars)
from poldantic.infer_polars import settings, Settings, set_settings
# Modify global settings (backward compatible)
settings.use_pl_enum_for_string_enums = True
settings.decimal_precision = 38
settings.decimal_scale = 18
settings.uuid_as_string = True
# Or use immutable, context-aware settings (recommended for concurrent code)
from poldantic.infer_polars import Settings, set_settings, get_settings
custom_settings = Settings(
use_pl_enum_for_string_enums=False,
decimal_precision=10,
decimal_scale=2,
uuid_as_string=False
)
set_settings(custom_settings) # Thread-safe, context-local
Polars ➜ Pydantic (poldantic.infer_pydantic)
from poldantic.infer_pydantic import settings
# Map pl.Duration → datetime.timedelta (True) or int (False)
settings.durations_as_timedelta = True
# Default Decimal instance for reverse mapping (precision/scale)
settings.decimal_precision = 38
settings.decimal_scale = 18
Thread Safety: Settings use
contextvarsfor safe concurrent access. Each context (thread/async task) can have its own settings without interference.
🛡️ Error Handling
Poldantic provides custom exceptions for better error messages:
from poldantic import (
PoldanticError, # Base exception
SchemaConversionError, # Schema conversion failures
UnsupportedTypeError, # Unsupported type encountered
InvalidSchemaError, # Invalid schema provided
)
try:
schema = to_polars_schema(MyModel)
except SchemaConversionError as e:
print(f"Conversion failed: {e}")
print(f"Field: {e.field_name}, Type: {e.field_type}")
📚 Supported Type Mappings
| Python / Pydantic | ➜ Polars dtype | ➜ back to Python |
|---|---|---|
int |
pl.Int64() |
int |
float |
pl.Float64() |
float |
str |
pl.String() |
str |
bool |
pl.Boolean() |
bool |
bytes |
pl.Binary() |
bytes |
datetime.date |
pl.Date() |
datetime.date |
datetime.datetime |
pl.Datetime() |
datetime.datetime |
datetime.time |
pl.Time() |
datetime.time |
datetime.timedelta |
pl.Duration() |
datetime.timedelta |
Decimal |
pl.Decimal(p,s) |
Decimal |
Enum[str] |
pl.Enum([...]) or pl.String() |
str |
list[T], set[T] |
pl.List(inner) |
list[T] |
tuple[T, ...] |
pl.List(inner) |
list[T] |
nested BaseModel |
pl.Struct([...]) |
nested Pydantic model |
Union[int, str], Any |
pl.Object() |
Any |
dict[...] |
pl.Object() |
Any |
Ambiguous unions (e.g.,
Union[int, str]) intentionally map topl.Object()and back totyping.Any.
🧭 Design Notes
- Nullability: From-Polars conversion wraps all fields in
Optional[...]by default; disable withforce_optional=False. - Utf8 vs String: Normalized to
pl.Stringfor forward compatibility. - Structs: Works with tuple fields
("name", dtype)andpolars.Fieldobjects. - Classes vs Instances: Accepts both
pl.Int64andpl.Int64()in schema dicts.
🧪 Development & Testing
Quick Start
# Clone the repository
git clone https://github.com/eddiethedean/poldantic.git
cd poldantic
# Install with development dependencies
pip install -e ".[dev,test]"
# Run tests
pytest
# Run with coverage
pytest --cov=poldantic --cov-report=html
# Type checking
mypy poldantic
# Linting & formatting
ruff check poldantic tests
ruff format poldantic tests
Using Make
make install # Install all dependencies
make test # Run tests with coverage
make lint # Run linters
make format # Format code
make check # Run all checks (lint + type check + tests)
For more details, see CONTRIBUTING.md.
💡 When to use Poldantic
- ✅ You already have Pydantic models and want to create Polars DataFrames with matching schemas
- ✅ You have Polars transformations and need FastAPI response models without manual typing
- ✅ You want type-safe ETL: validate with Pydantic → transform with Polars → publish validated results
- ✅ You need bidirectional schema conversion between validation and computation layers
- ✅ You're building data pipelines that benefit from both Pydantic's validation and Polars' performance
🤝 Contributing
Contributions are welcome! Please see CONTRIBUTING.md for development setup, coding standards, and submission guidelines.
📝 Changelog
See CHANGELOG.md for a detailed history of changes.
📄 License
MIT © 2025 Odos Matthews
🔗 Links
- PyPI: https://pypi.org/project/poldantic/
- GitHub: https://github.com/eddiethedean/poldantic
- Issues: https://github.com/eddiethedean/poldantic/issues
- Pydantic: https://docs.pydantic.dev/
- Polars: https://pola.rs/
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file poldantic-0.3.0.tar.gz.
File metadata
- Download URL: poldantic-0.3.0.tar.gz
- Upload date:
- Size: 19.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
223d205c601ffbe937127a2f5363de8e333384fed51a1f9dcd3acc6b80ce0b9d
|
|
| MD5 |
0fed301a21febcdbc304c9da8c8590b7
|
|
| BLAKE2b-256 |
3a4f6ef1771486fc8a560cf5b47e0d04a6e9faf233cbccc9db4b48e41d042909
|
File details
Details for the file poldantic-0.3.0-py3-none-any.whl.
File metadata
- Download URL: poldantic-0.3.0-py3-none-any.whl
- Upload date:
- Size: 13.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1560955bfcc179ae4b06bf075cadb2dfd3ab7937665cd0a0930daeb725b1d9bb
|
|
| MD5 |
c3c32d11f7525ee6a4714e0251d26014
|
|
| BLAKE2b-256 |
06bfe95bad089f53f3dc7cdb34e654ff841f9d6233336534524519013085c80d
|