18 projects
dataframely
A declarative, polars-native data frame validation library
sqlcompyre
Tool for comparing and inspecting data in SQL databases.
pytest-freeze
Pytest plugin to simplify writing freeze tests.
diffly
Utility package for comparing polars dataframes.
slim-trees
A Python package for efficient pickling of ML models.
polarify
Simplifying conditional Polars Expressions with Python 🐍 🐻‍❄️
cf-job-logs
A utility for fetching and structuring conda-forge Azure CI logs into clean, agent-readable artifacts.
spox
A framework for constructing ONNX computational graphs.
tabulardelta
Simplify table comparisons.
glum
High performance Python GLMs with all the features!
pydiverse.pipetest
An adaptation layer for pydiverse.pipedag that simplifies execution of pipedag steps as unit tests with cache invalidation awareness.
pydiverse
A collection of nicely interoperable libraries for data pipeline orchestration allowing for both SQL target and in-memory operation.
ndonnx
ONNX-backed array library compliant with the Array API standard.
tabmat
Efficient matrix representations for working with tabular data.
pydiverse-colspec
Validate column specifications and constraints for SQL tables and polars data frames.
pydiverse-transform
Pipe-based dataframe manipulation library that can also transform data on SQL databases
pydiverse-common
Common functionality shared between pydiverse libraries
pydiverse-pipedag
A pipeline orchestration library executing tasks within one Python session. It takes care of SQL table (de)materialization, caching, and cache invalidation. Blob storage is supported as well, for example for storing model files.