21 projects
pytest-freeze
Pytest plugin to simplify writing freeze tests.
datajudge
datajudge lets you assess whether data from a database complies with reference information.
tabulardelta
Simplify table comparisons.
ndonnx
ONNX-backed array library compliant with the Array API standard.
dataframely
A declarative, polars-native data frame validation library
pydiverse
A collection of nicely interoperable libraries for data pipeline orchestration allowing for both SQL target and in-memory operation.
pydiverse-transform
Pipe-based dataframe manipulation library that can also transform data on SQL databases
metalearners
MetaLearners for CATE estimation
slim-trees
A Python package for efficient pickling of ML models.
tabmat
Efficient matrix representations for working with tabular data.
diffly
Utility package for comparing polars dataframes.
sqlcompyre
Tool for comparing and inspecting data in SQL databases.
pytsql
`Pytsql` lets you run MSSQL scripts, which are typically run via GUIs, from the CLI.
pydiverse-common
Common functionality shared between pydiverse libraries
pydiverse.pipetest
An adaptation layer for pydiverse.pipedag that simplifies execution of pipedag steps as unit tests with cache invalidation awareness.
polarify
Simplifying conditional Polars Expressions with Python 🐍 🐻‍❄️
pydiverse-pipedag
A pipeline orchestration library executing tasks within one Python session. It takes care of SQL table (de)materialization, caching, and cache invalidation. Blob storage is supported as well, for example for storing model files.
glum
High performance Python GLMs with all the features!
pydiverse-colspec
Validate column specifications and constraints for SQL tables and polars data frames.
spox
A framework for constructing ONNX computational graphs.
cf-job-logs
A utility for fetching and structuring conda-forge Azure CI logs into clean, agent-readable artifacts.