12 projects
pydbtools
A Python package to query data via Amazon Athena and load it into a pandas DataFrame using aws-wrangler.
data-linter
Data linter
splink
Fast probabilistic data linkage at scale
etl_manager
A Python package to manage ETL processes on AWS
iam-builder
A small Python package to generate IAM policies
dataengineeringutils3
Data engineering utilities, Python 3 version
fuzzymatcher
Fuzzy match two pandas dataframes based on one or more common fields
splink-cluster-studio
Create an interactive webpage to visualise clusters
splink-data-generation
Generate synthetic data with a specified data generating process
splink-comparison-viewer
Create an interactive webpage to visualise Splink record comparisons
splink-data-standardisation
gluejobutils
Python 2.7 utilities for AWS Glue jobs