13 projects
splink
Fast probabilistic data linkage at scale
iam-builder
A lil python package to generate iam policies
pydbtools
A python package to query data via amazon athena and bring it into a pandas df using aws-wrangler.
uk_address_matcher
A package for matching UK addresses using a pretrained Splink model
etl_manager
A python package to manage etl processes on AWS
data-linter
data linter
dataengineeringutils3
Data engineering utils Python 3 version
fuzzymatcher
Fuzzy match two pandas dataframes based on one or more common fields
splink-cluster-studio
Create an interactive webpage to visualise clusters
splink-data-generation
Generate synthetic data with a specified data generating process
splink-comparison-viewer
Create an interactive webpage to visualise Splink record comparisons
splink-data-standardisation
gluejobutils
Python 2.7 utils for glue jobs