17 projects
sk-dist-ems
Distributed scikit-learn meta-estimators with PySpark
drifter-ml
Testing for models confirming to the scikit-learn api
stats-can-ext
Interface ontop of StatsCan
zipcode-features
A tool to get features based on census data from zipcodes
zipcode3
USA zipcode programmable database, includes 2020 census data and geometry information.
describer-ml
A set of descriptive statistics and hypothesis tests
causality-ml
Yet Another Causality Library
data-pipeline-tooling
A library for databricks jobs api
randomizer-ml
Training for models conforming to the scikit-learn api
databricks-tooling
A library for databricks jobs api
datalake-copy
A library for datalake copying to databricks
ts-fe
timeseries processing to run any model
backtester
A backtesting for timeseries data in a pandas dataframe
dvc-ml
Data Validation and Version Control
text-processing-ml
A library for processing text for machine learning
spellchecker-ml
Spellchecker that makes use of a hidden markov model
performance-tuner
A set of tools to performance tune your models