Skip to main content
Avatar for datamade.wheelbuilder from gravatar.com
Username    datamade.wheelbuilder
Date joined   Joined on

32 projects

dedupe

Last released on

A python library for accurate and scaleable data deduplication and entity-resolution

census

Last released on

A wrapper for the US Census Bureau's API

django-councilmatic

Last released on

Core functions for councilmatic.org family

csvdedupe

Last released on

Command line tools for deduplicating and merging csv files

pyhacrf-datamade

Last released on

Hidden alignment conditional random field, a discriminative string edit distance

django-geomultiplechoice

Last released on

A Django widget to select multiple geographic areas

affinegap

Last released on

A Cython implementation of the affine gap string distance

dedupe-hcluster

Last released on

Hierarchical Clustering Algorithms (Information Theory)

PyLBFGS

Last released on

LBFGS and OWL-QN optimization algorithms

dedupe-variable-number

Last released on

Employer variable type for dedupe

django-councilmatic-notifications

Last released on

Core functions for councilmatic.org family

census-area

Last released on

Census data for arbitrary geographies

dedupe-variable-address

Last released on

Address variable type for dedupe

dedupe-variable-name

Last released on

Name variable type for dedupe

parseratorvariable

Last released on

Structured variable type for dedupe

rlr

Last released on

Case weighted L2 regularized logistic regression

parserator

Last released on

Create parsers

dedupe-variable-datetime

Last released on

DateTime variable type for dedupe

datetime-distance

Last released on

Compare string distances between dates, timestamps, or datetime objects.

simplecosine

Last released on

Simple cosine distance

probablepeople

Last released on

Parse romanized names & companies using advanced NLP methods

usaddress

Last released on

Parse US addresses using conditional random fields

highered

Last released on

Learnable Edit Distance Using PyHacrf

categorical-distance

Last released on

Compare two categorical variables

dedupe-variable-person

Last released on

Variable type for American Person Names

DoubleMetaphone

Last released on

Python wrapper for C++ Double Metaphone

companyparser

Last released on

UNKNOWN

probableparsing

Last released on

Common methods for propbable parsers

dedupe-variable-employer

Last released on

Employer variable type for dedupe

dedupe-variable-fuzzycategory

Last released on

Fuzzy Categoy variable type for dedupe

fuzzycategory

Last released on

A context comparison

canonicalize

Last released on

canonicalize a cluster of records

Supported by

Pingdom Pingdom Monitoring Google Google Object Storage and Download Analytics Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page