Skip to main content
Avatar for fgregg from gravatar.com
Username    fgregg

29 projects

vectors2vrt

Last released

Generate a VRT file from GIS vector sources

DoubleMetaphone

Last released

Python wrapper for C++ Double Metaphone

python-crfsuite

Last released

Python binding for CRFsuite

dedupe

Last released

A python library for accurate and scaleable data deduplication and entity-resolution

dedupe-variable-address

Last released

Address variable type for dedupe

dedupe-variable-datetime

Last released

DateTime variable type for dedupe

dedupe-variable-name

Last released

Name variable type for dedupe

parseratorvariable

Last released

Structured variable type for dedupe

pyhacrf-datamade

Last released

Hidden alignment conditional random field, a discriminative string edit distance

PyLBFGS

Last released

LBFGS and OWL-QN optimization algorithms

kubra

Last released

command line tool for downloading utility outage data

chicagorequests

Last released

command line tool for downloading Chicago Open311 data

dedupe-Levenshtein-search

Last released

Search through documents for approximately matching strings. A fork of Matt Anderson's library for MIT licensing

affinegap

Last released

A Cython implementation of the affine gap string distance

rlr

Last released

Case weighted L2 regularized logistic regression

dedupe-hcluster

Last released

Hierarchical Clustering Algorithms (Information Theory)

django-proxy-overrides

Last released

Overridable foreign key fields for Proxy models

dedupe-variable-ilcs

Last released

Dedupe variable for Illinois Compiled Statute (ILCS) codes

csvdedupe

Last released

Command line tools for deduplicating and merging csv files

dedupe-variable-number

Last released

Employer variable type for dedupe

datetime-distance

Last released

Compare string distances between dates, timestamps, or datetime objects.

simplecosine

Last released

Simple cosine distance

highered

Last released

Learnable Edit Distance Using PyHacrf

categorical-distance

Last released

Compare two categorical variables

dedupe-variable-person

Last released

Variable type for American Person Names

dedupe-variable-employer

Last released

Employer variable type for dedupe

dedupe-variable-fuzzycategory

Last released

Fuzzy Categoy variable type for dedupe

fuzzycategory

Last released

A context comparison

canonicalize

Last released

canonicalize a cluster of records

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page