Derek Eder

Username    Derek.Eder

35 projects

pupa

scraping framework for municipal data

dedupe

A Python library for accurate and scalable data deduplication and entity resolution

census

A wrapper for the US Census Bureau's API

census-area

Census data for arbitrary geographies

django-councilmatic

Core functions for councilmatic.org family

dedupe-variable-ilcs

Dedupe variable for Illinois Compiled Statute (ILCS) codes

ilcs-parser

Probabilistic parser for tagging data that references the Illinois Compiled Statutes (ILCS).

opencivicdata

python opencivicdata library

csvdedupe

Command line tools for deduplicating and merging csv files

pyhacrf-datamade

Hidden alignment conditional random field, a discriminative string edit distance

affinegap

A Cython implementation of the affine gap string distance

dedupe-hcluster

Hierarchical Clustering Algorithms (Information Theory)

PyLBFGS

LBFGS and OWL-QN optimization algorithms

dedupe-variable-number

Employer variable type for dedupe

django-councilmatic-notifications

Core functions for councilmatic.org family

dedupe-variable-address

Address variable type for dedupe

dedupe-variable-name

Name variable type for dedupe

parseratorvariable

Structured variable type for dedupe

rlr

Case weighted L2 regularized logistic regression

parserator

Create parsers

dedupe-variable-datetime

DateTime variable type for dedupe

datetime-distance

Compare string distances between dates, timestamps, or datetime objects.

simplecosine

Simple cosine distance

probablepeople

Parse romanized names & companies using advanced NLP methods

usaddress

Parse US addresses using conditional random fields

highered

Learnable Edit Distance Using PyHacrf

categorical-distance

Compare two categorical variables

dedupe-variable-person

Variable type for American Person Names

DoubleMetaphone

Python wrapper for C++ Double Metaphone

companyparser

UNKNOWN

probableparsing

Common methods for probable parsers

dedupe-variable-employer

Employer variable type for dedupe

dedupe-variable-fuzzycategory

Fuzzy Category variable type for dedupe

fuzzycategory

A context comparison

canonicalize

Canonicalize a cluster of records
