fuzzylink-py
Merges two pandas DataFrames using blocking columns and a fuzzy string-matching column. Fuzzy matching relies on cosine similarity from text embeddings, Jaro-Winkler similarity, and any other user-provided similarity calculations. This is a Python implementation of the R fuzzylink package developed by Joe Ornstein.
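To make the idea of blocking plus fuzzy matching concrete, here is a minimal sketch written with plain pandas. It is not the fuzzylink-py API: the names `jaro`, `jaro_winkler`, and `blocked_fuzzy_merge`, the `threshold` parameter, and the sample data are all hypothetical and chosen for illustration. The sketch uses a hand-written Jaro-Winkler score only and leaves out the embedding-based cosine similarity step.

```python
import pandas as pd


def jaro(s1: str, s2: str) -> float:
    """Jaro similarity between two strings (standard textbook definition)."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0
    # Characters count as matching only within this distance of each other.
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1 = [False] * len1
    matched2 = [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        lo, hi = max(0, i - window), min(len2, i + window + 1)
        for j in range(lo, hi):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0
    # Count matched characters that appear in a different order.
    k = 0
    out_of_order = 0
    for i in range(len1):
        if matched1[i]:
            while not matched2[k]:
                k += 1
            if s1[i] != s2[k]:
                out_of_order += 1
            k += 1
    transpositions = out_of_order // 2
    return (matches / len1 + matches / len2
            + (matches - transpositions) / matches) / 3


def jaro_winkler(s1: str, s2: str, p: float = 0.1) -> float:
    """Jaro similarity boosted for a shared prefix of up to 4 characters."""
    base = jaro(s1, s2)
    prefix = 0
    for a, b in zip(s1[:4], s2[:4]):
        if a != b:
            break
        prefix += 1
    return base + prefix * p * (1 - base)


def blocked_fuzzy_merge(left, right, block_on, match_on,
                        threshold=0.85, similarity=jaro_winkler):
    """Hypothetical helper: exact-join on block_on, fuzzy-match on match_on."""
    # Blocking: only rows that agree on every blocking column become candidates.
    pairs = left.merge(right, on=block_on, suffixes=("_left", "_right"))
    lcol, rcol = f"{match_on}_left", f"{match_on}_right"
    pairs["similarity"] = [similarity(a, b)
                           for a, b in zip(pairs[lcol], pairs[rcol])]
    pairs = pairs[pairs["similarity"] >= threshold]
    # Keep the best-scoring candidate for each left-hand record
    # (assumes left-hand strings are unique within a block).
    best = (pairs.sort_values("similarity", ascending=False)
                 .drop_duplicates(subset=list(block_on) + [lcol]))
    return best.reset_index(drop=True)


# Made-up sample data for illustration only.
left = pd.DataFrame({"state": ["GA", "NY"],
                     "name": ["Jon Smith", "Ann Lee"]})
right = pd.DataFrame({"state": ["GA", "GA", "NY", "NY"],
                      "name": ["John Smith", "Maria Lopez",
                               "Jon Smith", "Anne Lee"]})

linked = blocked_fuzzy_merge(left, right, block_on=["state"], match_on="name")
print(linked)
```

In this sample, the exact string "Jon Smith" appears in the right-hand table under a different state, so blocking on `state` keeps it out of the candidate set and the Georgia record links to "John Smith" instead. Any callable that takes two strings and returns a score can be passed as `similarity`, which mirrors the idea of user-provided similarity calculations.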