A package for the matching of company names
Project description
Name matching
Name matching is a Python package for the matching of company names
Installation
To install the package, download the repo, navigate to the folder and run the setup in pip
pip install .
Usage
To see example usage of the package you can use the notebook folder. An example of the usage is also given below
from name_matching.name_matcher import NameMatcher
# initialise the name matcher
matcher = NameMatcher(column='name',
number_of_matches=3,
legal_suffixes=True,
common_words=False,
top_n=50,
verbose=True)
# adjust the distance metrics to use
matcher.set_distance_metrics(discounted_levenshtein=False,
bag=True,
typo=True,
refined_soundex=True)
# load the data to which the names should be matched
matcher.load_and_process_master_data(df_gleif, transform=True)
# perform the name matching on the data you want matched
matches = matcher.match_names(to_be_matched=unknown_counterparties, column_matching='name')
Contributing
All contributions are welcome. For more substantial changes, please open an issue first to discuss what you would like to change.
License
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
name_matching-0.8.2.tar.gz
(16.9 kB
view hashes)
Built Distribution
Close
Hashes for name_matching-0.8.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 659970fa16f004c666ad1ecf0a40f704447f5c7bc1b17e0e45f6eca6b98e41e0 |
|
MD5 | bde880d83d2ff4fda154aa8f43c8aab0 |
|
BLAKE2b-256 | c7a3d18da6220af48b436786b0c4b6ebb8c74315503558a2135231321b03df73 |