Extended functionality for NEural-symbolic Entity Reasoning and Matching

Neer Match Utilities

License: MIT

neermatch is a framework for entity matching that supports deep learning, symbolic learning, and hybrid approaches, using neural networks, gradient boosting, logit, or probit as classifiers. It is tailored to the needs of economic historians and quantitative economists, but generalizes to any domain requiring entity matching.

The neermatch framework encompasses three packages:

  1. py-neer-match: The Python implementation of the basic functionality (ANNs only).
  2. py-neer-utilities: A Python package that provides additional models and functionality to streamline and support the entity matching workflow (this project).
  3. r-neer-match: The R implementation of the basic functionality.

The project is financially supported by the Deutsche Forschungsgemeinschaft (DFG) under Grant 539465691 as part of the Infrastructure Priority Programme New Data Spaces for the Social Sciences (SPP 2431). The article by Karapanagiotis and Liebald (2023) explains the theoretical foundation and design of neermatch; note that it refers to an earlier version of the framework, previously called mlmatch.

The documentation provides examples of how neermatch may be used. The data used in these examples are available in this folder of the GitHub repository.

Contributors

Marius Liebald (maintainer)

Pantelis Karapanagiotis (contributor)

Erik Hellberg (contributor)

Installation

pip install neer-match
pip install neer-match-utilities
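
To confirm that both distributions were installed, the following is a minimal sketch that relies only on the Python standard library (`importlib.metadata`). The helper name `installed_version` is illustrative and is not part of neermatch; the distribution names are taken from the pip commands above.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(distribution: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(distribution)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Distribution names as used in the pip commands above.
    for name in ("neer-match", "neer-match-utilities"):
        found = installed_version(name)
        print(f"{name}: {found if found is not None else 'not installed'}")
```

Run in the same environment where pip was invoked, the script prints one line per distribution with either its version string or "not installed".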

Official Documentation

The documentation is hosted at https://www.marius-liebald.com/py-neer-utilities/index.html

License

The package is distributed under the MIT license.

References

Bénédict, Gabriel, Vincent Koops, Daan Odijk, and Maarten de Rijke. 2022. “sigmoidF1: A Smooth F1 Score Surrogate Loss for Multilabel Classification.” https://arxiv.org/abs/2108.10566.

Gram, Dennis, Pantelis Karapanagiotis, Marius Liebald, and Uwe Walz. 2022. “Design and Implementation of a Historical German Firm-Level Financial Database.” ACM Journal of Data and Information Quality (JDIQ) 14 (3): 1–22. https://doi.org/10.1145/3531533.

Karapanagiotis, Pantelis, and Marius Liebald. 2023. “Entity Matching with Similarity Encoding: A Supervised Learning Recommendation Framework for Linking (Big) Data.” http://dx.doi.org/10.2139/ssrn.4541376.

———. 2024a. “NEural-symbolic Entity Reasoning and Matching (Python Neer Match).” https://github.com/pi-kappa-devel/py-neer-match.

———. 2024b. “NEural-symbolic Entity Reasoning and Matching (R Neer Match).” https://github.com/pi-kappa-devel/r-neer-match.

Lin, Tsung-Yi, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. 2017. “Focal Loss for Dense Object Detection.” In Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2980–88. IEEE. https://doi.org/10.1109/ICCV.2017.324.
