Skip to main content

Extended funcationality for NEural-symbolic Entity Reasoning and Matching

Project description

Neer Match Utilities

neermatch utilities website

License: MIT

neermatch is a framework for entity matching that supports deep learning, symbolic learning, and hybrid approaches, using neural networks, gradient boosting, logit, or probit as classifiers. It is tailored to the needs of economic historians and quantitative economists, but generalizes to any domain requiring entity matching.

The neermatch framework encompasses three packages:

  1. py-neer-match: The Python implementation of the basic functionalities (ANNs only). Learn more
  2. py-neer-utilities: A Python package that provides additional models and functionalities to streamline and support the entity matching workflow. (this project)
  3. r-neer-match: The R implementation of the basic functionalites. Learn more

The project is financially supported by the Deutsche Forschungsgemeinschaft (DFG) under Grant 539465691 as part of the Infrastructure Priority Programme New Data Spaces for the Social Sciences (SPP 2431). Reading the article Karapanagiotis and Liebald (2023) helps to understand the theoretical foundation and design of neermatch (note that the article refers to an earlier version of the framework, previously labeled as mlmatch).

The documentation provides examples of how neermatch may be used. The data used in these examples are available in this folder of the GitHub repository.

Contributors

Marius Liebald (maintainer)

Pantelis Karapanagiotis (contributor)

Erik Hellberg (contributor)

Installation

pip install neer-match
pip install neer-match-utilities

Official Documentation

The documentation is hosted under https://www.marius-liebald.com/py-neer-utilities/index.html

License

The package is distributed under the MIT license.

References

Gram, Dennis, Pantelis Karapanagiotis, Marius Liebald, and Uwe Walz. 2022. “Design and Implementation of a Historical German Firm-Level Financial Database.” ACM Journal of Data and Information Quality (JDIQ) 14 (3): 1–22. https://doi.org/10.1145/3531533.

Karapanagiotis, Pantelis, and Marius Liebald. 2023. “Entity Matching with Similarity Encoding: A Supervised Learning Recommendation Framework for Linking (Big) Data.” http://dx.doi.org/10.2139/ssrn.4541376.

———. 2024a. “NEural-symbolic Entity Reasoning and Matching (Python Neer Match).” https://github.com/pi-kappa-devel/py-neer-match.

———. 2024b. “NEural-symbolic Entity Reasoning and Matching (R Neer Match).” https://github.com/pi-kappa-devel/r-neer-match.

Lin, Tsung-Yi, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. 2017. “Focal Loss for Dense Object Detection.” In Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2980–88. IEEE. https://doi.org/10.1109/ICCV.2017.324.

Lipton, Zachary C., Charles Elkan, and Balakrishnan Naryanaswamy. 2014. “Optimal Thresholding of Classifiers to Maximize F1 Measure.” In ECML PKDD 2014.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

neer_match_utilities-1.0.1.tar.gz (65.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

neer_match_utilities-1.0.1-py3-none-any.whl (61.1 kB view details)

Uploaded Python 3

File details

Details for the file neer_match_utilities-1.0.1.tar.gz.

File metadata

  • Download URL: neer_match_utilities-1.0.1.tar.gz
  • Upload date:
  • Size: 65.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for neer_match_utilities-1.0.1.tar.gz
Algorithm Hash digest
SHA256 a0357c5269e37376ce8c03c9cc658f9393bc50cdfd6b055f55d8eec42502af6a
MD5 1a0b0cbbc5163da41199fb5476ba6a61
BLAKE2b-256 5d1479e08f4e3d36bda789a98867d2abc848f0a59aef137186e0fb0fc335487e

See more details on using hashes here.

File details

Details for the file neer_match_utilities-1.0.1-py3-none-any.whl.

File metadata

File hashes

Hashes for neer_match_utilities-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 5e45b4d541265efebc413a5baf35fd6ed6723f1752cdcf088e2e8ea72471c88a
MD5 21f9b3d3f2b04fb6dff1b858b072c5e8
BLAKE2b-256 572c71c16cc01cb737fab8c07e0a4d3d4636900dc6a6e83569fca498260093ab

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page