Skip to main content

Extended funcationality for NEural-symbolic Entity Reasoning and Matching

Project description

Neer Match Utilities

neermatch utilities website

License: MIT

The framework neermatch provides a set of tools for entity matching based on deep learning, symbolic learning, and a hybrid approach combining both deep and symbolic learning. It is designed to support easy set-up, training, and inference of entity matching models. The package provides automated fuzzy logic reasoning (by refutation) functionality that can be used to examine the significance of particular associations between fields in an entity matching task.

The neermatch framework encompasses three packages:

  1. py-neer-match: The Python implementation of the basic functionalities. Learn more
  2. py-neer-utilities: A Python package that provides additional functionalities to streamline and support the entity matching workflow. (this project)
  3. r-neer-match: The R implementation of the basic functionalites. Learn more

The project is financially supported by the Deutsche Forschungsgemeinschaft (DFG) under Grant 539465691 as part of the Infrastructure Priority Programme New Data Spaces for the Social Sciences (SPP 2431). Reading the article Karapanagiotis and Liebald (2023) helps to understand the theoretical foundation and design of neermatch (note that the article refers to an earlier version of the framework, previously labeled as mlmatch).

The documentation provides examples of how neermatch may be used. The data used in these examples are available in this folder of the GitHub repository.

Contributors

Marius Liebald (maintainer)

Pantelis Karapanagiotis (contributor)

Installation

pip install neer-match
pip install neer-match-utilities

Official Documentation

The documentation is hosted under https://www.marius-liebald.com/py-neer-utilities/index.html

License

The package is distributed under the MIT license.

References

Bénédict, Gabriel, Vincent Koops, Daan Odijk, and Maarten de Rijke. 2022. “sigmoidF1: A Smooth F1 Score Surrogate Loss for Multilabel Classification.” https://arxiv.org/abs/2108.10566.

Gram, Dennis, Pantelis Karapanagiotis, Marius Liebald, and Uwe Walz. 2022. “Design and Implementation of a Historical German Firm-Level Financial Database.” ACM Journal of Data and Information Quality (JDIQ) 14 (3): 1–22. https://doi.org/10.1145/3531533.

Karapanagiotis, Pantelis, and Marius Liebald. 2023. “Entity Matching with Similarity Encoding: A Supervised Learning Recommendation Framework for Linking (Big) Data.” http://dx.doi.org/10.2139/ssrn.4541376.

———. 2024a. “NEural-symbolic Entity Reasoning and Matching (Python Neer Match).” https://github.com/pi-kappa-devel/py-neer-match.

———. 2024b. “NEural-symbolic Entity Reasoning and Matching (R Neer Match).” https://github.com/pi-kappa-devel/r-neer-match.

Lin, Tsung-Yi, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. 2017. “Focal Loss for Dense Object Detection.” In Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2980–88. IEEE. https://doi.org/10.1109/ICCV.2017.324.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

neer_match_utilities-1.0.26b0.tar.gz (47.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

neer_match_utilities-1.0.26b0-py3-none-any.whl (40.5 kB view details)

Uploaded Python 3

File details

Details for the file neer_match_utilities-1.0.26b0.tar.gz.

File metadata

  • Download URL: neer_match_utilities-1.0.26b0.tar.gz
  • Upload date:
  • Size: 47.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.14

File hashes

Hashes for neer_match_utilities-1.0.26b0.tar.gz
Algorithm Hash digest
SHA256 5647389c76cd8182c427255c0cf4d59497dde139702b0d2b1b6023e9596f3c2a
MD5 cbf574f4caecacf27652ae7f2cc89b79
BLAKE2b-256 701e3831ed38fdf482a94ff2d2fcd2acb7d128ad6210e47c44231d330ac0730f

See more details on using hashes here.

File details

Details for the file neer_match_utilities-1.0.26b0-py3-none-any.whl.

File metadata

File hashes

Hashes for neer_match_utilities-1.0.26b0-py3-none-any.whl
Algorithm Hash digest
SHA256 0e58764924839de2038bb4029f77f0062b3dc51548109bfe81cd231a67322fe8
MD5 7467b65bb9b382cab58c30bd94f52334
BLAKE2b-256 4c5bbb6db72ffd0529fca3246b75e98d92869140703885fdb1601704e5166574

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page