Base abstract data types for Entity Resolution
Project description
matchescu-base
This Python package includes common abstract data types (adt package),
utilities (common package) and generic data extraction algorithms for entity
resolution.
These abstractions are used in other packages such as:
matchescu-reference-extraction: extracts entity references from data sourcesmatchescu-reference-stores: stores entity references efficientlymatchescu-comparison-space-generation: generates the comparison space used for matching or clusteringmatchescu-matching: various methods of scoring the similarity of entity referencesmatchescu-clustering: various methods of scoring the colocation of entity referencesmatchescu-profile-assembly: algorithms used to build concrete entity profiles from specific data structures (tuples, lists or graphs)
On its own, the package may be used to create other structured approaches towards entity resolution, particularly based on the Resolvi reference architecture.
Set up dev environment
- (optional) install pyenv
- install Python 3.11
- install Poetry
- clone this repository
- run a couple of shell commands
$ cd <REPO_ROOT>
$ poetry install
Run tests
$ poetry run pytest
Activate virtual environment
$ poetry shell
-or-
$ source .venv/bin/activate
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file matchescu_base-0.27.0.tar.gz.
File metadata
- Download URL: matchescu_base-0.27.0.tar.gz
- Upload date:
- Size: 7.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.3 CPython/3.13.5 Linux/6.17.0-1010-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e9bbe77ef4458ab23becbd14a927b7837e332164c3798250ccf1b9d088773685
|
|
| MD5 |
45d5745f4472a50afb651f3fd094a7e1
|
|
| BLAKE2b-256 |
4792c58c7c85bd3f329105b27e5a30e5cb8fcbb65ae5d9937baed93697f419dd
|
File details
Details for the file matchescu_base-0.27.0-py3-none-any.whl.
File metadata
- Download URL: matchescu_base-0.27.0-py3-none-any.whl
- Upload date:
- Size: 10.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.3 CPython/3.13.5 Linux/6.17.0-1010-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2edd35b45f59e5f7ddda34fb50ed0857ff9640ac15902b4f30539b15381c22f0
|
|
| MD5 |
0e57d6364fc3602a3000427b362b6806
|
|
| BLAKE2b-256 |
c7a5dd0414a5af808b8f0c0a17c2b165e8ae36ae30b69af594b4bff42c61881e
|