A super-fast canonical name lookup service

Juditha

A super-fast in-process lookup service for canonical names, backed by tantivy.

juditha exists to tame the noise that follows from Named Entity Recognition: given a huge list of known names (company registries, persons of interest, sanctions lists), it tells you whether a span produced by your NER pipeline corresponds to one of them, even when the casing, accents, token order, or spelling differs.

The implementation uses a pre-populated names database and index. The source data is either FollowTheMoney entities or simply a list of names.
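To make the matching idea concrete, here is a minimal sketch of looking up a noisy span against a list of known names. It is not juditha's implementation or API (juditha is backed by tantivy). It uses only the Python standard library, and the function names `normalize`, `build_index`, and `lookup`, as well as the `cutoff` value, are hypothetical choices made for illustration.

```python
import difflib
import unicodedata
from typing import Optional


def normalize(name: str) -> str:
    """Strip accents and punctuation, casefold, and sort the tokens."""
    # NFKD splits accented characters into a base character plus combining marks.
    decomposed = unicodedata.normalize("NFKD", name)
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Replace anything that is not a letter or digit with a space.
    cleaned = "".join(ch if ch.isalnum() else " " for ch in no_accents)
    # Sorting the tokens makes the key independent of token order.
    return " ".join(sorted(cleaned.casefold().split()))


def build_index(names: list[str]) -> dict[str, str]:
    """Map each normalized key to its canonical name (hypothetical in-memory index)."""
    return {normalize(name): name for name in names}


def lookup(index: dict[str, str], span: str, cutoff: float = 0.85) -> Optional[str]:
    """Return the canonical name for a span, or None if nothing is close enough."""
    key = normalize(span)
    if key in index:
        return index[key]
    # Fall back to a fuzzy comparison to tolerate small spelling differences.
    close = difflib.get_close_matches(key, list(index), n=1, cutoff=cutoff)
    return index[close[0]] if close else None


index = build_index(["Angela Merkel", "Société Générale"])
print(lookup(index, "MERKEL, Angela"))
print(lookup(index, "societe generale"))
print(lookup(index, "Angela Merkl"))
print(lookup(index, "John Smith"))
```

The first three lookups resolve to a canonical name through casing, token-order, accent, and spelling variation; the last one returns `None`. A linear fuzzy scan like this does not scale to registry-sized lists, which is where a real search index is needed.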

Documentation

https://docs.investigraph.dev/lib/juditha

The name

Juditha Dommer was the daughter of a coppersmith and raised seven children, while her husband Johann Pachelbel wrote a canon.

Versioning

To signal compatibility with followthemoney, juditha follows the same major version, currently 4.x.x.

License and copyright

juditha, (C) 2024 investigativedata.io. (C) 2025, 2026 Data and Research Center – DARC. Licensed under AGPLv3 or later. See NOTICE and LICENSE.
