Skip to main content

vokab: named entity linking through hybrid (lexical and semantic) search engine.

Project description

vokab

vokab is a python module for storing, searching and matching named entities on their name or aliases. This project is useful for auto-correcting or linking named entities in NLP information extraction use cases or when cleaning up user submitted data.

Supports:

  • Exact Matching
  • Case-insensitive Matching
  • Lexical Matching (i.e. Fuzzy String Matching)
  • Semantic Matching (i.e. Vector Similarity Searching)
  • Hybrid (Lexical/Fuzzy + Semantic/Vector)

Getting Started

Installation

Available on PyPI:

pip install vokab

Maintainer

vokab was created by Ian Maurer, the CTO of GenomOncology.

This MIT-based open-source project was extracted from our product which includes the ability to normalize biomedical data for use in precision oncology clinical decision support systems. Contact me to learn more about our product offerings.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vokab-0.0.2.tar.gz (36.4 kB view details)

Uploaded Source

Built Distribution

vokab-0.0.2-py3-none-any.whl (32.0 kB view details)

Uploaded Python 3

File details

Details for the file vokab-0.0.2.tar.gz.

File metadata

  • Download URL: vokab-0.0.2.tar.gz
  • Upload date:
  • Size: 36.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.10.13

File hashes

Hashes for vokab-0.0.2.tar.gz
Algorithm Hash digest
SHA256 7706894ebe54f6e6378fb90e66975f4f11bc09e89d69b97aba60192700be6bbf
MD5 cf06a7f980e7bf0d37164e0c9534afd0
BLAKE2b-256 68fff4c5c06d27da60b4eb09f43ad673c6712e3fe5e507203087d6bc5be56e79

See more details on using hashes here.

File details

Details for the file vokab-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: vokab-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 32.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.10.13

File hashes

Hashes for vokab-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 fac247726fef22cbf66f186c494cefa4ae289b12b9cb9b96cbd49a1ba587ab94
MD5 9ae81ab2257bc94f8d92b6675ae8365d
BLAKE2b-256 a5ac2590a24438c4bbd81edfb2f2bd87072ba543e7c067b7f65ac0d5ce857f95

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page