vokab: named entity linking through a hybrid (lexical and semantic) search engine.
vokab
vokab is a Python module for storing, searching, and matching named entities by name or alias. It is useful for auto-correcting or linking named entities in NLP information extraction use cases, or for cleaning up user-submitted data.
Supports:
- Exact Matching
- Case-insensitive Matching
- Lexical Matching (i.e. Fuzzy String Matching)
- Semantic Matching (i.e. Vector Similarity Searching)
- Hybrid Matching (Lexical/Fuzzy + Semantic/Vector)
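The matching modes listed above can be illustrated with a small, dependency-free sketch. This is not vokab's API or implementation: the `ENTITIES` data, the `match` function, and its `threshold` and `weight` parameters are all hypothetical names chosen for illustration. The lexical score uses `difflib.SequenceMatcher` from the standard library, and the "semantic" score is only a stand-in (cosine similarity over character trigram counts); a real system would presumably use vectors from a learned embedding model.

```python
from collections import Counter
from difflib import SequenceMatcher
from math import sqrt

# Hypothetical toy data: canonical name -> aliases (not shipped with vokab).
ENTITIES = {
    "non-small cell lung cancer": ["NSCLC", "non small cell lung carcinoma"],
    "acetaminophen": ["paracetamol", "Tylenol"],
}


def _trigram_vector(text):
    """Toy stand-in for an embedding: counts of character trigrams."""
    padded = f"  {text.casefold()} "
    return Counter(padded[i:i + 3] for i in range(len(padded) - 2))


def _cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def match(query, entities=ENTITIES, threshold=0.75, weight=0.5):
    """Return (canonical_name, mode, score), or None if nothing scores high enough."""
    # Every searchable surface form, paired with its canonical name.
    forms = [(s, name) for name, aliases in entities.items() for s in [name, *aliases]]

    # 1. Exact matching.
    for surface, name in forms:
        if surface == query:
            return name, "exact", 1.0

    # 2. Case-insensitive matching.
    for surface, name in forms:
        if surface.casefold() == query.casefold():
            return name, "case-insensitive", 1.0

    # 3. Hybrid matching: weighted sum of a lexical and a vector score.
    best = None
    query_vec = _trigram_vector(query)
    for surface, name in forms:
        lexical = SequenceMatcher(None, query.casefold(), surface.casefold()).ratio()
        semantic = _cosine(query_vec, _trigram_vector(surface))
        score = weight * lexical + (1 - weight) * semantic
        if best is None or score > best[2]:
            best = (name, "hybrid", score)
    return best if best is not None and best[2] >= threshold else None


print(match("Tylenol"))        # exact alias hit
print(match("nsclc"))          # case-insensitive alias hit
print(match("acetaminophin"))  # misspelling resolved by the hybrid score
print(match("zzzz"))           # no candidate clears the threshold
```

Trying the cheap exact and case-insensitive checks first means the scored comparison only runs for queries that actually need it. Setting `weight` to 1.0 or 0.0 reduces the last step to purely lexical or purely vector-based matching.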
Getting Started
Installation
Available on PyPI:
pip install vokab
Maintainer
vokab was created by Ian Maurer, the CTO of GenomOncology.
This MIT-licensed open-source project was extracted from our product, which includes the ability to normalize biomedical data for use in precision oncology clinical decision support systems. Contact me to learn more about our product offerings.
Download files
File details
Details for the file vokab-0.0.2.tar.gz.
File metadata
- Download URL: vokab-0.0.2.tar.gz
- Upload date:
- Size: 36.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.10.13
File hashes
Algorithm | Hash digest
---|---
SHA256 | 7706894ebe54f6e6378fb90e66975f4f11bc09e89d69b97aba60192700be6bbf
MD5 | cf06a7f980e7bf0d37164e0c9534afd0
BLAKE2b-256 | 68fff4c5c06d27da60b4eb09f43ad673c6712e3fe5e507203087d6bc5be56e79
File details
Details for the file vokab-0.0.2-py3-none-any.whl.
File metadata
- Download URL: vokab-0.0.2-py3-none-any.whl
- Upload date:
- Size: 32.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.10.13
File hashes
Algorithm | Hash digest
---|---
SHA256 | fac247726fef22cbf66f186c494cefa4ae289b12b9cb9b96cbd49a1ba587ab94
MD5 | 9ae81ab2257bc94f8d92b6675ae8365d
BLAKE2b-256 | a5ac2590a24438c4bbd81edfb2f2bd87072ba543e7c067b7f65ac0d5ce857f95
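If you download either file manually, the SHA256 digests listed above can be checked with Python's standard `hashlib` module. The following is a minimal sketch; the helper names `sha256_of` and `verify_download` are illustrative and not part of vokab, and the digests are copied from the File hashes tables.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the "File hashes" tables for release 0.0.2.
PUBLISHED_SHA256 = {
    "vokab-0.0.2.tar.gz":
        "7706894ebe54f6e6378fb90e66975f4f11bc09e89d69b97aba60192700be6bbf",
    "vokab-0.0.2-py3-none-any.whl":
        "fac247726fef22cbf66f186c494cefa4ae289b12b9cb9b96cbd49a1ba587ab94",
}


def sha256_of(path, chunk_size=65536):
    """Hash a file in chunks so large downloads are not read into memory at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=None):
    """Return True if the file's SHA256 matches the expected digest.

    When `expected` is omitted, the digest is looked up by file name in
    PUBLISHED_SHA256 (a KeyError means the file name is not listed there).
    """
    if expected is None:
        expected = PUBLISHED_SHA256[Path(path).name]
    return sha256_of(path) == expected.lower()
```

For example, `verify_download("vokab-0.0.2.tar.gz")` should return `True` for an intact copy of the source distribution saved in the current directory.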