QuickVec: Fast and easy loading and querying of word vectors
Project description
QuickVec
QuickVec is a simple package to make it easy to work with word embeddings. QuickVec supports instantaneous loading of word embeddings after converting them to a native SQLite format. QuickVec is designed to do exactly one thing well: allow you to quickly load word embeddings and look up the vectors for words.
Installation
pip install quickvec
(requires Python 3.6+)
Design philosophy
QuickVec was created to support NERPy, a named entity recognition framework that uses word embeddings for feature generation. NERPy originally used gensim, but the time and memory required to load a word embedding completely into memory was a large performance bottleneck. NERPy then turned to Magnitude, but its conversion process is quite slow, and its installation process caused problems for NERPy users. The NERPy developers created QuickVec based on the design of Magnitude, but with the goal of creating a package with minimal features and dependencies.
FAQ
- How does QuickVec compare to gensim's
KeyedVectors
for loading word embeddings? QuickVec can load word embeddings instantaneously after conversion to its native SQLite-based format, and does not load the whole embedding into memory, making it more memory efficient. However, QuickVec only supports text-format word embeddings files, and in general has far less functionality. - How does QuickVec compare to Magnitude for loading word embeddings? Like Magnitude, QuickVec can instantly load word embeddings after conversion to its native SQLite-based format. QuickVec's conversion process is faster than Magnitude's. However, QuickVec does not support many of Magnitude's features, such as word similarity or generating vectors for out-of-vocabulary words, and QuickVec does not provide pre-converted word embeddings and only supports loading from text-format embeddings.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file quickvec-0.3.0.tar.gz
.
File metadata
- Download URL: quickvec-0.3.0.tar.gz
- Upload date:
- Size: 7.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.3 readme-renderer/34.0 requests/2.23.0 requests-toolbelt/0.9.1 urllib3/1.26.9 tqdm/4.64.0 importlib-metadata/4.8.3 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.4 CPython/3.6.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c4a4ffdb5ec4053f8f03a410e117557d01a192571bb361faf7f36fb00b02d1a6 |
|
MD5 | eceb399fad4316f64005ca40e24aff11 |
|
BLAKE2b-256 | 40b589e02f456efbad77ce36039e223e571e0b2759b0480b6d8cbc782e8d6bc4 |
File details
Details for the file quickvec-0.3.0-py3-none-any.whl
.
File metadata
- Download URL: quickvec-0.3.0-py3-none-any.whl
- Upload date:
- Size: 8.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.3 readme-renderer/34.0 requests/2.23.0 requests-toolbelt/0.9.1 urllib3/1.26.9 tqdm/4.64.0 importlib-metadata/4.8.3 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.4 CPython/3.6.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d65953b7c86625cc59e9ea1d3dc1b8b1b1123d18bea21b484e4571183392fdff |
|
MD5 | 72ca21b76193850e0318680f5b129c5b |
|
BLAKE2b-256 | 3e21aeb1d66b7544d22326699770327dec7ea4b1f3cc7d01d811f14238c4ae4d |