Skip to main content

QuickVec: Fast and easy loading and querying of word vectors

Project description

QuickVec

build

QuickVec is a simple package to make it easy to work with word embeddings. QuickVec supports instantaneous loading of word embeddings after converting them to a native SQLite format. QuickVec is designed to do exactly one thing well: allow you to quickly load word embeddings and look up the vectors for words.

Installation

pip install quickvec (requires Python 3.6+)

Design philosophy

QuickVec was created to support NERPy, a named entity recognition framework that uses word embeddings for feature generation. NERPy originally used gensim, but the time and memory required to load a word embedding completely into memory was a large performance bottleneck. NERPy then turned to Magnitude, but its conversion process is quite slow, and its installation process caused problems for NERPy users. The NERPy developers created QuickVec based on the design of Magnitude, but with the goal of creating a package with minimal features and dependencies.

FAQ

  • How does QuickVec compare to gensim's KeyedVectors for loading word embeddings? QuickVec can load word embeddings instantaneously after conversion to its native SQLite-based format, and does not load the whole embedding into memory, making it more memory efficient. However, QuickVec only supports text-format word embeddings files, and in general has far less functionality.
  • How does QuickVec compare to Magnitude for loading word embeddings? Like Magnitude, QuickVec can instantly load word embeddings after conversion to its native SQLite-based format. QuickVec's conversion process is faster than Magnitude's. However, QuickVec does not support many of Magnitude's features, such as word similarity or generating vectors for out-of-vocabulary words, and QuickVec does not provide pre-converted word embeddings and only supports loading from text-format embeddings.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

quickvec-0.3.0.tar.gz (7.1 kB view details)

Uploaded Source

Built Distribution

quickvec-0.3.0-py3-none-any.whl (8.2 kB view details)

Uploaded Python 3

File details

Details for the file quickvec-0.3.0.tar.gz.

File metadata

  • Download URL: quickvec-0.3.0.tar.gz
  • Upload date:
  • Size: 7.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.3 readme-renderer/34.0 requests/2.23.0 requests-toolbelt/0.9.1 urllib3/1.26.9 tqdm/4.64.0 importlib-metadata/4.8.3 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.4 CPython/3.6.10

File hashes

Hashes for quickvec-0.3.0.tar.gz
Algorithm Hash digest
SHA256 c4a4ffdb5ec4053f8f03a410e117557d01a192571bb361faf7f36fb00b02d1a6
MD5 eceb399fad4316f64005ca40e24aff11
BLAKE2b-256 40b589e02f456efbad77ce36039e223e571e0b2759b0480b6d8cbc782e8d6bc4

See more details on using hashes here.

File details

Details for the file quickvec-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: quickvec-0.3.0-py3-none-any.whl
  • Upload date:
  • Size: 8.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.3 readme-renderer/34.0 requests/2.23.0 requests-toolbelt/0.9.1 urllib3/1.26.9 tqdm/4.64.0 importlib-metadata/4.8.3 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.4 CPython/3.6.10

File hashes

Hashes for quickvec-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d65953b7c86625cc59e9ea1d3dc1b8b1b1123d18bea21b484e4571183392fdff
MD5 72ca21b76193850e0318680f5b129c5b
BLAKE2b-256 3e21aeb1d66b7544d22326699770327dec7ea4b1f3cc7d01d811f14238c4ae4d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page