QuickVec: Fast and easy loading and querying of word vectors

QuickVec

QuickVec is a simple package for working with word embeddings. It supports instantaneous loading of word embeddings once they have been converted to its native SQLite format. QuickVec is designed to do exactly one thing well: let you quickly load word embeddings and look up the vectors for words.
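To illustrate the general idea of keeping vectors in SQLite and reading them on demand, here is a minimal sketch using only the Python standard library. It is not QuickVec's actual API or schema: the table name `embedding`, the float32 blob encoding, and the helper names `convert` and `lookup` are all assumptions made for this example.

```python
import os
import sqlite3
import tempfile
from array import array


def convert(pairs, db_path):
    """Write (word, vector) pairs to a SQLite file (hypothetical schema).

    Each vector is packed as a blob of float32 values.
    """
    conn = sqlite3.connect(db_path)
    with conn:  # commits the transaction on success
        conn.execute(
            "CREATE TABLE IF NOT EXISTS embedding "
            "(word TEXT PRIMARY KEY, vector BLOB)"
        )
        conn.executemany(
            "INSERT OR REPLACE INTO embedding VALUES (?, ?)",
            ((word, array("f", vec).tobytes()) for word, vec in pairs),
        )
    conn.close()


def lookup(conn, word):
    """Fetch one word's vector without loading the rest of the table."""
    row = conn.execute(
        "SELECT vector FROM embedding WHERE word = ?", (word,)
    ).fetchone()
    if row is None:
        raise KeyError(word)
    vec = array("f")
    vec.frombytes(row[0])
    return vec.tolist()


# One-time conversion, then cheap lookups against the file on disk.
path = os.path.join(tempfile.mkdtemp(), "demo.sqlite")
convert([("cat", [0.5, -1.0, 2.0]), ("dog", [1.5, 0.25, -0.75])], path)

conn = sqlite3.connect(path)  # opening the file reads no vectors
cat_vector = lookup(conn, "cat")
dog_vector = lookup(conn, "dog")
conn.close()
```

Because opening the database only opens a file handle, start-up cost in this sketch does not grow with vocabulary size, and memory use is limited to the rows that are actually queried.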

Installation

pip install quickvec (requires Python 3.6+)

Design philosophy

QuickVec was created to support NERPy, a named entity recognition framework that uses word embeddings for feature generation. NERPy originally used gensim, but the time and memory required to load a word embedding completely into memory were a large performance bottleneck. NERPy then turned to Magnitude, but its conversion process is quite slow, and its installation process caused problems for NERPy users. The NERPy developers based QuickVec on the design of Magnitude, with the goal of creating a package with minimal features and dependencies.

FAQ

  • How does QuickVec compare to gensim's KeyedVectors for loading word embeddings? QuickVec can load word embeddings instantaneously after conversion to its native SQLite-based format, and it does not load the whole embedding into memory, which makes it more memory efficient. However, QuickVec only supports text-format word embedding files, and in general it has far less functionality.
  • How does QuickVec compare to Magnitude for loading word embeddings? Like Magnitude, QuickVec can instantly load word embeddings after conversion to its native SQLite-based format, and QuickVec's conversion process is faster than Magnitude's. However, QuickVec does not support many of Magnitude's features, such as word similarity or generating vectors for out-of-vocabulary words. QuickVec also does not provide pre-converted word embeddings, and it only supports loading from text-format embeddings.
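For readers unfamiliar with the text format mentioned above, the following sketch parses it under common conventions: one word per line followed by space-separated floats, with an optional word2vec-style header line giving the vocabulary size and dimensionality (GloVe-style files omit it). The function name `read_text_embedding` and the header heuristic are assumptions for illustration; QuickVec's own parsing may differ.

```python
import io


def read_text_embedding(lines):
    """Yield (word, vector) pairs from an iterable of text lines."""
    for index, line in enumerate(lines):
        fields = line.rstrip().split(" ")
        # Heuristic: a first line of exactly two integers is treated as
        # a word2vec-style header, "<vocab size> <dimensions>".
        if index == 0 and len(fields) == 2 and all(f.isdigit() for f in fields):
            continue
        if len(fields) < 2:
            continue  # skip blank or malformed lines
        yield fields[0], [float(x) for x in fields[1:]]


# A tiny in-memory file stands in for a real embedding file.
sample = io.StringIO("2 3\ncat 0.5 -1.0 2.0\ndog 1.5 0.25 -0.75\n")
pairs = list(read_text_embedding(sample))
```

Since the parser is a generator, rows can be streamed straight into a converter without holding the whole file in memory.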

Download files

Source Distribution

quickvec-0.3.0.tar.gz (7.1 kB)

Built Distribution

quickvec-0.3.0-py3-none-any.whl (8.2 kB, Python 3)
