Embedding models from Jina AI
Project description
llm-embed-jina
Embedding models from Jina AI
Background
Jina AI Launches World's First Open-Source 8K Text Embedding, Rivaling OpenAI introduces these models.
See also Embeddings: What they are and why they matter for background on embeddings and an explanation of the LLM embeddings tool.
Installation
Install this plugin in the same environment as LLM.
llm install llm-embed-jina
Usage
This plugin adds support for three new embedding models:
jina-embeddings-v2-small-en
: 33 million parameters.jina-embeddings-v2-base-en
: 137 million parameters.jina-embeddings-v2-large-en
: 435 million parameters - not yet released, but it will work once it has been released.
The models will be downloaded the first time you try to use them.
See the LLM documentation for everything you can do.
To get started embedding a single string, run the following:
llm embed -m jina-embeddings-v2-small-en -c 'Hello world'
This will output a JSON array of 512 floating point numbers to your terminal.
Development
To set up this plugin locally, first checkout the code. Then create a new virtual environment:
cd llm-embed-jina
python3 -m venv venv
source venv/bin/activate
Now install the dependencies and test dependencies:
llm install -e '.[test]'
To run the tests:
pytest
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for llm_embed_jina-0.1.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 86df8af03955ef4c1bac250b69c572a6f984219c655c1b071a96a4e37ca63391 |
|
MD5 | 4011b80460d282b9e323164a3c10df96 |
|
BLAKE2b-256 | 4c100332ef43aa532cd1bdab8195174641454da52de008e03c031f6e3a6719a0 |