Skip to main content

llama-index embeddings xinference integration

Project description

LlamaIndex Embeddings Integration: Xinference

Xorbits Inference (Xinference) is an open-source platform to streamline the operation and integration of a wide array of AI models.

You can find a list of built-in embedding models in Xinference from its document Embedding Models

To learn more about Xinference in general, visit https://inference.readthedocs.io/en/latest/

Installation

pip install llama-index-embeddings-xinference

Usage

Parameters Description:

  • model_uid: Model uid not the model name, sometimes they may be the same (e.g., bce-embedding-base_v1).
  • base_url: base url of Xinference (e.g., http://localhost:9997).
  • timeout: request timeout set (default 60s).
  • prompt: Text to embed.

Text Embedding Example

from llama_index.embeddings.xinference import XinferenceEmbedding

xi_model_uid = "xinference model uid"
xi_base_url = "xinference base url"

xi_embed = XinferenceEmbedding(
    model_uid=xi_model_uid,
    base_url=xi_base_url,
    timeout=60,
)


def text_embedding(prompt: str):
    embeddings = xi_embed.get_query_embedding(prompt)
    print(embeddings)


async def async_text_embedding(prompt: str):
    embeddings = await xi_embed.aget_query_embedding(prompt)
    print(embeddings)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_index_embeddings_xinference-0.3.1.tar.gz (4.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file llama_index_embeddings_xinference-0.3.1.tar.gz.

File metadata

File hashes

Hashes for llama_index_embeddings_xinference-0.3.1.tar.gz
Algorithm Hash digest
SHA256 98e006da46cc72bb901a5af5ae41693673a8daa2debd03248fcd7e4a12932fc1
MD5 dbca6dbf9c0f92941cfc76aaae079315
BLAKE2b-256 feed4f273cc663ed6a72aac83cc69b02d011562d62d0ce2eabccfde572ad7b1f

See more details on using hashes here.

File details

Details for the file llama_index_embeddings_xinference-0.3.1-py3-none-any.whl.

File metadata

File hashes

Hashes for llama_index_embeddings_xinference-0.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 b231bbfa6e78ea76889f2f2436412cf6c8e4b358e7fd698f76db0609cd64d368
MD5 6fd9af930ddcd050911b0e42b5192d56
BLAKE2b-256 4706e834bf2c732fd703e1473eff2d47ccc3562132b330aa4dd58e3f68c9062a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page