Skip to main content

llama-index embeddings xinference integration

Project description

LlamaIndex Embeddings Integration: Xinference

Xorbits Inference (Xinference) is an open-source platform to streamline the operation and integration of a wide array of AI models.

You can find a list of built-in embedding models in Xinference from its document Embedding Models

To learn more about Xinference in general, visit https://inference.readthedocs.io/en/latest/

Installation

pip install llama-index-embeddings-xinference

Usage

Parameters Description:

  • model_uid: Model uid not the model name, sometimes they may be the same (e.g., bce-embedding-base_v1).
  • base_url: base url of Xinference (e.g., http://localhost:9997).
  • timeout: request timeout set (default 60s).
  • prompt: Text to embed.

Text Embedding Example

from llama_index.embeddings.xinference import XinferenceEmbedding

xi_model_uid = "xinference model uid"
xi_base_url = "xinference base url"

xi_embed = XinferenceEmbedding(
    model_uid=xi_model_uid,
    base_url=xi_base_url,
    timeout=60,
)


def text_embedding(prompt: str):
    embeddings = xi_embed.get_query_embedding(prompt)
    print(embeddings)


async def async_text_embedding(prompt: str):
    embeddings = await xi_embed.aget_query_embedding(prompt)
    print(embeddings)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_index_embeddings_xinference-0.3.0.tar.gz (4.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file llama_index_embeddings_xinference-0.3.0.tar.gz.

File metadata

File hashes

Hashes for llama_index_embeddings_xinference-0.3.0.tar.gz
Algorithm Hash digest
SHA256 95f203029d6d12ac27d54fabc9e1ccf5b3d7bcd4f799387c718fc1063807e0be
MD5 ba0aac867ec452133d500e8ddc3f1717
BLAKE2b-256 5dc7c00b158fe4ce1608765cf108826d672ab6b3742abb909155b1a8ea83e69a

See more details on using hashes here.

File details

Details for the file llama_index_embeddings_xinference-0.3.0-py3-none-any.whl.

File metadata

File hashes

Hashes for llama_index_embeddings_xinference-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 bd2ba13422bda98a0fc3ef97317fa13dc71e0b4e1587bf0338ce9714c3a430e4
MD5 a8a9d217fd6a5c61626a0b8aef48cc41
BLAKE2b-256 65beeff3690bf8f4958b7b5313a923be9463d896e5e39dd0eb6f79f079c174e5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page