Skip to main content

Client for Xinference

Project description

Xinference Restful Client

Xinference provides a restful client for managing and accessing models programmatically.

Install

pip install xinference-client

Usage

from xinference_client import RESTfulClient as Client

client = Client("http://localhost:9997")
model_uid = client.launch_model(model_name="chatglm2")
model = client.get_model(model_uid)

chat_history = []
prompt = "What is the largest animal?"
model.chat(
    prompt,
    chat_history=chat_history,
    generate_config={"max_tokens": 1024}
)

Result:

{
  "id": "chatcmpl-8d76b65a-bad0-42ef-912d-4a0533d90d61",
  "model": "56f69622-1e73-11ee-a3bd-9af9f16816c6",
  "object": "chat.completion",
  "created": 1688919187,
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "The largest animal that has been scientifically measured is the blue whale, which has a maximum length of around 23 meters (75 feet) for adult animals and can weigh up to 150,000 pounds (68,000 kg). However, it is important to note that this is just an estimate and that the largest animal known to science may be larger still. Some scientists believe that the largest animals may not have a clear \"size\" in the same way that humans do, as their size can vary depending on the environment and the stage of their life."
      },
      "finish_reason": "None"
    }
  ],
  "usage": {
    "prompt_tokens": -1,
    "completion_tokens": -1,
    "total_tokens": -1
  }
}

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

xinference_client-2.10.0.tar.gz (58.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

xinference_client-2.10.0-py3-none-any.whl (40.2 kB view details)

Uploaded Python 3

File details

Details for the file xinference_client-2.10.0.tar.gz.

File metadata

  • Download URL: xinference_client-2.10.0.tar.gz
  • Upload date:
  • Size: 58.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.9.25

File hashes

Hashes for xinference_client-2.10.0.tar.gz
Algorithm Hash digest
SHA256 617381ca9f841a421d376434524f1233d90429181e370c2d40c24d700e8006a2
MD5 76225d88b78448f91dbb5cb1671d9046
BLAKE2b-256 ea576d103b3c09dcf13038074c45d9ce7185492bf35cd3cd80d2fc93e1a49722

See more details on using hashes here.

File details

Details for the file xinference_client-2.10.0-py3-none-any.whl.

File metadata

File hashes

Hashes for xinference_client-2.10.0-py3-none-any.whl
Algorithm Hash digest
SHA256 fc75fde0486437b891b107a27c4159d7fc0fc77a5c2cbaadd705e7574f906a1d
MD5 403eba00636c2fe8a8bf0fa308e8a5d5
BLAKE2b-256 918058307da55ff58f5af3a1dab6ab70c7d9603678161f2ad7de74ce3c21af62

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page