Skip to main content

Client for Xinference

Project description

Xinference Restful Client

Xinference provides a restful client for managing and accessing models programmatically.

Install

pip install xinference-client

Usage

from xinference_client import RESTfulClient as Client

client = Client("http://localhost:9997")
model_uid = client.launch_model(model_name="chatglm2")
model = client.get_model(model_uid)

chat_history = []
prompt = "What is the largest animal?"
model.chat(
    prompt,
    chat_history=chat_history,
    generate_config={"max_tokens": 1024}
)

Result:

{
  "id": "chatcmpl-8d76b65a-bad0-42ef-912d-4a0533d90d61",
  "model": "56f69622-1e73-11ee-a3bd-9af9f16816c6",
  "object": "chat.completion",
  "created": 1688919187,
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "The largest animal that has been scientifically measured is the blue whale, which has a maximum length of around 23 meters (75 feet) for adult animals and can weigh up to 150,000 pounds (68,000 kg). However, it is important to note that this is just an estimate and that the largest animal known to science may be larger still. Some scientists believe that the largest animals may not have a clear \"size\" in the same way that humans do, as their size can vary depending on the environment and the stage of their life."
      },
      "finish_reason": "None"
    }
  ],
  "usage": {
    "prompt_tokens": -1,
    "completion_tokens": -1,
    "total_tokens": -1
  }
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

xinference-client-0.16.1.tar.gz (44.2 kB view details)

Uploaded Source

Built Distribution

xinference_client-0.16.1-py3-none-any.whl (25.1 kB view details)

Uploaded Python 3

File details

Details for the file xinference-client-0.16.1.tar.gz.

File metadata

  • Download URL: xinference-client-0.16.1.tar.gz
  • Upload date:
  • Size: 44.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.20

File hashes

Hashes for xinference-client-0.16.1.tar.gz
Algorithm Hash digest
SHA256 e705ab167bdad7b27a5596a4c3fe26c9e570b459f905b3e9f2afe1d27bf61ad3
MD5 b5202473f13c3153fd5dd96b87d0e036
BLAKE2b-256 98ea9914bddfc692595ed23b8d85a86f1c9b605a7ef039ee8cf20f0b7ac49a9f

See more details on using hashes here.

File details

Details for the file xinference_client-0.16.1-py3-none-any.whl.

File metadata

File hashes

Hashes for xinference_client-0.16.1-py3-none-any.whl
Algorithm Hash digest
SHA256 88788dc341dfb8539adc90e156b1e938af5ce6dad5b77508c2c850d5a8c4cdd6
MD5 048013a8e5268d24466c2e25b789db2e
BLAKE2b-256 7455ccbab7ce9060a1b8bd8226d5c1a346e868776bb5b3401ab3287f72fe7336

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page