Skip to main content

Client for Xinference

Project description

Xinference Restful Client

Xinference provides a restful client for managing and accessing models programmatically.

Install

pip install xinference-client

Usage

from xinference_client import RESTfulClient as Client

client = Client("http://localhost:9997")
model_uid = client.launch_model(model_name="chatglm2")
model = client.get_model(model_uid)

chat_history = []
prompt = "What is the largest animal?"
model.chat(
    prompt,
    chat_history=chat_history,
    generate_config={"max_tokens": 1024}
)

Result:

{
  "id": "chatcmpl-8d76b65a-bad0-42ef-912d-4a0533d90d61",
  "model": "56f69622-1e73-11ee-a3bd-9af9f16816c6",
  "object": "chat.completion",
  "created": 1688919187,
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "The largest animal that has been scientifically measured is the blue whale, which has a maximum length of around 23 meters (75 feet) for adult animals and can weigh up to 150,000 pounds (68,000 kg). However, it is important to note that this is just an estimate and that the largest animal known to science may be larger still. Some scientists believe that the largest animals may not have a clear \"size\" in the same way that humans do, as their size can vary depending on the environment and the stage of their life."
      },
      "finish_reason": "None"
    }
  ],
  "usage": {
    "prompt_tokens": -1,
    "completion_tokens": -1,
    "total_tokens": -1
  }
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

xinference-client-0.16.3.tar.gz (44.2 kB view details)

Uploaded Source

Built Distribution

xinference_client-0.16.3-py3-none-any.whl (25.1 kB view details)

Uploaded Python 3

File details

Details for the file xinference-client-0.16.3.tar.gz.

File metadata

  • Download URL: xinference-client-0.16.3.tar.gz
  • Upload date:
  • Size: 44.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.20

File hashes

Hashes for xinference-client-0.16.3.tar.gz
Algorithm Hash digest
SHA256 0a9ad1fb1e16ffe83ef420efc2ad9e3f2e97d6fbe1867ba1eb8b12b10468fc24
MD5 06e277f221b719689d32b53af5f193ee
BLAKE2b-256 d6b0b3cacc404b63477c46a1b3cc599a5e8daa63c4c4fd564c4f4a43e11a80ad

See more details on using hashes here.

File details

Details for the file xinference_client-0.16.3-py3-none-any.whl.

File metadata

File hashes

Hashes for xinference_client-0.16.3-py3-none-any.whl
Algorithm Hash digest
SHA256 af0099520027acb90965c0ddacbc2fbbad80f0dae97c503dc9072aec1f998e3f
MD5 dcec3c1030eed2feba6fb8bb9f70f0a5
BLAKE2b-256 2ca0031856b41dd582b00c3cc4585b91699fa83869e99e38ddcc51eb4f185b2c

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page