Skip to main content

Client for Xinference

Project description

Xinference Restful Client

Xinference provides a restful client for managing and accessing models programmatically.

Install

pip install xinference-client

Usage

from xinference_client import RESTfulClient as Client

client = Client("http://localhost:9997")
model_uid = client.launch_model(model_name="chatglm2")
model = client.get_model(model_uid)

chat_history = []
prompt = "What is the largest animal?"
model.chat(
    prompt,
    chat_history=chat_history,
    generate_config={"max_tokens": 1024}
)

Result:

{
  "id": "chatcmpl-8d76b65a-bad0-42ef-912d-4a0533d90d61",
  "model": "56f69622-1e73-11ee-a3bd-9af9f16816c6",
  "object": "chat.completion",
  "created": 1688919187,
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "The largest animal that has been scientifically measured is the blue whale, which has a maximum length of around 23 meters (75 feet) for adult animals and can weigh up to 150,000 pounds (68,000 kg). However, it is important to note that this is just an estimate and that the largest animal known to science may be larger still. Some scientists believe that the largest animals may not have a clear \"size\" in the same way that humans do, as their size can vary depending on the environment and the stage of their life."
      },
      "finish_reason": "None"
    }
  ],
  "usage": {
    "prompt_tokens": -1,
    "completion_tokens": -1,
    "total_tokens": -1
  }
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

xinference-client-0.16.0.tar.gz (44.1 kB view details)

Uploaded Source

Built Distribution

xinference_client-0.16.0-py3-none-any.whl (25.1 kB view details)

Uploaded Python 3

File details

Details for the file xinference-client-0.16.0.tar.gz.

File metadata

  • Download URL: xinference-client-0.16.0.tar.gz
  • Upload date:
  • Size: 44.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.20

File hashes

Hashes for xinference-client-0.16.0.tar.gz
Algorithm Hash digest
SHA256 9b48482c173334fdf973498a3cc1b1a8e9f0410cf2bff942f746c33726cee21c
MD5 f2e0bc150b01fc2875b611f31edaa1e3
BLAKE2b-256 000bbeffa68b56a1f91a4560057f91d3c776faa1e854537e0957943b726a846c

See more details on using hashes here.

File details

Details for the file xinference_client-0.16.0-py3-none-any.whl.

File metadata

File hashes

Hashes for xinference_client-0.16.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c44d66f8dac76e9d89e158b4a9f676eb3eb6954a62e61e9f9b6eed0cc3c691c9
MD5 54912add5334fbecc0a104a53a976e1e
BLAKE2b-256 68cd59410e119bfeb543c04fd01ea636e642ef21ef87b62a1865d6e92ea35b24

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page