Client for Xinference
Xinference RESTful Client
Xinference provides a RESTful client for managing and accessing models programmatically.
Install
pip install xinference-client
Usage
from xinference_client import RESTfulClient as Client

# Connect to the Xinference server endpoint.
client = Client("http://localhost:9997")

# Launch a model, then fetch a handle to it by its UID.
model_uid = client.launch_model(model_name="chatglm2")
model = client.get_model(model_uid)

# Send a prompt, starting from an empty chat history.
chat_history = []
prompt = "What is the largest animal?"
model.chat(
    prompt,
    chat_history=chat_history,
    generate_config={"max_tokens": 1024},
)
Result:
{
  "id": "chatcmpl-8d76b65a-bad0-42ef-912d-4a0533d90d61",
  "model": "56f69622-1e73-11ee-a3bd-9af9f16816c6",
  "object": "chat.completion",
  "created": 1688919187,
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "The largest animal that has been scientifically measured is the blue whale, which has a maximum length of around 23 meters (75 feet) for adult animals and can weigh up to 150,000 pounds (68,000 kg). However, it is important to note that this is just an estimate and that the largest animal known to science may be larger still. Some scientists believe that the largest animals may not have a clear \"size\" in the same way that humans do, as their size can vary depending on the environment and the stage of their life."
      },
      "finish_reason": "None"
    }
  ],
  "usage": {
    "prompt_tokens": -1,
    "completion_tokens": -1,
    "total_tokens": -1
  }
}
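To work with a result like the one above, the reply text can be read from the first entry in "choices". The sketch below assumes the call returns the result as a Python dict with this shape. The helper names `extract_reply` and `record_turn` are hypothetical and not part of the client. The role/content message format used for the history is also an assumption. The sample content is shortened from the result shown above.

```python
# Sample response with the same structure as the result above
# (the "content" string is shortened for brevity).
response = {
    "id": "chatcmpl-8d76b65a-bad0-42ef-912d-4a0533d90d61",
    "object": "chat.completion",
    "choices": [
        {
            "index": 0,
            "message": {
                "role": "assistant",
                "content": "The largest animal that has been scientifically "
                           "measured is the blue whale",
            },
            "finish_reason": "None",
        }
    ],
    "usage": {"prompt_tokens": -1, "completion_tokens": -1, "total_tokens": -1},
}


def extract_reply(response: dict) -> str:
    """Return the assistant text from the first choice (hypothetical helper)."""
    return response["choices"][0]["message"]["content"]


def record_turn(chat_history: list, prompt: str, response: dict) -> list:
    """Append the user prompt and the assistant reply to the history.

    Assumption: the history is a list of {"role", "content"} dicts.
    """
    chat_history.append({"role": "user", "content": prompt})
    chat_history.append({"role": "assistant", "content": extract_reply(response)})
    return chat_history


chat_history = record_turn([], "What is the largest animal?", response)
print(extract_reply(response))
```

Keeping the history as a plain list means it could be passed back as `chat_history` on a follow-up call, if the server accepts that format.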
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
xinference-client-0.16.1.tar.gz (44.2 kB)
Built Distribution
xinference_client-0.16.1-py3-none-any.whl (25.1 kB)
File details
Details for the file xinference-client-0.16.1.tar.gz.
File metadata
- Download URL: xinference-client-0.16.1.tar.gz
- Upload date:
- Size: 44.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.20
File hashes
Algorithm | Hash digest
---|---
SHA256 | e705ab167bdad7b27a5596a4c3fe26c9e570b459f905b3e9f2afe1d27bf61ad3
MD5 | b5202473f13c3153fd5dd96b87d0e036
BLAKE2b-256 | 98ea9914bddfc692595ed23b8d85a86f1c9b605a7ef039ee8cf20f0b7ac49a9f
File details
Details for the file xinference_client-0.16.1-py3-none-any.whl.
File metadata
- Download URL: xinference_client-0.16.1-py3-none-any.whl
- Upload date:
- Size: 25.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.20
File hashes
Algorithm | Hash digest
---|---
SHA256 | 88788dc341dfb8539adc90e156b1e938af5ce6dad5b77508c2c850d5a8c4cdd6
MD5 | 048013a8e5268d24466c2e25b789db2e
BLAKE2b-256 | 7455ccbab7ce9060a1b8bd8226d5c1a346e868776bb5b3401ab3287f72fe7336
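
The published digests can be used to check a downloaded file before installing it. The following is a minimal sketch using only Python's standard `hashlib` module. The function names `sha256_of_file` and `matches_published_digest` are hypothetical, and the file path in the usage note is assumed to point at a local download.

```python
import hashlib


def sha256_of_file(path: str, chunk_size: int = 65536) -> str:
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path: str, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, `matches_published_digest("xinference_client-0.16.1-py3-none-any.whl", "88788dc341dfb8539adc90e156b1e938af5ce6dad5b77508c2c850d5a8c4cdd6")` should return `True` for an intact copy of the wheel listed above.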