Fastspeed is a toolkit for serving large language models (LLMs).
Requirements
- Python >= 3.6
Installation
pip install fastspeed
Example
- server
fastspeed --model model_name_or_path
fastspeed --model model_name_or_path --model_hub hf
- client
curl -X 'POST' \
'http://127.0.0.1:8000/chat/completions' \
-H 'accept: application/json' \
-H 'Content-Type: application/json' \
-d '{
"messages": [
{
"role": "user",
"content": "Hello"
}
],
"model": ""
}'
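The curl call above can also be issued from Python. The following is a minimal sketch using only the standard library; it assumes the server is listening on 127.0.0.1:8000 as in the curl example. The helper names `build_chat_request` and `chat` are hypothetical and not part of fastspeed, and because the response schema is not documented here, the parsed JSON is returned unchanged.

```python
import json
import urllib.request

# Assumption: same host and port as the curl example above.
BASE_URL = "http://127.0.0.1:8000"


def build_chat_request(content, model="", base_url=BASE_URL):
    """Build the same POST request that the curl example sends."""
    payload = {
        "messages": [{"role": "user", "content": content}],
        "model": model,
    }
    return urllib.request.Request(
        base_url + "/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "accept": "application/json",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def chat(content, model="", base_url=BASE_URL, timeout=60):
    """Send the request and return the decoded JSON response as-is."""
    request = build_chat_request(content, model=model, base_url=base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the server running, `print(chat("Hello"))` should print whatever JSON the server returns for the request shown in the curl example.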
0.1.0 (2024-08-29)
- FastAPI server on a single GPU, with models loaded from Hugging Face or ModelScope