Fastspeed

Fastspeed is a toolkit for serving large language models (LLMs).

Requirements

  • Python >= 3.6

Installation

pip install fastspeed

Example

  • server
fastspeed --model model_name_or_path  # default model_hub=modelscope
fastspeed --model model_name_or_path --model_hub hf
  • client
curl -X 'POST' \
  'http://127.0.0.1:8000/chat/completions' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{
  "messages": [
    {
      "role": "user",
      "content": "Hello"
    }
  ],
  "model": ""
}'
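
The same request can be sent from Python. The following is a minimal sketch using only the standard library, assuming the server from the example above is listening on 127.0.0.1:8000 and replies with a JSON body (the curl call asks for application/json, but this README does not document the response schema). The helper names build_request and chat are illustrative and are not part of fastspeed.

```python
import json
import urllib.request

# Endpoint taken from the curl example; adjust host/port if your server differs.
URL = "http://127.0.0.1:8000/chat/completions"


def build_request(content, model="", url=URL):
    """Build the same POST request as the curl example (illustrative helper)."""
    payload = {
        "messages": [{"role": "user", "content": content}],
        "model": model,  # left empty, as in the curl example
    }
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "accept": "application/json",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def chat(content, model="", url=URL, timeout=60):
    """Send the request and decode the reply as JSON (assumed response format)."""
    request = build_request(content, model, url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Usage (requires a running fastspeed server):
#   print(chat("Hello"))
```

Splitting request construction from sending keeps the payload inspectable without a running server; chat returns whatever JSON the server sends back, unmodified.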

Change Log

0.1.1 (2024-09-15)

  • Support half precision, CUDA, and trust_remote_code=True

0.1.0 (2024-08-29)

  • FastAPI server on a single GPU, with models loaded from Hugging Face or ModelScope
