Serve Transformers and Manage GPUs
tfmx
Install
pip install tfmx --upgrade
Commands
Set GPU control state:
gpu_fan -cs a:1
Set GPU power limit:
# M-X GPU-0/1
gpu_pow -pm a:1 && gpu_pow -pl "0:160;1:200"
# M-A GPU-0/1
gpu_pow -pm a:1 && gpu_pow -pl "0,1:160"
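The two power-limit examples differ only in how GPUs are grouped. The sketch below is not part of tfmx. It is a hypothetical helper that assumes `;` separates groups, `,` separates GPU indices, and `:` separates the indices from the value. It only prints the expansion so the two spellings can be compared before running `gpu_pow`.

```shell
# Hypothetical helper, not a tfmx command.
# Assumed syntax: "<idx>[,<idx>...]:<value>[;<idx>...:<value>...]"
expand_gpu_spec() {
  # Split the spec into groups on ';', then split each group on ':'.
  printf '%s\n' "$1" | tr ';' '\n' | while IFS=: read -r idxs value; do
    # Split the index list on ',' and print one line per GPU index.
    printf '%s\n' "$idxs" | tr ',' '\n' | while read -r idx; do
      if [ -n "$idx" ]; then
        echo "gpu=$idx value=$value"
      fi
    done
  done
}

# expand_gpu_spec "0:160;1:200"   prints gpu=0 value=160, then gpu=1 value=200
# expand_gpu_spec "0,1:160"       prints gpu=0 value=160, then gpu=1 value=160
```

Under these assumptions, `0,1:160` is shorthand for `0:160;1:160`.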
Set GPU fan speed:
gpu_fan -cs a:1 && gpu_fan -fs a:100
Set GPU monitor with curve:
# gpu_mon -c "a:30-50/50-65/60-80/75-100;3,7:25-100" -s
Run tei compose and machine:
tei machine run --auto-start --perf-track --on-conflict replace
Run tei benchmark:
tei benchmark run -E "http://localhost:28800" -n 100000
Run qwn compose, machine, and benchmark:
export QWN_MACHINE_URL="http://$QWN_HOST:27800"
qwn compose up --gpu-configs "0"
qwn compose up --gpu-layout uniform-awq
qwn machine run --auto-start -b --on-conflict replace
qwn client chat "Hello, please introduce yourself"
qwn benchmark run -E "$QWN_MACHINE_URL" -n 100
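The `export` line above relies on `QWN_HOST` already being set. If it is empty, the URL becomes `http://:27800`. A minimal sketch of a guard follows, assuming `localhost` is an acceptable fallback. The function name `qwn_machine_url` is hypothetical and not part of tfmx. The port 27800 is taken from the example above.

```shell
# Hypothetical helper, not a tfmx command.
# Builds the machine URL from an explicit host, else $QWN_HOST, else localhost.
qwn_machine_url() {
  host="${1:-${QWN_HOST:-localhost}}"
  port="${2:-27800}"
  printf 'http://%s:%s\n' "$host" "$port"
}

# Usage:
#   export QWN_MACHINE_URL="$(qwn_machine_url)"
```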
Staged Run Scripts
For repeatable repo-local workflows, use the staged script directories:
bash runs/teis/01_deploy_default.sh
bash runs/teis/02_start_machine.sh
bash runs/teis/03_health_check.sh
bash runs/qwns/01_deploy_uniform.sh
bash runs/qwns/02_start_machine.sh
bash runs/qwns/03_health_check.sh
bash runs/recovery/restart_tei_qwn.sh
- runs/teis/README.md: staged TEI deploy, health, benchmark, cleanup
- runs/qwns/README.md: staged QWN deploy, health, benchmark, cleanup
- runs/recovery/README.md: joint TEI + QWN recovery and live validation
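The staged scripts are numbered, which suggests they are meant to run in order. The wrapper below is a sketch of one way to chain them so that a failed stage stops the sequence. `run_stages` is a hypothetical helper, not something shipped with tfmx, and it assumes each script signals failure with a non-zero exit status.

```shell
# Hypothetical helper, not a tfmx command.
# Runs each script in the order given and stops at the first failure.
run_stages() {
  for stage_script in "$@"; do
    if [ ! -f "$stage_script" ]; then
      echo "missing stage: $stage_script" >&2
      return 1
    fi
    echo "==> $stage_script"
    if ! bash "$stage_script"; then
      echo "stage failed: $stage_script" >&2
      return 1
    fi
  done
}

# Usage, from the repository root:
#   run_stages runs/teis/01_deploy_default.sh \
#              runs/teis/02_start_machine.sh \
#              runs/teis/03_health_check.sh
```

Stopping early avoids running a health check against a deployment that never came up.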
Download files
Source Distribution
tfmx-1.1.4.tar.gz (210.9 kB)
Built Distribution
tfmx-1.1.4-py3-none-any.whl (232.4 kB)
File details
Details for the file tfmx-1.1.4.tar.gz.
File metadata
- Download URL: tfmx-1.1.4.tar.gz
- Upload date:
- Size: 210.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.9.25
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 2686de60ee25d313a47f491ad56f7f956654c3749f464bf22b6555befa8a157f |
| MD5 | 9ea99f71844d36ac5a592f83c2c75e58 |
| BLAKE2b-256 | 5c0dfd844ac05cfed35326e83ca53ced3ced6548c2aaeb0e3a41d346a8bda344 |
File details
Details for the file tfmx-1.1.4-py3-none-any.whl.
File metadata
- Download URL: tfmx-1.1.4-py3-none-any.whl
- Upload date:
- Size: 232.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.9.25
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 740e673692f7e18311608e336cea93a3fe058903a698bc61a275b2a811a91a7b |
| MD5 | 971e33817aa4d7d3343416b9ff31c1fd |
| BLAKE2b-256 | 0f9bc71f0438fd3d5c71cbaa7dabaf6b7e5b0e958c2d9598680371ac60a95788 |