Serve Transformers and Manage GPUs
tfmx
Install
pip install tfmx --upgrade
Commands
Set GPU control state:
gpu_fan -cs a:1
Set GPU power limit:
# M-X GPU-0/1
gpu_pow -pm a:1 && gpu_pow -pl "0:160;1:200"
# M-A GPU-0/1
gpu_pow -pm a:1 && gpu_pow -pl "0,1:160"
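The two power-limit specs above differ only in how GPU indices are grouped. The following is a minimal sketch of how such a spec string could be expanded, not tfmx's own parser. It assumes that `;` separates groups, `,` separates GPU indices within a group, and `:` precedes the value. The helper name `expand_gpu_spec` is hypothetical.

```shell
# Hypothetical helper, not part of tfmx.
# Assumed grammar: groups split on ';', GPU indices split on ',', value after ':'.
# Prints one "index value" line per GPU named in the spec.
expand_gpu_spec() {
    spec=$1
    old_ifs=$IFS
    IFS=';'
    for group in $spec; do
        ids=${group%%:*}      # text before the first ':' (the GPU indices)
        value=${group#*:}     # text after the first ':' (the value)
        IFS=','
        for id in $ids; do
            printf '%s %s\n' "$id" "$value"
        done
        IFS=';'
    done
    IFS=$old_ifs
}
```

Under these assumptions, `expand_gpu_spec "0,1:160"` prints one line for GPU 0 and one for GPU 1, both with the value 160, while `expand_gpu_spec "0:160;1:200"` prints a different value for each GPU.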
Set GPU fan speed:
gpu_fan -cs a:1 && gpu_fan -fs a:100
Set GPU monitor with curve:
# gpu_mon -c "a:30-50/50-65/60-80/75-100;3,7:25-100" -s
Run tei compose and machine:
tei machine run --auto-start --perf-track --on-conflict replace
Run tei benchmark:
tei benchmark run -E "http://localhost:28800" -n 100000
Run qwn compose, machine, and benchmark:
export QWN_MACHINE_URL="http://$QWN_HOST:27800"
qwn compose up --gpu-configs "0"
qwn compose up --gpu-layout uniform-awq
qwn machine run --auto-start -b --on-conflict replace
qwn client chat "Hello, please introduce yourself"
qwn benchmark run -E "$QWN_MACHINE_URL" -n 100
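The commands above rely on `QWN_HOST` being set before `QWN_MACHINE_URL` is exported. Otherwise the URL has an empty host. A small guard, sketched below, can catch that early. The helper name `machine_url` and the fallback to `localhost` are assumptions for illustration, not tfmx behavior.

```shell
# Hypothetical helper, not part of tfmx.
# Build "http://HOST:PORT" and fail early when the host is empty.
machine_url() {
    host=$1
    port=$2
    if [ -z "$host" ]; then
        echo "machine_url: host is empty" >&2
        return 1
    fi
    printf 'http://%s:%s\n' "$host" "$port"
}

# Assumption: fall back to localhost when QWN_HOST is unset or empty.
QWN_MACHINE_URL=$(machine_url "${QWN_HOST:-localhost}" 27800) && export QWN_MACHINE_URL
```

The same helper would work for the QSR commands by passing `QSR_HOST` and port 27900.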
Run qsr compose, machine, and benchmark:
export QSR_MACHINE_URL="http://$QSR_HOST:27900"
qsr compose up --gpu-configs "0"
qsr machine run --auto-start -b --on-conflict replace
qsr client transcribe ./sample.wav
qsr client chat --audio ./sample.wav "Please transcribe into Simplified Chinese"
qsr benchmark run -E "$QSR_MACHINE_URL" -n 20 --audio ./sample.wav
Staged Run Scripts
For repeatable repo-local workflows, use the staged script directories:
bash runs/teis/01_deploy_default.sh
bash runs/teis/02_start_machine.sh
bash runs/teis/03_health_check.sh
bash runs/qwns/01_deploy_uniform.sh
bash runs/qwns/02_start_machine.sh
bash runs/qwns/03_health_check.sh
bash runs/qsrs/01_deploy_default.sh
bash runs/qsrs/02_start_machine.sh
bash runs/qsrs/03_health_check.sh
bash runs/recovery/restart_tei_qwn.sh
- runs/teis/README.md: staged TEI deploy, health, benchmark, cleanup
- runs/qwns/README.md: staged QWN deploy, health, benchmark, cleanup
- runs/qsrs/README.md: staged QSR deploy, health, benchmark, cleanup
- runs/recovery/README.md: joint TEI + QWN recovery and live validation
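The numbered scripts are meant to run in sequence, so it can help to stop as soon as one stage fails instead of continuing to a health check against a broken deployment. The wrapper below is a minimal sketch of that idea. The function name `run_in_order` is hypothetical and is not shipped with the repository.

```shell
# Hypothetical wrapper, not part of the repository.
# Run the given scripts one after another and stop at the first failure.
run_in_order() {
    for script in "$@"; do
        echo "running $script"
        bash "$script" || {
            echo "failed: $script" >&2
            return 1
        }
    done
}
```

For example, `run_in_order runs/teis/01_deploy_default.sh runs/teis/02_start_machine.sh runs/teis/03_health_check.sh` would run the three TEI stages and return a non-zero status if any of them fails.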
Download files
Source Distribution
tfmx-1.3.tar.gz (282.2 kB)
Built Distribution
tfmx-1.3-py3-none-any.whl (309.5 kB)
File details
Details for the file tfmx-1.3.tar.gz.
File metadata
- Download URL: tfmx-1.3.tar.gz
- Upload date:
- Size: 282.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.9.25
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 1f2e203d5d162bf474b4612ca1d9d1e262eceeab7c1cdcf2f59edacc988b02d6 |
| MD5 | 41bcb81c9b82a62d279079b62e41aeda |
| BLAKE2b-256 | 239ecf68ad81b48ec908526b1bd16275b86ab378daf4a1b0d657fe0bcc4d8982 |
File details
Details for the file tfmx-1.3-py3-none-any.whl.
File metadata
- Download URL: tfmx-1.3-py3-none-any.whl
- Upload date:
- Size: 309.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.9.25
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 4323e9f8e89d79ffe0be6e01ddec0b6b049d5b62d6fc96835340bf0e8753f950 |
| MD5 | 03ebd1311596fa7cc923a98e161c42ca |
| BLAKE2b-256 | 98c47ed3604bcbd577da442c3b292d6d5c9ba06d9556b49022f1728d86471b30 |
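The SHA256 digests listed for each file can be used to check a manually downloaded archive before installing it. The following is a minimal sketch that assumes the GNU coreutils `sha256sum` tool is available. The function name `verify_sha256` is hypothetical.

```shell
# Hypothetical helper; assumes GNU coreutils `sha256sum` is on PATH.
# Succeeds only if the file's SHA256 digest equals the expected hex string.
verify_sha256() {
    file=$1
    expected=$2
    actual=$(sha256sum "$file" | cut -d ' ' -f 1)
    [ "$actual" = "$expected" ]
}
```

For example, `verify_sha256 tfmx-1.3.tar.gz 1f2e203d5d162bf474b4612ca1d9d1e262eceeab7c1cdcf2f59edacc988b02d6` returns success only when the local file matches the digest published for the source distribution.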