Serve Transformers and Manage GPUs
Project description
tfmx
Install
pip install tfmx --upgrade
Commands
Set GPU control state:
gpu_fan -cs a:1
Set GPU power limit:
# M-X GPU-0/1
gpu_pow -pm a:1 && gpu_pow -pl "0:160;1:200"
# M-A GPU-0/1
gpu_pow -pm a:1 && gpu_pow -pl "0,1:160"
Set GPU fan speed:
gpu_fan -cs a:1 && gpu_fan -fs a:100
Set GPU monitor with curve:
# gpu_mon -c "a:30-50/50-65/60-80/75-100;3,7:25-100" -s
Run tei compose and machine:
tei machine run --auto-start --perf-track --on-conflict replace
Run tei benchmark:
tei benchmark run -E "http://localhost:28800" -n 100000
Run qwn compose, machine, and benchmark:
export QWN_MACHINE_URL="http://$QWN_HOST:27800"
qwn compose up --gpu-configs "0"
qwn compose up --gpu-layout uniform-awq
qwn machine run --auto-start -b --on-conflict replace
qwn client chat "你好,请做个自我介绍"
qwn benchmark run -E "$QWN_MACHINE_URL" -n 100
Run qsr compose, machine, and benchmark:
export QSR_MACHINE_URL="http://$QSR_HOST:27900"
qsr compose up --gpu-configs "0"
qsr machine run --auto-start -b --on-conflict replace
qsr client transcribe ./sample.wav
qsr client chat --audio ./sample.wav "请转写为简体中文"
qsr benchmark run -E "$QSR_MACHINE_URL" -n 20 --audio ./sample.wav
Staged Run Scripts
For repeatable repo-local workflows, use the staged script directories:
bash runs/teis/01_deploy_default.sh
bash runs/teis/02_start_machine.sh
bash runs/teis/03_health_check.sh
bash runs/qwns/01_deploy_uniform.sh
bash runs/qwns/02_start_machine.sh
bash runs/qwns/03_health_check.sh
bash runs/qsrs/01_deploy_default.sh
bash runs/qsrs/02_start_machine.sh
bash runs/qsrs/03_health_check.sh
bash runs/recovery/restart_tei_qwn.sh
runs/teis/README.md: staged TEI deploy, health, benchmark, cleanupruns/qwns/README.md: staged QWN deploy, health, benchmark, cleanupruns/qsrs/README.md: staged QSR deploy, health, benchmark, cleanupruns/recovery/README.md: joint TEI + QWN recovery and live validation
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
tfmx-1.2.tar.gz
(259.0 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
tfmx-1.2-py3-none-any.whl
(285.4 kB
view details)
File details
Details for the file tfmx-1.2.tar.gz.
File metadata
- Download URL: tfmx-1.2.tar.gz
- Upload date:
- Size: 259.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.9.25
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3ede0db34dc5d6bb34d094eac4af3dc8146f04c1d686f8159c84ba109128e8bc
|
|
| MD5 |
cc51628e4d31a8922cc8ec0fc4975697
|
|
| BLAKE2b-256 |
1b3928ebdcf6bc11706ea4c7cd491c91223a7293be5b6d226f25c4776b9f9a69
|
File details
Details for the file tfmx-1.2-py3-none-any.whl.
File metadata
- Download URL: tfmx-1.2-py3-none-any.whl
- Upload date:
- Size: 285.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.9.25
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9d8939ab39f473cda7cc8bb1160929a5fab80413e101436de078727e10e47b8a
|
|
| MD5 |
c0bb49f2d48e052e086311ac35114dcc
|
|
| BLAKE2b-256 |
06f5e5bab26f43b7273804452e5562e8f05d5d7ee5f2f3079aa1c6e9adbfe832
|