Skip to main content

Serve Transformers and Manage GPUs

Project description

tfmx

Install

pip install tfmx --upgrade

Commands

Set GPU control state:

gpu_fan -cs a:1

Set GPU power limit:

# M-X GPU-0/1
gpu_pow -pm a:1 && gpu_pow -pl "0:160;1:200"

# M-A GPU-0/1
gpu_pow -pm a:1 && gpu_pow -pl "0,1:160"

Set GPU fan speed:

gpu_fan -cs a:1 && gpu_fan -fs a:100

Set GPU monitor with curve:

# gpu_mon -c "a:30-50/50-65/60-80/75-100;3,7:25-100" -s

Run tei compose and machine:

tei machine run --auto-start --perf-track --on-conflict replace

Run tei benchmark:

tei benchmark run -E "http://localhost:28800" -n 100000

Run qwn compose, machine, and benchmark:

export QWN_MACHINE_URL="http://$QWN_HOST:27800"

qwn compose up --gpu-configs "0"
qwn compose up --gpu-layout uniform-awq
qwn machine run --auto-start -b --on-conflict replace
qwn client chat "你好,请做个自我介绍"
qwn benchmark run -E "$QWN_MACHINE_URL" -n 100

Run qsr compose, machine, and benchmark:

export QSR_MACHINE_URL="http://$QSR_HOST:27900"

qsr compose up --gpu-configs "0"
qsr machine run --auto-start -b --on-conflict replace
qsr client transcribe ./sample.wav
qsr client chat --audio ./sample.wav "请转写为简体中文"
qsr benchmark run -E "$QSR_MACHINE_URL" -n 20 --audio ./sample.wav

Staged Run Scripts

For repeatable repo-local workflows, use the staged script directories:

bash runs/teis/01_deploy_default.sh
bash runs/teis/02_start_machine.sh
bash runs/teis/03_health_check.sh
bash runs/qwns/01_deploy_uniform.sh
bash runs/qwns/02_start_machine.sh
bash runs/qwns/03_health_check.sh
bash runs/qsrs/01_deploy_default.sh
bash runs/qsrs/02_start_machine.sh
bash runs/qsrs/03_health_check.sh
bash runs/recovery/restart_tei_qwn.sh
  • runs/teis/README.md: staged TEI deploy, health, benchmark, cleanup
  • runs/qwns/README.md: staged QWN deploy, health, benchmark, cleanup
  • runs/qsrs/README.md: staged QSR deploy, health, benchmark, cleanup
  • runs/recovery/README.md: joint TEI + QWN recovery and live validation

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tfmx-1.2.tar.gz (259.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tfmx-1.2-py3-none-any.whl (285.4 kB view details)

Uploaded Python 3

File details

Details for the file tfmx-1.2.tar.gz.

File metadata

  • Download URL: tfmx-1.2.tar.gz
  • Upload date:
  • Size: 259.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.9.25

File hashes

Hashes for tfmx-1.2.tar.gz
Algorithm Hash digest
SHA256 3ede0db34dc5d6bb34d094eac4af3dc8146f04c1d686f8159c84ba109128e8bc
MD5 cc51628e4d31a8922cc8ec0fc4975697
BLAKE2b-256 1b3928ebdcf6bc11706ea4c7cd491c91223a7293be5b6d226f25c4776b9f9a69

See more details on using hashes here.

File details

Details for the file tfmx-1.2-py3-none-any.whl.

File metadata

  • Download URL: tfmx-1.2-py3-none-any.whl
  • Upload date:
  • Size: 285.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.9.25

File hashes

Hashes for tfmx-1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 9d8939ab39f473cda7cc8bb1160929a5fab80413e101436de078727e10e47b8a
MD5 c0bb49f2d48e052e086311ac35114dcc
BLAKE2b-256 06f5e5bab26f43b7273804452e5562e8f05d5d7ee5f2f3079aa1c6e9adbfe832

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page