Skip to main content

Serve Transformers and Manage GPUs

Project description

tfmx

Install

pip install tfmx --upgrade

Commands

Set GPU control state:

gpu_fan -cs a:1

Set GPU power limit:

# M-X GPU-0/1
gpu_pow -pm a:1 && gpu_pow -pl "0:160;1:200"

# M-A GPU-0/1
gpu_pow -pm a:1 && gpu_pow -pl "0,1:160"

Set GPU fan speed:

gpu_fan -cs a:1 && gpu_fan -fs a:100

Set GPU monitor with curve:

# gpu_mon -c "a:30-50/50-65/60-80/75-100;3,7:25-100" -s

Run tei compose and machine:

tei machine run --auto-start --perf-track --on-conflict replace

Run tei benchmark:

tei benchmark run -E "http://localhost:28800" -n 100000

Run qwn compose, machine, and benchmark:

export QWN_MACHINE_URL="http://$QWN_HOST:27800"

qwn compose up --gpu-configs "0"
qwn compose up --gpu-layout uniform-awq
qwn machine run --auto-start -b --on-conflict replace
qwn client chat "你好,请做个自我介绍"
qwn benchmark run -E "$QWN_MACHINE_URL" -n 100

Run qsr compose, machine, and benchmark:

export QSR_MACHINE_URL="http://$QSR_HOST:27900"

qsr compose up --gpu-configs "0"
qsr machine run --auto-start -b --on-conflict replace
qsr client transcribe ./sample.wav
qsr client chat --audio ./sample.wav "请转写为简体中文"
qsr benchmark run -E "$QSR_MACHINE_URL" -n 20 --audio ./sample.wav

Staged Run Scripts

For repeatable repo-local workflows, use the staged script directories:

bash runs/teis/01_deploy_default.sh
bash runs/teis/02_start_machine.sh
bash runs/teis/03_health_check.sh
bash runs/qwns/01_deploy_uniform.sh
bash runs/qwns/02_start_machine.sh
bash runs/qwns/03_health_check.sh
bash runs/qsrs/01_deploy_default.sh
bash runs/qsrs/02_start_machine.sh
bash runs/qsrs/03_health_check.sh
bash runs/recovery/restart_tei_qwn.sh
  • runs/teis/README.md: staged TEI deploy, health, benchmark, cleanup
  • runs/qwns/README.md: staged QWN deploy, health, benchmark, cleanup
  • runs/qsrs/README.md: staged QSR deploy, health, benchmark, cleanup
  • runs/recovery/README.md: joint TEI + QWN recovery and live validation

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tfmx-1.3.tar.gz (282.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tfmx-1.3-py3-none-any.whl (309.5 kB view details)

Uploaded Python 3

File details

Details for the file tfmx-1.3.tar.gz.

File metadata

  • Download URL: tfmx-1.3.tar.gz
  • Upload date:
  • Size: 282.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.9.25

File hashes

Hashes for tfmx-1.3.tar.gz
Algorithm Hash digest
SHA256 1f2e203d5d162bf474b4612ca1d9d1e262eceeab7c1cdcf2f59edacc988b02d6
MD5 41bcb81c9b82a62d279079b62e41aeda
BLAKE2b-256 239ecf68ad81b48ec908526b1bd16275b86ab378daf4a1b0d657fe0bcc4d8982

See more details on using hashes here.

File details

Details for the file tfmx-1.3-py3-none-any.whl.

File metadata

  • Download URL: tfmx-1.3-py3-none-any.whl
  • Upload date:
  • Size: 309.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.9.25

File hashes

Hashes for tfmx-1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 4323e9f8e89d79ffe0be6e01ddec0b6b049d5b62d6fc96835340bf0e8753f950
MD5 03ebd1311596fa7cc923a98e161c42ca
BLAKE2b-256 98c47ed3604bcbd577da442c3b292d6d5c9ba06d9556b49022f1728d86471b30

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page