Skip to main content

Serve Transformers and Manage GPUs

Project description

tfmx

Install

pip install tfmx --upgrade

Commands

Set GPU control state:

gpu_fan -cs a:1

Set GPU power limit:

# M-X GPU-0/1
gpu_pow -pm a:1 && gpu_pow -pl "0:160;1:200"

# M-A GPU-0/1
gpu_pow -pm a:1 && gpu_pow -pl "0,1:160"

Set GPU fan speed:

gpu_fan -cs a:1 && gpu_fan -fs a:100

Set GPU monitor with curve:

# gpu_mon -c "a:30-50/50-65/60-80/75-100;3,7:25-100" -s

Run tei compose and machine:

tei machine run --auto-start --perf-track --on-conflict replace

Run tei benchmark:

tei benchmark run -E "http://localhost:28800" -n 100000

Run qwn compose, machine, and benchmark:

export QWN_MACHINE_URL="http://$QWN_HOST:27800"

qwn compose up --gpu-configs "0"
qwn compose up --gpu-layout uniform-awq
qwn machine run --auto-start -b --on-conflict replace
qwn client chat "你好,请做个自我介绍"
qwn benchmark run -E "$QWN_MACHINE_URL" -n 100

Staged Run Scripts

For repeatable repo-local workflows, use the staged script directories:

bash runs/teis/01_deploy_default.sh
bash runs/teis/02_start_machine.sh
bash runs/teis/03_health_check.sh
bash runs/qwns/01_deploy_uniform.sh
bash runs/qwns/02_start_machine.sh
bash runs/qwns/03_health_check.sh
bash runs/recovery/restart_tei_qwn.sh
  • runs/teis/README.md: staged TEI deploy, health, benchmark, cleanup
  • runs/qwns/README.md: staged QWN deploy, health, benchmark, cleanup
  • runs/recovery/README.md: joint TEI + QWN recovery and live validation

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tfmx-1.1.4.tar.gz (210.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tfmx-1.1.4-py3-none-any.whl (232.4 kB view details)

Uploaded Python 3

File details

Details for the file tfmx-1.1.4.tar.gz.

File metadata

  • Download URL: tfmx-1.1.4.tar.gz
  • Upload date:
  • Size: 210.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.9.25

File hashes

Hashes for tfmx-1.1.4.tar.gz
Algorithm Hash digest
SHA256 2686de60ee25d313a47f491ad56f7f956654c3749f464bf22b6555befa8a157f
MD5 9ea99f71844d36ac5a592f83c2c75e58
BLAKE2b-256 5c0dfd844ac05cfed35326e83ca53ced3ced6548c2aaeb0e3a41d346a8bda344

See more details on using hashes here.

File details

Details for the file tfmx-1.1.4-py3-none-any.whl.

File metadata

  • Download URL: tfmx-1.1.4-py3-none-any.whl
  • Upload date:
  • Size: 232.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.9.25

File hashes

Hashes for tfmx-1.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 740e673692f7e18311608e336cea93a3fe058903a698bc61a275b2a811a91a7b
MD5 971e33817aa4d7d3343416b9ff31c1fd
BLAKE2b-256 0f9bc71f0438fd3d5c71cbaa7dabaf6b7e5b0e958c2d9598680371ac60a95788

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page