
Serve Transformers and Manage GPUs


tfmx

Install

pip install tfmx --upgrade

Commands

Set GPU control state:

gpu_fan -cs a:1

Set GPU power limit:

# M-X GPU-0/1
gpu_pow -pm a:1 && gpu_pow -pl "0:160;1:200"

# M-A GPU-0/1
gpu_pow -pm a:1 && gpu_pow -pl "0,1:160"
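The two `-pl` examples above suggest a small grammar: entries separated by `;`, each entry a comma-separated list of GPU indices, then `:` and a value in watts. That reading is inferred from the examples only, not from the tfmx source, and the helper name `expand_gpu_spec` is hypothetical. As a rough sketch, the spec can be expanded into one line per GPU to check what a given string would target before applying it:

```shell
# Hypothetical helper, not part of tfmx: expand a spec such as
# "0:160;1:200" or "0,1:160" into one "GPU VALUE" line per GPU.
# The grammar is an assumption inferred from the examples above.
expand_gpu_spec() {
  # Split entries on ';', then split each entry at its first ':'.
  printf '%s\n' "$1" | tr ';' '\n' | while IFS=: read -r gpus value; do
    # Split the GPU index list on ','.
    for gpu in $(printf '%s' "$gpus" | tr ',' ' '); do
      printf '%s %s\n' "$gpu" "$value"
    done
  done
}

expand_gpu_spec "0:160;1:200"   # prints "0 160" then "1 200"
expand_gpu_spec "0,1:160"       # prints "0 160" then "1 160"
```

The sketch does not interpret the `a` selector used elsewhere in the commands; it passes it through unchanged. On NVIDIA systems, `nvidia-smi --query-gpu=index,power.limit --format=csv` is one way to confirm the limits that were actually applied.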

Set GPU fan speed:

gpu_fan -cs a:1 && gpu_fan -fs a:100

Set GPU monitor with curve:

# gpu_mon -c "a:30-50/50-65/60-80/75-100;3,7:25-100" -s
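The curve string above is not explained in this README. One plausible reading, and it is only an assumption, is that each `T-S` pair means "at T degrees Celsius, run the fan at S percent", with pairs separated by `/` and the speed interpolated linearly between neighbouring points. Under that assumption, the hypothetical helper `curve_speed` below sketches how a temperature would map to a fan speed; the real `gpu_mon` may behave differently.

```shell
# Hypothetical helper, not part of tfmx: fan speed (%) for a temperature
# under a curve such as "30-50/50-65/60-80/75-100".
# Assumption: each "T-S" pair means "at T degC, run at S%".
# Uses integer arithmetic, so interpolated values are truncated.
curve_speed() {
  curve="$1"; temp="$2"
  prev_t=""; prev_s=""
  for point in $(printf '%s' "$curve" | tr '/' ' '); do
    t="${point%%-*}"   # temperature: text before the first '-'
    s="${point#*-}"    # speed: text after the first '-'
    if [ "$temp" -le "$t" ]; then
      if [ -z "$prev_t" ]; then
        echo "$s"      # below the first point: clamp to its speed
      else
        # Linear interpolation between (prev_t, prev_s) and (t, s).
        echo $(( prev_s + (s - prev_s) * (temp - prev_t) / (t - prev_t) ))
      fi
      return 0
    fi
    prev_t="$t"; prev_s="$s"
  done
  echo "$prev_s"       # above the last point: clamp to its speed
}

curve_speed "30-50/50-65/60-80/75-100" 40   # prints 57
curve_speed "30-50/50-65/60-80/75-100" 90   # prints 100
```

Under the same reading, the `3,7:25-100` part of the example would pin GPUs 3 and 7 at 100% from 25 degrees upward.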

Run tei compose and machine:

tei machine run --auto-start --perf-track --on-conflict replace

Run tei benchmark:

tei benchmark run -E "http://localhost:28800" -n 100000

Run qwn compose, machine, and benchmark:

export QWN_MACHINE_URL="http://$QWN_HOST:27800"

qwn compose up --gpu-configs "0"
qwn compose up --gpu-layout uniform-awq
qwn machine run --auto-start -b --on-conflict replace
# The prompt below means: "Hello, please introduce yourself"
qwn client chat "你好,请做个自我介绍"
qwn benchmark run -E "$QWN_MACHINE_URL" -n 100
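When these commands are chained in a script, the benchmark can start before the machine is ready to accept requests. A minimal sketch of a generic retry helper is shown here; `wait_for` is a hypothetical name, and the commented usage assumes the machine answers a plain HTTP GET on its base URL, which this README does not confirm.

```shell
# Hypothetical helper, not part of tfmx: retry a command until it
# succeeds or the attempts run out.
# usage: wait_for ATTEMPTS DELAY_SECONDS COMMAND [ARGS...]
wait_for() {
  attempts="$1"; delay="$2"; shift 2
  i=1
  while [ "$i" -le "$attempts" ]; do
    if "$@" >/dev/null 2>&1; then
      return 0                   # command succeeded
    fi
    if [ "$i" -lt "$attempts" ]; then
      sleep "$delay"             # pause only between attempts
    fi
    i=$((i + 1))
  done
  return 1                       # every attempt failed
}

# Example (assumption: a GET on the base URL succeeds once the machine is up):
# wait_for 30 2 curl -fsS "$QWN_MACHINE_URL" \
#   && qwn benchmark run -E "$QWN_MACHINE_URL" -n 100
```

The helper treats any zero exit status as success, so a different readiness probe can be passed in place of `curl`.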

Staged Run Scripts

For repeatable repo-local workflows, use the staged script directories:

bash runs/teis/01_deploy_default.sh
bash runs/teis/02_start_machine.sh
bash runs/teis/03_health_check.sh
bash runs/qwns/01_deploy_uniform.sh
bash runs/qwns/02_start_machine.sh
bash runs/qwns/03_health_check.sh
bash runs/recovery/restart_tei_qwn.sh

Each directory has a README that describes its stages:

  • runs/teis/README.md: staged TEI deploy, health check, benchmark, and cleanup
  • runs/qwns/README.md: staged QWN deploy, health check, benchmark, and cleanup
  • runs/recovery/README.md: joint TEI + QWN recovery and live validation
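The numbered prefixes imply that the stages run in order. A small wrapper that stops at the first failing stage is sketched below; `run_stages` is a hypothetical helper and is not shipped with the repository.

```shell
# Hypothetical helper, not part of tfmx: run scripts in the order given
# and stop at the first one that exits non-zero.
run_stages() {
  for script in "$@"; do
    echo "==> $script"
    bash "$script" || { echo "stage failed: $script" >&2; return 1; }
  done
}

# Example with the TEI stages listed above:
# run_stages runs/teis/01_deploy_default.sh \
#            runs/teis/02_start_machine.sh \
#            runs/teis/03_health_check.sh
```

Stopping early keeps a failed deploy from being followed by a health check against a machine that never started.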
