Skip to main content

A CLI to estimate inference memory requirements for Hugging Face models, written in Python

Project description

hf-mem-github-x4

hf-mem is an experimental CLI to estimate inference memory requirements for Hugging Face models, written in Python. hf-mem is lightweight, only depends on httpx. It's recommended to run with uv for a better experience.

hf-mem lets you estimate the inference requirements to run any model from the Hugging Face Hub, including Transformers, Diffusers and Sentence Transformers models, as well as any model that contains Safetensors compatible weights.

Read more information about hf-mem in this short-form post.

Usage

uvx hf-mem --model-id MiniMaxAI/MiniMax-M2
uvx hf-mem --model-id Qwen/Qwen-Image

References

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hf_mem-0.2.0.tar.gz (7.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

hf_mem-0.2.0-py3-none-any.whl (9.2 kB view details)

Uploaded Python 3

File details

Details for the file hf_mem-0.2.0.tar.gz.

File metadata

  • Download URL: hf_mem-0.2.0.tar.gz
  • Upload date:
  • Size: 7.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.5

File hashes

Hashes for hf_mem-0.2.0.tar.gz
Algorithm Hash digest
SHA256 cb72ae85a105b98cbd4c6a117e30f81806cfec136cb7f620c91afb40228812dd
MD5 d29321efadb9b11a14f1a3029165f53d
BLAKE2b-256 94d32b8fd6a54368bc466a971908ffc492d1d0bc42b1c879a6dcb784f38391c7

See more details on using hashes here.

File details

Details for the file hf_mem-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: hf_mem-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 9.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.5

File hashes

Hashes for hf_mem-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 01a5b65a828de44feb2791098872d2ff3c314bcc918c6d0085114173e1d01ee1
MD5 95942558efa238c5fad32bf18d0cc1b7
BLAKE2b-256 46a7e5beca12e1d174556b9961095ddaaebe081eb06ee8753149d0cd18f4aed7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page