A Python package for estimating GPU memory requirements and the number of GPUs needed for training machine learning models

These details have not been verified by PyPI

Project links

Project description

GPU Estimator

A Python package for estimating GPU memory requirements and the number of GPUs needed for training machine learning models.

Features

Latest Model Support: Built-in configs for LLaMA 4, Gemma 3, Qwen 2.5/3, and more
Estimate GPU memory requirements based on model parameters
Calculate optimal number of GPUs for training
Support for different precision types (FP32, FP16, BF16, INT8)
Account for optimizer states and gradient storage
Integration with Hugging Face Hub for latest models
Discover and search trending models
Support for popular architectures (GPT, LLaMA, BERT, T5, Mistral, Gemma, Qwen, etc.)
CLI interface for quick estimates
Detailed memory breakdown and recommendations

Installation

pip install gpu-estimator

Quick Start

Basic Usage

from gpu_estimator import GPUEstimator

estimator = GPUEstimator()

# Estimate for latest models using predefined configs
from gpu_estimator.utils import get_model_config

result = estimator.estimate_from_architecture(
    **get_model_config("qwen2.5-7b"),
    batch_size=8,
    sequence_length=2048,
    precision="fp16"
)

print(f"Memory needed per GPU: {result.memory_per_gpu_gb:.2f} GB")
print(f"Recommended GPUs: {result.num_gpus}")

# Or estimate by parameters for any model size
result = estimator.estimate(
    model_params=7e9,
    batch_size=32,
    sequence_length=2048,
    precision="fp16"
)

Hugging Face Integration

from gpu_estimator import GPUEstimator

estimator = GPUEstimator()

# Estimate directly from Hugging Face model ID
result = estimator.estimate_from_huggingface(
    model_id="meta-llama/Llama-3.2-3B",
    batch_size=4,
    sequence_length=2048,
    precision="fp16",
    gradient_checkpointing=True
)

print(f"Total memory required: {result.total_memory_gb:.2f} GB")
print(f"GPUs needed: {result.num_gpus}")

# Discover trending models
trending = estimator.list_trending_models(limit=10, task="text-generation")
for model in trending:
    print(f"{model.model_id} - {model.downloads:,} downloads")

# Search for specific models
models = estimator.search_models("qwen", limit=5)
for model in models:
    print(f"{model.model_id} - {model.architecture}")

CLI Usage

Basic Estimation

# Estimate for any model by parameters
gpu-estimate estimate --model-params 7e9 --batch-size 4 --precision fp16

# Estimate for predefined models (classic)
gpu-estimate estimate --model-name llama-7b --batch-size 8

# Estimate for latest predefined models
gpu-estimate estimate --model-name qwen2.5-7b --batch-size 4 --precision fp16
gpu-estimate estimate --model-name llama3.2-3b --batch-size 16 --gpu-type A100
gpu-estimate estimate --model-name gemma2-9b --batch-size 8 --precision bf16

# Estimate for Hugging Face models
gpu-estimate estimate --huggingface-model meta-llama/Llama-3.2-3B --batch-size 4
gpu-estimate estimate --huggingface-model Qwen/Qwen2.5-7B --batch-size 8

Model Discovery

# List trending models
gpu-estimate trending --limit 20 --task text-generation

# Search for models
gpu-estimate search "mistral" --limit 10

# Get popular models by architecture
gpu-estimate popular llama --limit 5

# Get model information
gpu-estimate info qwen2.5-7b

Advanced Options

# With gradient checkpointing and specific GPU
gpu-estimate estimate \
  --huggingface-model meta-llama/Llama-4-Scout-17B \
  --batch-size 8 \
  --seq-length 1024 \
  --precision fp16 \
  --gpu-type A100 \
  --gradient-checkpointing \
  --verbose

Interactive Mode

Launch an interactive session for guided GPU estimation:

gpu-estimate interactive

Features:

Guided workflows for all estimation tasks
Model discovery with direct estimation
Flexible model specification (parameters, names, or HF IDs)
Step-by-step configuration of training parameters
Quick estimates from trending model lists

Supported Models & Architectures

Hugging Face Models

The package automatically supports any model on Hugging Face Hub by detecting their configuration. Popular architectures include:

Architecture	Examples	Use Cases
LLaMA/LLaMA2/3/4	`meta-llama/Llama-2-7b-hf`, `meta-llama/Llama-3.2-3B`, `meta-llama/Llama-4-Scout-17B`	General language modeling, chat
GPT	`gpt2`, `microsoft/DialoGPT-large`	Text generation, conversation
Mistral	`mistralai/Mistral-7B-v0.1`	Efficient language modeling
CodeLlama	`codellama/CodeLlama-7b-Python-hf`	Code generation
BERT	`google-bert/bert-base-uncased`	Text classification, NLU
T5	`google-t5/t5-base`, `google/flan-t5-large`	Text-to-text tasks
Phi	`microsoft/phi-2`	Small efficient models
Gemma/Gemma2/3	`google/gemma-7b`, `google/gemma-2-9b`, `google/gemma-3-270m`	Google's language models
Qwen/Qwen2.5/3	`Qwen/Qwen-7B`, `Qwen/Qwen2.5-7B`, `Qwen/Qwen3-4B`	Multilingual models

Predefined Models

Classic and latest models with known configurations:

GPT Family:

gpt2, gpt2-medium, gpt2-large, gpt2-xl, gpt3

LLaMA Family:

Original: llama-7b, llama-13b, llama-30b, llama-65b
LLaMA 2: llama2-7b, llama2-13b, llama2-70b
LLaMA 3.2: llama3.2-1b, llama3.2-3b
LLaMA 3.3: llama3.3-70b
LLaMA 4: llama4-scout-17b, llama4-maverick-17b
Code LLaMA: codellama-7b, codellama-13b, codellama-34b

Mistral Family:

mistral-7b

Phi Family:

phi-1.5b, phi-2.7b

Gemma Family:

Original: gemma-2b, gemma-7b
Gemma 2: gemma2-2b, gemma2-9b, gemma2-27b
Gemma 3: gemma3-270m

Qwen Family:

Qwen 2.5: qwen2.5-7b, qwen2.5-14b, qwen2.5-32b, qwen2.5-72b
Qwen 3: qwen3-4b, qwen3-30b, qwen3-235b

Flexible Naming: Model names support flexible matching. Use custom-llama-7b, my-mistral-7b, or any name containing a known model identifier.

GPU Types Supported

GPU	Memory	Use Case
H100	80 GB	Latest high-performance training
A100	80 GB	Large model training and inference
A40	48 GB	Professional workstation training
A6000	48 GB	Creative and AI workstation
L40	48 GB	Data center inference
L4	24 GB	Efficient inference
RTX 4090	24 GB	Consumer high-end
RTX 3090	24 GB	Consumer enthusiast
V100	32 GB	Previous generation training
T4	16 GB	Cloud inference

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.2.0

Sep 15, 2025

0.1.4

Sep 10, 2025

0.1.3

Sep 10, 2025

0.1.2

Sep 10, 2025

0.1.1

Sep 10, 2025

0.1.0

Sep 10, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gpu_estimator-0.2.0.tar.gz (24.0 kB view details)

Uploaded Sep 15, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

gpu_estimator-0.2.0-py3-none-any.whl (19.5 kB view details)

Uploaded Sep 15, 2025 Python 3

File details

Details for the file gpu_estimator-0.2.0.tar.gz.

File metadata

Download URL: gpu_estimator-0.2.0.tar.gz
Upload date: Sep 15, 2025
Size: 24.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.10

File hashes

Hashes for gpu_estimator-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`3802f74802d8625ae5415fd40702a2cebc5fbaa26e1f76945182144e3c7f4445`
MD5	`93336ce8f0eb48fc115cd5536bac9f8a`
BLAKE2b-256	`3519e9b6ade606713a59931ab6f15d45c08312014cd8ec3a567e7e0dfe427a6c`

See more details on using hashes here.

File details

Details for the file gpu_estimator-0.2.0-py3-none-any.whl.

File metadata

Download URL: gpu_estimator-0.2.0-py3-none-any.whl
Upload date: Sep 15, 2025
Size: 19.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.10

File hashes

Hashes for gpu_estimator-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`63fcf271427c7498e78bcf65907d6cd1ebf2ec396c9972fa3831069a66db43f2`
MD5	`b341d0ad3f0c8e90ccde784e80b69c08`
BLAKE2b-256	`fbc6b399644159ff1445b50968a6a940639069245fa9d134c5113e5ddafcbafe`

See more details on using hashes here.

gpu-estimator 0.2.0

Navigation

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Classifiers

Project description

GPU Estimator

Features

Installation

Quick Start

Basic Usage

Hugging Face Integration

CLI Usage

Basic Estimation

Model Discovery

Advanced Options

Interactive Mode

Supported Models & Architectures

Hugging Face Models

Predefined Models

GPU Types Supported

Project details

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes