Predict GPU execution time & memory for PyTorch models — without running them.

Blink 🔭

Requires Python 3.9 or newer.

GPU Performance Predictor for Deep Learning Models

Blink predicts the execution time and peak memory usage of PyTorch neural networks on GPU hardware before you actually run or deploy them.

It combines classical ML (XGBoost, Random Forest) with a Graph Neural Network (GNN) that encodes the computational graph of any model architecture, acting as a "virtual profiler."

⚡ Quick Start

Installation

Blink is published on PyPI. You can install the core API, or install with optional dependency groups:

# Core prediction API only
pip install blink-gpu

# Include Streamlit Dashboard, SHAP explainability, and Plotly
pip install "blink-gpu[full]"

# Include FastAPI REST Server
pip install "blink-gpu[api]"

# Install everything
pip install "blink-gpu[all]"

Note: You must install PyTorch (torch, torchvision) separately according to your CUDA hardware.
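Since PyTorch is not installed automatically, it can help to confirm it is importable before using Blink. The sketch below uses only the standard library; the helper name `missing_packages` is illustrative and not part of Blink.

```python
import importlib.util


def missing_packages(names=("torch", "torchvision")):
    """Return the top-level package names that cannot be found on this interpreter."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Install these before using Blink:", ", ".join(missing))
    else:
        print("torch and torchvision are importable.")
```

This only checks that the packages can be located; it does not verify that the installed build matches your CUDA version.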

Python Usage

import torchvision.models as tv
from blink import BlinkPredictor, BlinkAnalyzer

# 1. Analyze any PyTorch model architecture
model = tv.resnet18(weights=None)
print(BlinkAnalyzer().summary(model))
# ➔ Parameters: 11,689,512 | FLOPs: 1,814 M | Conv layers: 20 | Size: 44.59 MB

# 2. Predict execution time and memory for a batch size
predictor = BlinkPredictor()
result = predictor.predict(model, batch_size=32)

print(f"Exec time: {result['exec_time_ms']:.1f} ms")
print(f"Memory   : {result['memory_mb']:.1f} MB")
# ➔ Exec time: 18.3 ms
# ➔ Memory   : 184.3 MB

# 3. Sweep multiple batch sizes
sweep = predictor.predict_batch("resnet50", batch_sizes=[1, 16, 32, 64])
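A predicted execution time is often easier to compare across batch sizes once it is converted to throughput. The following sketch assumes a result dictionary with the `exec_time_ms` key shown above; the helper names `throughput_per_second` and `latency_per_sample_ms` are illustrative, not part of Blink, and the sample values are simply the ones printed in the usage example.

```python
def throughput_per_second(batch_size: int, exec_time_ms: float) -> float:
    """Samples processed per second for one batch of the given size."""
    if batch_size <= 0 or exec_time_ms <= 0:
        raise ValueError("batch_size and exec_time_ms must be positive")
    return batch_size / (exec_time_ms / 1000.0)


def latency_per_sample_ms(batch_size: int, exec_time_ms: float) -> float:
    """Average time spent per sample within one batch, in milliseconds."""
    if batch_size <= 0 or exec_time_ms <= 0:
        raise ValueError("batch_size and exec_time_ms must be positive")
    return exec_time_ms / batch_size


if __name__ == "__main__":
    # Values copied from the sample output above, not measured here.
    result = {"exec_time_ms": 18.3, "memory_mb": 184.3}
    batch_size = 32
    print(f"Throughput: {throughput_per_second(batch_size, result['exec_time_ms']):.0f} samples/s")
    print(f"Per sample: {latency_per_sample_ms(batch_size, result['exec_time_ms']):.3f} ms")
```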

💻 Command Line Interface (CLI)

Blink comes with a built-in CLI for quick profiling without writing scripts:

# Predict via CLI
$ blink predict resnet50 --batch-size 32
🔮 Blink prediction for 'resnet50'
 Batch   Exec (ms)   Memory (MB)  CI-Exec (80%)
------------------------------------------------------------
    32       28.45         294.5  [22.1 - 36.6]

# Launch the Streamlit Dashboard
$ blink dashboard --port 8501

# Launch the FastAPI REST Server
$ blink server --host 0.0.0.0 --port 8000
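If you want to consume the `blink predict` table from a script, one option is to parse its data rows. The sketch below assumes the row layout shown in the sample output above (batch size, execution time, memory, then a bracketed interval); the layout may differ between versions, and `parse_prediction_row` is a hypothetical helper, not part of Blink.

```python
import re

# One data row: "<batch> <exec ms> <memory MB> [<low> - <high>]"
_NUMBER = r"\d+(?:\.\d+)?"
ROW = re.compile(
    rf"^\s*(?P<batch>\d+)\s+(?P<exec_ms>{_NUMBER})\s+(?P<memory_mb>{_NUMBER})"
    rf"\s+\[(?P<ci_low>{_NUMBER})\s*-\s*(?P<ci_high>{_NUMBER})\]\s*$"
)


def parse_prediction_row(line: str):
    """Return a dict for a data row, or None for headers, rules, and other lines."""
    match = ROW.match(line)
    if match is None:
        return None
    return {
        "batch_size": int(match["batch"]),
        "exec_time_ms": float(match["exec_ms"]),
        "memory_mb": float(match["memory_mb"]),
        "exec_time_bounds": (float(match["ci_low"]), float(match["ci_high"])),
    }


if __name__ == "__main__":
    print(parse_prediction_row("    32       28.45         294.5  [22.1 - 36.6]"))
```

For anything beyond quick scripting, the Python API or the REST endpoint returns structured data directly and is likely the more robust choice.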

📊 Streamlit Dashboard & Explainability

Blink includes an interactive web dashboard. Run blink dashboard to access the following features.

[Image: Blink Dashboard SHAP Explainability Demo]

Live Predictions: Instantly predict performance for custom PyTorch code or TorchVision models.
SHAP Explainability ("Why this prediction?"): Interactive waterfall charts showing which architectural features (e.g., FLOPs, Conv layers, Model Depth) pushed the predicted execution time and memory footprint up or down.

[Image: Blink Batch Optimizer Demo]

Batch Size Optimizer: Find the maximum batch size that fits within your GPU memory budget (e.g., 8GB, 16GB, 24GB).
Compare Architectures: Side-by-side performance comparison of different models.
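The idea behind a batch size optimizer can be sketched as a binary search over batch sizes against a memory predictor. This is an illustration under assumptions, not the dashboard's actual implementation: `max_batch_size` is a hypothetical helper, predicted memory is assumed to be non-decreasing in batch size, and the linear memory model in the demo is made up.

```python
from typing import Callable


def max_batch_size(
    predict_memory_mb: Callable[[int], float],
    budget_mb: float,
    upper: int = 4096,
) -> int:
    """Largest batch size in [1, upper] whose predicted memory fits the budget.

    Returns 0 if even batch size 1 does not fit. Assumes predicted memory
    never decreases as the batch size grows.
    """
    low, high, best = 1, upper, 0
    while low <= high:
        mid = (low + high) // 2
        if predict_memory_mb(mid) <= budget_mb:
            best = mid          # mid fits; try something larger
            low = mid + 1
        else:
            high = mid - 1      # mid is too large; try something smaller
    return best


if __name__ == "__main__":
    # Made-up memory model: 45 MB fixed cost plus 4.5 MB per sample.
    def fake_memory(batch_size: int) -> float:
        return 45.0 + 4.5 * batch_size

    print(max_batch_size(fake_memory, budget_mb=8 * 1024))
```

With the Python API shown earlier, the callable could plausibly be `lambda b: predictor.predict(model, batch_size=b)["memory_mb"]`, keeping in mind that predictions carry error, so leaving some headroom below the real GPU capacity is prudent.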

🌐 REST API & Docker Deployment

Blink can be deployed as a microservice to provide GPU cost estimates to other applications.

Docker Compose (Recommended)

Docker Compose starts both the Streamlit Dashboard and the FastAPI backend with a single command.

git clone https://github.com/Aniketxmishra/Blink_Main.git
cd Blink_Main
docker compose up -d

Dashboard: http://localhost:8501
REST API: http://localhost:8000/docs (Swagger UI)

REST API Example

curl -X POST "http://localhost:8000/api/v2/predict" \
     -H "Content-Type: application/json" \
     -d '{"model_name": "resnet50", "batch_size": 32}'

# Response:
# {
#   "model_name": "resnet50",
#   "batch_size": 32,
#   "predictions": {
#     "exec_time_ms": 28.45,
#     "exec_time_bounds": [22.1, 36.6],
#     "memory_usage_mb": 294.5,
#     ...
#   }
# }
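The same request can be issued from Python with only the standard library. This sketch reuses the endpoint path and JSON fields from the curl example above; the function names and the default base URL are assumptions for illustration, and the response is returned as parsed JSON without assuming anything about fields elided in the sample.

```python
import json
import urllib.request


def build_predict_request(model_name: str, batch_size: int,
                          base_url: str = "http://localhost:8000") -> urllib.request.Request:
    """Build (but do not send) the POST request for the predict endpoint."""
    payload = json.dumps({"model_name": model_name, "batch_size": batch_size})
    return urllib.request.Request(
        f"{base_url}/api/v2/predict",
        data=payload.encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def predict_remote(model_name: str, batch_size: int,
                   base_url: str = "http://localhost:8000",
                   timeout: float = 10.0) -> dict:
    """Send the request to a running Blink server and return the decoded JSON body."""
    request = build_predict_request(model_name, batch_size, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Calling `predict_remote("resnet50", 32)` requires the server from the Docker Compose step to be running; connection failures surface as `urllib.error.URLError`.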

🧠 How it Works (Architecture)

PyTorch Model
      │
      ▼
┌─────────────────────┐
│  Feature Extractor  │  ← layer counts, FLOPs, params, depth, width, skip connections
│  + GNN Extractor    │  ← graph-based architecture encoding (ArchitectureGNN)
└─────────┬───────────┘
          │
          ▼
┌─────────────────────┐
│  Prediction Models  │
│  ─────────────────  │
│  · XGBoost (tuned)  │  ← main predictor (best MAPE) + SHAP Explainer
│  · Random Forest    │  ← latency confidence intervals (Quantile Regression)
│  · GNN Predictor    │  ← graph-native, generalizes across architectures
└─────────┬───────────┘
          │
          ▼
   Predicted: exec_time_ms, memory_mb
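To give a feel for how a forest can yield an interval rather than a single number, the sketch below takes empirical percentiles over per-tree predictions. This is a simplified stand-in and not Blink's implementation: the diagram only states that quantile regression is used, and proper quantile regression forests work from the training targets stored in each leaf. The function names and the sample values are hypothetical.

```python
import math
from typing import Sequence, Tuple


def percentile(sorted_values: Sequence[float], q: float) -> float:
    """Linearly interpolated percentile of pre-sorted values, with q in [0, 1]."""
    if not sorted_values:
        raise ValueError("need at least one value")
    position = q * (len(sorted_values) - 1)
    lower = math.floor(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] * (1 - fraction) + sorted_values[upper] * fraction


def interval_from_trees(tree_predictions: Sequence[float],
                        coverage: float = 0.80) -> Tuple[float, float]:
    """Central interval covering `coverage` of the per-tree predictions."""
    if not 0 < coverage < 1:
        raise ValueError("coverage must be between 0 and 1")
    ordered = sorted(tree_predictions)
    tail = (1 - coverage) / 2
    return percentile(ordered, tail), percentile(ordered, 1 - tail)


if __name__ == "__main__":
    # Made-up per-tree latency predictions in milliseconds.
    trees = [24.0, 26.5, 27.1, 28.0, 28.4, 29.3, 30.2, 31.8, 33.0, 35.5, 36.9]
    low, high = interval_from_trees(trees)
    print(f"80% interval: [{low:.1f} - {high:.1f}] ms")
```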

Model Performance on Held-out Data:

Execution Time (XGBoost): ~8% MAPE
Memory Usage (XGBoost): ~6% MAPE
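MAPE (mean absolute percentage error) is the average of the absolute errors expressed as a percentage of the true values. A minimal sketch of the metric follows; the function name and the numbers in the demo are illustrative and are not Blink's evaluation data.

```python
from typing import Sequence


def mape(actual: Sequence[float], predicted: Sequence[float]) -> float:
    """Mean absolute percentage error, in percent. Actual values must be non-zero."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and the same length")
    if any(a == 0 for a in actual):
        raise ValueError("MAPE is undefined when an actual value is zero")
    total = sum(abs(a - p) / abs(a) for a, p in zip(actual, predicted))
    return 100.0 * total / len(actual)


if __name__ == "__main__":
    measured_ms = [100.0, 200.0]   # made-up measured latencies
    predicted_ms = [110.0, 180.0]  # made-up predictions, each off by 10%
    print(f"MAPE: {mape(measured_ms, predicted_ms):.1f}%")
```

Read this way, an execution-time MAPE of about 8% means predictions on held-out data deviate from measured times by roughly 8% on average, with individual predictions possibly further off.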

🔬 Development & Paper Reproducibility

Blink was developed alongside a research study evaluating the efficacy of static and graph-based features for GPU performance prediction.

To reproduce the paper's figures and ablation study:

git clone https://github.com/Aniketxmishra/Blink_Main.git
cd Blink_Main
pip install -e ".[full]"

python scripts/ablation_study.py
python scripts/generate_paper_figures.py

Outputs will be saved to the results/ directory.

📄 License

MIT License — see LICENSE for details. Made by Aniket Mishra.

Release history

0.2.0: Mar 27, 2026
0.1.7: Mar 25, 2026
0.1.6: Mar 25, 2026 (this version)
0.1.5: Mar 25, 2026
0.1.0: Mar 4, 2026

Download files

Source Distribution

blink_gpu-0.1.6.tar.gz (329.0 kB)

Uploaded Mar 25, 2026 Source

Built Distribution

blink_gpu-0.1.6-py3-none-any.whl (345.7 kB)

Uploaded Mar 25, 2026 Python 3

File details

Details for the file blink_gpu-0.1.6.tar.gz.

File metadata

Download URL: blink_gpu-0.1.6.tar.gz
Upload date: Mar 25, 2026
Size: 329.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for blink_gpu-0.1.6.tar.gz
Algorithm	Hash digest
SHA256	`7bfcfdfcd700d7774ee84eca61cefb22c8463d96b9fef923325cfb45f730d52e`
MD5	`ea705106b36576a270d742559805c5e0`
BLAKE2b-256	`ade9800d3f95fd33a332f1c43c6e548b4cd0895fbd480d73f0c0e946ac6989e3`

File details

Details for the file blink_gpu-0.1.6-py3-none-any.whl.

File metadata

Download URL: blink_gpu-0.1.6-py3-none-any.whl
Upload date: Mar 25, 2026
Size: 345.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for blink_gpu-0.1.6-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7adeadd174922bd834c264efd82ad8142f5c126cafe7e5b28c3ae3542bb5b384`
MD5	`3d62eebe738493f76ead50991638eea3`
BLAKE2b-256	`74e189e2b4c28728576e3ef67706da45da9707ce83650fa6d9fc34214cffcafe`
