
OpenVINO-Easy 🚀

Framework-agnostic Python wrapper for OpenVINO 2025

Load and run AI models with three functions:

import oe

oe.load("runwayml/stable-diffusion-v1-5")   # auto-download & convert
img = oe.infer("a neon cyber-city at night")       # chooses NPU>GPU>CPU  
stats = oe.benchmark()                              # JSON perf report

🎯 Installation

Pick the variant that matches your hardware:

# CPU-only (40MB wheel, fastest install)
pip install "openvino-easy[cpu]"
# or
pip install "openvino-easy[runtime]"

# Intel® Arc/Xe GPU support
pip install "openvino-easy[gpu]"

# Intel® NPU support (Arrow Lake/Lunar Lake with FP16-NF4)
pip install "openvino-easy[npu]"

# With INT8 quantization support
pip install "openvino-easy[quant]"

# Audio model support (Whisper, TTS)
pip install "openvino-easy[audio]"

# Full development environment (OpenVINO, NNCF, optimum ~1GB)
pip install "openvino-easy[full]"

# Everything (for development)
pip install "openvino-easy[all]"

🩺 Installation Troubleshooting

Something not working? Run the doctor:

# Comprehensive diagnostics
oe doctor

# Get fix suggestions for specific hardware
oe doctor --fix gpu
oe doctor --fix npu

# JSON output for CI systems
oe doctor --json

# Check device status
oe devices

Common issues:

| Problem | Solution |
|---|---|
| ImportError: OpenVINO runtime not found | Install with hardware extras: pip install "openvino-easy[cpu]" |
| NPU detected but not functional | Install Intel NPU drivers from intel.com |
| GPU detected but not functional | Install Intel GPU drivers (intel-opencl-icd on Linux) |
| NNCF not available for INT8 quantization | Install quantization support: pip install "openvino-easy[quant]" |
| FP16-NF4 not supported | Requires Arrow Lake/Lunar Lake NPU with OpenVINO 2025.2+ |
| Version warnings | Upgrade OpenVINO: pip install --upgrade "openvino>=2025.2,<2026.0" |
| PyTorch model (.pt/.pth) not loading | Convert to ONNX first: torch.onnx.export(model, dummy_input, "model.onnx"), then oe.load("model.onnx") |
| "Native PyTorch model conversion failed" | Upload to Hugging Face Hub with config.json or use ONNX format for best compatibility |

📦 What Each Variant Includes

| Variant | OpenVINO Package | Size | Best For |
|---|---|---|---|
| [cpu] / [runtime] | openvino runtime | ~40MB | Production deployments, CPU-only inference |
| [gpu] | openvino runtime | ~40MB | Intel GPU acceleration |
| [npu] | openvino runtime | ~40MB | Intel NPU acceleration |
| [quant] | openvino + NNCF | ~440MB | INT8 quantization support |
| [audio] | openvino + librosa | ~100MB | Audio models (Whisper, TTS) |
| [full] | openvino + NNCF + optimum | ~1GB | Development, model optimization, research |

⚡ Quick Start

Basic Usage

import oe

# Load any model (Hugging Face, ONNX, or OpenVINO IR)
oe.load("microsoft/DialoGPT-medium")

# Run inference (automatic tokenization for text models)
response = oe.infer("Hello, how are you?")
print(response)  # "I'm doing well, thank you for asking!"

# Benchmark performance
stats = oe.benchmark()
print(f"Average latency: {stats['avg_latency_ms']:.2f}ms")
print(f"Throughput: {stats['throughput_fps']:.1f} FPS")

# Explicitly free memory when done
oe.unload()
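The avg_latency_ms and throughput_fps fields in the benchmark report can be understood as simple aggregates over repeated, timed inference calls. A minimal sketch of such a measurement loop, independent of OpenVINO-Easy and using a stand-in workload (the warm-up count and single-stream reciprocal are illustrative assumptions):

```python
import time
from statistics import mean

def simple_benchmark(fn, warmup=3, runs=10):
    """Time repeated calls to fn and report aggregate stats.
    A sketch, not OpenVINO-Easy's actual implementation."""
    for _ in range(warmup):  # warm-up runs are discarded
        fn()
    latencies = []
    for _ in range(runs):
        start = time.perf_counter()
        fn()
        latencies.append((time.perf_counter() - start) * 1000.0)  # ms
    avg_ms = mean(latencies)
    return {
        "avg_latency_ms": avg_ms,
        "throughput_fps": 1000.0 / avg_ms,  # single-stream approximation
    }

stats = simple_benchmark(lambda: sum(range(10_000)))
print(f"Average latency: {stats['avg_latency_ms']:.2f}ms")
```

Real benchmarking tools usually also report percentiles and use multiple inference streams, so measured throughput can exceed this single-stream estimate.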

Advanced Usage

# Specify device preference and precision
oe.load(
    "runwayml/stable-diffusion-v1-5",
    device_preference=["NPU", "GPU", "CPU"],  # Try NPU first, fallback to GPU, then CPU
    dtype="fp16-nf4"  # New FP16-NF4 precision for Arrow Lake/Lunar Lake NPUs
)

# Generate image
image = oe.infer(
    "a serene mountain landscape at sunset",
    num_inference_steps=20,
    guidance_scale=7.5
)

# Get detailed model info
info = oe.get_info()
print(f"Running on: {info['device']}")
print(f"Model type: {info['dtype']}")
print(f"Quantized: {info['quantized']}")

# Context manager for automatic cleanup
with oe.load("runwayml/stable-diffusion-v1-5") as pipe:
    image = pipe.infer("a serene mountain landscape")
    # Model automatically unloaded when exiting context

Audio Models

# Speech-to-text with Whisper
oe.load("openai/whisper-base")
transcription = oe.infer("path/to/audio.wav")
print(transcription)  # "Hello, this is the transcribed audio"

# Text-to-speech (OpenVINO 2025.2+)
oe.load("microsoft/speecht5_tts")
audio = oe.infer("Hello world!")
# Save or play the generated audio

Memory Management

OpenVINO-Easy provides flexible memory management for production applications:

# Method 1: Explicit unload
oe.load("large-model")
result = oe.infer(data)
oe.unload()  # Free memory immediately

# Method 2: Context manager (recommended)
with oe.load("large-model") as pipe:
    result = pipe.infer(data)
    # Model automatically unloaded when exiting

# Method 3: Multiple model switching  
oe.load("text-model")
result1 = oe.infer("Hello world")
oe.unload()

oe.load("image-model")
result2 = oe.infer(image_data)
oe.unload()

# Check if model is still loaded
if oe.is_loaded():
    result = oe.infer(data)
else:
    print("Model has been unloaded")

Model Management & Discovery

OpenVINO-Easy provides comprehensive model management capabilities:

# Search for models on Hugging Face Hub
results = oe.models.search("stable diffusion", limit=5, model_type="image")
for model in results:
    print(f"{model['id']}: {model['downloads']:,} downloads")

# Get detailed model information
info = oe.models.info("microsoft/DialoGPT-medium")
print(f"Local: {info['local']}, Remote: {info['remote']}")
print(f"Requirements: {info['requirements']['min_memory_mb']} MB")

# Install models without loading them
result = oe.models.install("runwayml/stable-diffusion-v1-5", dtype="fp16")
print(f"Installed: {result['size_mb']:.1f} MB")

# Validate model integrity
results = oe.models.validate()
print(f"Validation: {results['passed']}/{results['validated']} models valid")

# Benchmark all installed models
results = oe.models.benchmark_all()
best = results['summary']['fastest_model']
print(f"Fastest model: {best['id']} ({best['fps']:.1f} FPS)")

Model Storage & Cache Management

OpenVINO-Easy uses a clean, Ollama-style directory structure:

# Check where models are stored
print("Models directory:", oe.models.dir())
# Windows: C:\Users\username\AppData\Local\openvino-easy\models\
# Linux/Mac: ~/.openvino-easy/models/

# List all cached models
models_list = oe.models.list()
for model in models_list:
    print(f"{model['name']}: {model['size_mb']:.1f} MB")

# Check cache usage
cache_info = oe.cache.size()
print(f"Total cache size: {cache_info['total_size_mb']:.1f} MB")
print(f"Models: {cache_info['model_count']}")

# Clean up temporary files only (keeps models)
oe.cache.clear()

# Remove a specific model (exact name required for safety)
result = oe.models.remove("microsoft--DialoGPT-medium--fp16--a1b2c3d4")
print(result)  # Shows what was removed

# Clear everything including models (requires confirmation)
result = oe.models.clear()  # Shows safety warning, requires confirm=False
result = oe.models.clear(confirm=False)  # Actually performs deletion

# Clear temp cache only (safe)
oe.cache.clear()

# Clear both temp cache and models (dangerous, requires confirmation)
oe.cache.clear(models=True)  # Shows safety warning
oe.cache.clear(models=True, confirm=False)  # Actually performs deletion

Directory Structure:

~/.openvino-easy/                    # Linux/Mac
C:\Users\user\AppData\Local\openvino-easy\  # Windows
├── models/                          # Downloaded/converted models (permanent)
│   ├── microsoft--DialoGPT-medium--fp16--a1b2c3d4/
│   └── openai--whisper-base--int8--e5f6g7h8/
├── cache/                           # Temporary conversion files
└── config/                          # User settings
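The model directory names above follow a model--dtype--hash pattern. A hypothetical sketch of how such a key could be derived (the hash inputs and 8-character truncation are assumptions, not the package's actual scheme):

```python
import hashlib

def model_dir_name(model_id: str, dtype: str) -> str:
    """Build an Ollama-style directory name like
    'microsoft--DialoGPT-medium--fp16--a1b2c3d4' (hypothetical scheme)."""
    # Replace the org/model separator so the id is filesystem-safe
    safe_id = model_id.replace("/", "--")
    # A short content hash keeps names unique across conversion settings
    digest = hashlib.sha256(f"{model_id}:{dtype}".encode()).hexdigest()[:8]
    return f"{safe_id}--{dtype}--{digest}"

print(model_dir_name("microsoft/DialoGPT-medium", "fp16"))
# microsoft--DialoGPT-medium--fp16--<8-hex-char suffix>
```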

Environment Override:

# Custom models directory
export OE_MODELS_DIR="/shared/ai-models"
# or
OE_MODELS_DIR="/shared/ai-models" python app.py
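Resolution of the models directory presumably checks the environment variable before falling back to the documented per-platform default. A minimal sketch of that logic (the fallback paths mirror the defaults listed above; the function itself is illustrative, not the library's code):

```python
import os
import sys
from pathlib import Path

def resolve_models_dir() -> Path:
    """Pick the models directory: OE_MODELS_DIR wins, otherwise the
    documented per-platform default (illustrative, not the real code)."""
    override = os.environ.get("OE_MODELS_DIR")
    if override:
        return Path(override)
    if sys.platform == "win32":
        return Path(os.environ["LOCALAPPDATA"]) / "openvino-easy" / "models"
    return Path.home() / ".openvino-easy" / "models"

os.environ["OE_MODELS_DIR"] = "/shared/ai-models"
print(resolve_models_dir())  # /shared/ai-models (on POSIX)
```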

🔧 Command Line Interface

# Text inference
oe run "microsoft/DialoGPT-medium" --prompt "Hello there"

# Audio inference (speech-to-text)
oe run "openai/whisper-base" --input-file "audio.wav"

# Image generation
oe run "runwayml/stable-diffusion-v1-5" --prompt "a beautiful sunset"

# Benchmark with latest NPU precision
oe bench "runwayml/stable-diffusion-v1-5" --dtype fp16-nf4

# System diagnostics
oe doctor

# List available devices
oe devices

# Enhanced NPU diagnostics (Arrow Lake/Lunar Lake detection)
oe npu-doctor

# Cache management
oe cache list              # List cached models
oe cache size              # Show cache usage
oe cache remove <model>    # Remove specific model (with confirmation)
oe cache clear             # Clear temp cache only (safe)
oe cache clear --models    # Clear all models (DANGEROUS - requires confirmation)
oe cache clear --models --force  # Override safety (VERY DANGEROUS)

# Advanced model management
oe models search "stable diffusion" --limit 5  # Search HuggingFace Hub
oe models info microsoft/DialoGPT-medium       # Get model details
oe models install runwayml/stable-diffusion-v1-5 --dtype fp16  # Install model
oe models validate         # Validate all models
oe models benchmark        # Benchmark all installed models

๐Ÿ—๏ธ Architecture

OpenVINO-Easy wraps OpenVINO's API:

┌─────────────────┐    ┌──────────────────┐    ┌─────────────────┐
│   Your Code     │    │  OpenVINO-Easy   │    │   OpenVINO      │
│                 │    │                  │    │                 │
│ oe.load(...)    │───▶│ • Model Loading  │───▶│ • IR Conversion │
│ oe.infer(...)   │    │ • Device Select  │    │ • Compilation   │
│ oe.benchmark()  │    │ • Preprocessing  │    │ • Inference     │
└─────────────────┘    └──────────────────┘    └─────────────────┘

Key Features

  • Device Selection: Chooses NPU → GPU → CPU based on availability
  • Model Loading: Supports Hugging Face, ONNX, and OpenVINO IR formats
  • Conversion: Converts models to OpenVINO IR format
  • INT8 Quantization: Quantization with NNCF for faster inference
  • Benchmarking: Performance metrics and timing
  • Caching: SHA-256 based model caching for fast re-loading
  • Memory Management: Explicit unload() and context manager support
  • Hardware Diagnostics: Tools for troubleshooting device issues
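The NPU → GPU → CPU preference can be pictured as a first-match scan over the devices actually present on the machine. A toy sketch of that selection logic (device probing stubbed out; not the library's implementation):

```python
def select_device(available, preference=("NPU", "GPU", "CPU")):
    """Return the first preferred device that is actually available
    (illustrative stand-in for OpenVINO-Easy's device selection)."""
    for device in preference:
        if device in available:
            return device
    raise RuntimeError(f"None of {preference} available in {available}")

# On a machine with no NPU, the GPU wins; CPU is the universal fallback.
print(select_device(["CPU", "GPU"]))  # GPU
print(select_device(["CPU"]))         # CPU
```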

🤖 Supported Models

Text Models

  • Conversational: DialoGPT, BlenderBot, ChatGLM
  • Text Generation: GPT-2, GPT-J, OPT, BLOOM
  • Question Answering: BERT, RoBERTa, DeBERTa
  • Text Classification: DistilBERT, ALBERT

Vision Models

  • Image Generation: Stable Diffusion, DALL-E 2
  • Object Detection: YOLO, SSD, RetinaNet
  • Image Classification: ResNet, EfficientNet, Vision Transformer
  • Segmentation: U-Net, DeepLab, Mask R-CNN

Audio Models

  • Speech Recognition: Whisper, Wav2Vec2, WavLM
  • Text-to-Speech: SpeechT5, Bark (coming soon)
  • Audio Classification: Hubert, Audio Transformers

Multimodal Models

  • Vision-Language: CLIP, BLIP, LLaVA
  • Image Captioning: BLIP-2, GIT, OFA

🚀 Performance

Performance benchmarks:

| Model | Hardware | Throughput | Latency |
|---|---|---|---|
| Stable Diffusion 1.5 | Intel Core Ultra 7 Lunar Lake (NPU) | 2.3+ img/s | 420ms |
| Stable Diffusion 1.5 | Intel Core Ultra 7 Arrow Lake (NPU) | 2.2+ img/s | 450ms |
| Stable Diffusion 1.5 | Intel Core Ultra 7 (1st gen NPU) | 1.8 img/s | 556ms |
| Stable Diffusion 1.5 | Intel Arc A770 (GPU) | 1.6 img/s | 625ms |
| Stable Diffusion 1.5 | Intel Core i7-13700K (CPU) | 0.4 img/s | 2.5s |
| DialoGPT-medium | Intel Core Ultra 7 Lunar Lake (NPU) | 50+ tok/s | 20ms |
| DialoGPT-medium | Intel Core Ultra 7 Arrow Lake (NPU) | 48+ tok/s | 21ms |
| DialoGPT-medium | Intel Core Ultra 7 (1st gen NPU) | 40 tok/s | 25ms |
| DialoGPT-medium | Intel Arc A770 (GPU) | 38 tok/s | 26ms |
| DialoGPT-medium | Intel Core i7-13700K (CPU) | 12 tok/s | 83ms |

Benchmarks with FP16-NF4 precision on Arrow Lake/Lunar Lake NPUs (OpenVINO 2025.2+)
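For single-stream inference, throughput and latency in tables like this are roughly reciprocal (pipelining and batching can push measured throughput somewhat above 1/latency). A quick sanity check of that relationship:

```python
def fps_to_latency_ms(fps: float) -> float:
    """Approximate single-stream latency implied by a throughput figure."""
    return 1000.0 / fps

# 0.4 img/s on CPU implies roughly 2.5s per image, matching the table above.
print(f"{fps_to_latency_ms(0.4):.0f}ms")  # 2500ms
```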

🔬 Text Processing Details

OpenVINO-Easy handles text preprocessing automatically:

# For text models, tokenization is automatic
pipe = oe.load("microsoft/DialoGPT-medium")

# Multiple input formats supported:
response = pipe.infer("Hello!")                    # String input
response = pipe.infer(["Hello!", "How are you?"])  # Batch input
response = pipe.infer({"text": "Hello!"})          # Dict input

Tokenization Strategy:

  1. HuggingFace Models: Uses transformers.AutoTokenizer with model-specific settings
  2. ONNX Models: Attempts to infer tokenizer from model metadata
  3. OpenVINO IR: Falls back to basic text preprocessing
  4. Custom Models: Provides hooks for custom tokenization
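The fallback order above amounts to a simple dispatch: try a model-specific tokenizer first, then degrade to basic text preprocessing. A toy illustration of that control flow (the transformers import and the whitespace fallback are assumptions, not the package's actual code):

```python
def get_tokenizer(model_id: str, model_format: str):
    """Pick a tokenization strategy per model format (illustrative)."""
    if model_format == "huggingface":
        try:
            from transformers import AutoTokenizer  # optional dependency
            return AutoTokenizer.from_pretrained(model_id)
        except ImportError:
            pass  # transformers not installed: fall through to basic strategy
    # OpenVINO IR / unknown formats: basic whitespace preprocessing
    return lambda text: text.lower().split()

tokenize = get_tokenizer("some/ir-model", "openvino_ir")
print(tokenize("Hello, how are you?"))
```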

🧪 Development & Testing

Modern Python Packaging (Recommended)

# Install in editable mode with development dependencies
pip install -e ".[dev]"

# Or install specific extras for testing
pip install -e ".[full,dev]"  # Full OpenVINO + dev tools

Comprehensive Testing Framework

OpenVINO-Easy includes a robust testing framework with multiple test categories:

# Quick tests (unit tests only, fast)
python test_runner.py --mode fast

# All tests except slow ones
python test_runner.py --mode full

# Integration tests with real OpenVINO models
python test_runner.py --mode integration

# End-to-end tests with real HuggingFace models (requires internet)
python test_runner.py --mode e2e

# Performance regression testing
python test_runner.py --mode performance

# Model compatibility validation
pytest tests/test_model_compatibility.py -v

# Cache management and safety tests
pytest tests/test_model_management.py -v

# CLI functionality tests
pytest tests/test_cli_models.py -v

# Run with coverage
python test_runner.py --mode coverage

Quality Assurance Features

Performance Regression Testing

# Automated performance baselines
from tests.test_performance_regression_enhanced import PerformanceRegression, PerformanceTest

tester = PerformanceRegression()
test = PerformanceTest(
    model_id="microsoft/DialoGPT-medium",
    tolerance_percent=15.0  # Allow 15% regression
)

results = tester.run_performance_test(test)
if results['regressions']:
    print("Performance regressions detected!")

Model Compatibility Validation

# Automated compatibility testing across devices/precisions
from tests.test_model_compatibility import ModelCompatibilityValidator

validator = ModelCompatibilityValidator()
result = validator.validate_model_compatibility("runwayml/stable-diffusion-v1-5")

if not result['overall_compatible']:
    print(f"Compatibility issues: {result['issues']}")

Enhanced Error Recovery

# Automatic device fallback and retry logic
oe.load(
    "microsoft/DialoGPT-medium",
    device_preference=["NPU", "GPU", "CPU"],
    retry_on_failure=True,
    fallback_device="CPU"
)
# Automatically tries NPU -> GPU -> CPU -> CPU with default config
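The retry behaviour described in that comment can be pictured as a loop over the preference list, followed by one last attempt on the fallback device with default settings. A sketch of the control flow only (load_on is a hypothetical stand-in for compiling a model on one device; this is not the package's code):

```python
def load_with_fallback(load_on, devices=("NPU", "GPU", "CPU"),
                       fallback_device="CPU"):
    """Try each device in order; as a last resort, retry the fallback
    device with default configuration (illustrative control flow)."""
    errors = {}
    for device in devices:
        try:
            return load_on(device)
        except RuntimeError as exc:
            errors[device] = exc  # remember why this device failed
    # Final attempt: fallback device with default configuration
    return load_on(fallback_device, default_config=True)

# Toy loader that only succeeds on CPU:
def toy_loader(device, default_config=False):
    if device != "CPU":
        raise RuntimeError(f"{device} unavailable")
    return f"compiled-on-{device}"

print(load_with_fallback(toy_loader))  # compiled-on-CPU
```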

Test Categories

| Test Type | Command | Purpose |
|---|---|---|
| Unit Tests | pytest tests/ -m "not slow and not integration" | Core functionality |
| Integration Tests | pytest tests/ -m "integration" | Real model loading |
| Performance Tests | pytest tests/ -m "performance" | Regression detection |
| Compatibility Tests | pytest tests/ -m "compatibility" | Device/model validation |
| End-to-End Tests | pytest tests/test_e2e_real_models.py | Full workflows |
| CLI Tests | pytest tests/test_cli*.py | Command-line interface |
| Safety Tests | pytest tests/test_model_management.py | Security validation |

Development Workflow

# Format code
black oe/ tests/
isort oe/ tests/

# Type checking
mypy oe/

# Run all quality checks
python test_runner.py --mode full
pytest tests/test_model_compatibility.py -x
pytest tests/test_performance_regression_enhanced.py -x

📚 Examples

Check out the examples/ directory:

  • Stable Diffusion Notebook: Image generation with automatic optimization
  • Text Generation: Conversational AI with DialoGPT
  • ONNX Models: Loading and running ONNX models
  • Custom Models: Integrating your own models

๐Ÿค Contributing

We welcome contributions! Please see our Contributing Guide for details.

📄 License

Apache License 2.0 - see LICENSE for details.

๐Ÿ™ Acknowledgments

  • Intel OpenVINO Team for the inference engine
  • Hugging Face for the transformers ecosystem
  • ONNX Community for the model format standards
