Professional Multimodal AI Engine for Onyx platform
Project description
💎 ONYX AI Gemma 4 Engine (E2B Edition)
A high-performance, professional FastAPI wrapper for Gemma Multimodal models with built-in 4-bit quantization and streaming support. Developed by ONYX (RUI Company).
🚀 Features
- Zero Config Integration: Deploy a multimodal AI server in seconds.
- Optimized Performance: Native 4-bit quantization using
bitsandbytesfor low VRAM/RAM usage. - Real-time Streaming: Built-in SSE (Server-Sent Events) for smooth, token-by-token generation.
- Hardware Friendly: Optimized for both GPU and high-performance CPU inference.
📦 Installation
Option 1: Via pip
You can install the engine directly from PyPI: 📦 Installation Option 1: Install via pip
pip install onyx-AI-Gemma4
Option 2: requirements.txt
fastapi
uvicorn
transformers>=4.48.0
torch
accelerate
bitsandbytes
Pillow
torchvision
onyx-AI-Gemma4
💻 Usage ▶ Standard Script
from ONYXAI_Gemma4E2B import OnyxEngine
# Initialize the engine
engine = OnyxEngine(model_id="google/gemma-4-E2B-it")
# Run the server
if __name__ == "__main__":
engine.run(host="0.0.0.0", port=7860)
🌐 Production
from ONYXAI_Gemma4E2B import OnyxEngine
import uvicorn
import os
engine = OnyxEngine(model_id="google/gemma-4-E2B-it")
app = engine.app
@app.get("/")
def home():
return {"message": "ONYX Engine is running!"}
if __name__ == "__main__":
port = int(os.environ.get("PORT", 7860))
uvicorn.run(app, host="0.0.0.0", port=port)
🛠 API Usage Endpoint
POST /predict
Example Request
{
"messages": [
{
"role": "user",
"content": "Explain the importance of AI in modern software engineering."
}
],
"temperature": 0.7,
"max_tokens": 1024
}
🔗 Links Organization: ONYX / RUI Company Author: Eng. Rawan Jassim
© 2026 ONYX. All rights reserved.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file onyx_ai_gemma4-0.1.10.tar.gz.
File metadata
- Download URL: onyx_ai_gemma4-0.1.10.tar.gz
- Upload date:
- Size: 3.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
81eb67786f26ba6738c9ac853d1c65a066410ea7c74163719be01e95e6f568a8
|
|
| MD5 |
849a1485c9d427784d4343637886963c
|
|
| BLAKE2b-256 |
a2c533a8879a7bd311d42fd01d460298b479e4963885a61ed8c14e82828d7ee2
|
File details
Details for the file onyx_ai_gemma4-0.1.10-py3-none-any.whl.
File metadata
- Download URL: onyx_ai_gemma4-0.1.10-py3-none-any.whl
- Upload date:
- Size: 4.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8fc87173b9c80752dd885238ea05cf9f8408fd465acf7c603464b7c2a250cdf5
|
|
| MD5 |
02f2255a31006e1696e4fe92fffb4cf8
|
|
| BLAKE2b-256 |
a29de483d3546a6a0a2cc04e167cc2a508227b41fedc6d04b1c2eddd95c14c29
|