Professional Multimodal AI Engine for Onyx platform
Project description
💎 ONYX AI Gemma 4 Engine (E2B Edition)
A high-performance, easy-to-use FastAPI wrapper for Gemma Multimodal models with built-in 4-bit quantization and streaming support. Developed by ONYX.
🚀 Features
- Zero Config Integration: Deploy a multimodal AI server in seconds.
- Optimized Performance: Native 4-bit quantization using
bitsandbytesfor low VRAM/RAM usage. - Real-time Streaming: Built-in SSE (Server-Sent Events) for smooth, token-by-token generation.
- Hardware Friendly: Optimized for both GPU and high-performance CPU inference.
📦 Installation
You can install the engine directly via pip:
pip install onyx_AI_Gemma4
💻 Python Usage Guide
To use the library in your Python project, simply import the engine and run it. The engine handles model loading, quantization, and server routing automatically.
Python
from onyx_AI_Gemma4 import OnyxEngine
# 1. Initialize the Engine
# It will automatically download and quantize the model on the first run.
engine = OnyxEngine(model_id="google/gemma-4-E2B-it")
# 2. Start the FastAPI Server
# The server will be available at http://localhost:7860
if __name__ == "__main__":
print("🚀 Starting ONYX Engine...")
engine.run(host="0.0.0.0", port=7860)
🛠 API Interaction
Once the server is running, you can interact with it using any HTTP client (like Postman or cURL):
Endpoint: POST /predict
Example Request:
JSON
{
"messages": [
{
"role": "user",
"content": "Explain the importance of AI in modern software engineering."
}
],
"temperature": 0.7,
"max_tokens": 1024
}
🔗 Project Links
Organization: ONYX
Portfolio: Eng. Rawan Jassim
LinkedIn: Professional Profile
© 2026 ONYX. All rights reserved.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
onyx_ai_gemma4-0.1.3.tar.gz
(3.6 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file onyx_ai_gemma4-0.1.3.tar.gz.
File metadata
- Download URL: onyx_ai_gemma4-0.1.3.tar.gz
- Upload date:
- Size: 3.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1f91666ea4f98e84a3d3ccddfc5a9d472c0e5d8efa3a123eb7476fbd8182ccc0
|
|
| MD5 |
a9cca29ce8c208b23d6d7dd9b1b6f681
|
|
| BLAKE2b-256 |
035d7abfe589673c0d0d4d5d78c7c0319a7eee8d825d0a22a9ef20e92a4f6e7d
|
File details
Details for the file onyx_ai_gemma4-0.1.3-py3-none-any.whl.
File metadata
- Download URL: onyx_ai_gemma4-0.1.3-py3-none-any.whl
- Upload date:
- Size: 4.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1d5f2c322aa025d45ebe4c683f7f685c81b5fa6f216320b8a2a43d6ca5538e7c
|
|
| MD5 |
ef47d048f49a80016c1625219a67f147
|
|
| BLAKE2b-256 |
3e65e76c78a0a6ac2bd7a68b9940fd1b7914095e08504fec568f487f3e4dd7f7
|