Skip to main content

SOTA Omni-Modal Personal AI Orchestrator & Engine

Project description

📦 Xorfice: The SOTA Omni-Modal Orchestration Engine

xorfice is the official, high-performance Python package for Xoron-Dev. It is more than just a model wrapper—it is a complete inference and agentic orchestration layer designed for the next era of multimodal AI.

🚀 Installation

Stable version from PyPI:

pip install xorfice

Development version from source:

git clone https://gitlab.com/joeycristini56/Xoron-Dev.git
pip install -e ./xorfice_pkg

🛠️ The SOTA Orchestrator: XoronEngine

The XoronEngine is a heavy-duty, production-ready inference manager that handles the entire lifecycle of the Xoron-Dev model.

🌟 Key Capabilities

  • Automatic Weights Management: Snapshot downloads from HuggingFace Hub with local caching.
  • Multimodal Routing: Single entry point for Text, Image, Video, and Audio.
  • Dynamic Optimization: Auto-tunes hardware affinity based on CUDA, VRAM, and NUMA node detection.

⚙️ Developer Usage

from xorfice import XoronEngine

# Initialize the engine
# Auto-detects hardware and optimizes for max performance
# Correct model slug: Backup-bdg/Xoron-Dev-MultiMoe
engine = XoronEngine(
    model_path="Backup-bdg/Xoron-Dev-MultiMoe",
    max_vram_experts=4, # Offload 4/8 experts to CPU
    device="cuda"
)

# Multimodal Generation with Streaming
# The engine natively handles URLs, local paths, and raw tensors
for token in engine.generate(
    prompt="Explain this video and analyze the speaker's tone.",
    videos="https://example.com/demo.mp4",
    audios="path/to/voice.wav",
    stream=True
):
    print(token, end="", flush=True)

🏗️ Native SOTA Optimizations

Xorfice implements elite-level performance features right out of the box:

1. MoE Expert Offloading

Our custom LRUExpertCache manages Mixture of Experts (MoE) layers dynamically. By keeping only the most frequently used experts in VRAM, we enable 5B+ parameter models to run smoothly on 8GB consumer GPUs.

2. Paged KV Cache

Inspired by vLLM, xorfice uses Paged Attention to manage Key-Value memory. This allows for massively increased throughput and support for thousands of tokens in long-chain reasoning.

3. Integrated Agentic Memory

Xorfice includes a FlatFileMemoryManager that persists user interactions across sessions, allowing Xoron-Dev to "learn" from conversations without full fine-tuning.

4. Zero-Shot Voice Cloning

Using our SOTA VoiceManager, you can clone voices instantly by uploading a short 5-second sample. No retraining required—pure latent-space adaptation.


🎨 Creative Capabilities

Xorfice exposes raw diffusion pipelines through the engine.generate_image() and engine.generate_video() methods, allowing for Text-to-Video (T2V), Image-to-Video (I2V), Text-to-Image (T2I), Image-to-Image (I2I), and Video-to-Video (V2V) workflows.


🤝 Open Source & Contributing

Interested in pushing the boundaries of SOTA AI? Check out our Architecture Deep Dive.

Xorfice: Powering the next generation of omni-modal agents.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

xorfice-0.1.39.tar.gz (261.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

xorfice-0.1.39-py3-none-any.whl (276.6 kB view details)

Uploaded Python 3

File details

Details for the file xorfice-0.1.39.tar.gz.

File metadata

  • Download URL: xorfice-0.1.39.tar.gz
  • Upload date:
  • Size: 261.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for xorfice-0.1.39.tar.gz
Algorithm Hash digest
SHA256 fe9338b235b27dfb1d71123c31fc5b20a39f6d4fc677e1cc96c1a04254b942f0
MD5 625fde7da394049ae64a28f08fc1d38a
BLAKE2b-256 296d5efa8fc140812f5690bd39a556e5c569c5beb598080b3e2b3bc00e9473e8

See more details on using hashes here.

File details

Details for the file xorfice-0.1.39-py3-none-any.whl.

File metadata

  • Download URL: xorfice-0.1.39-py3-none-any.whl
  • Upload date:
  • Size: 276.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for xorfice-0.1.39-py3-none-any.whl
Algorithm Hash digest
SHA256 14101f1db968b62adf9e7e13b6db4b40d68d162042e157aaa0ffa052f9740f7d
MD5 286386c1f03bfc91b327c0e51701775e
BLAKE2b-256 ca19702e0264a3a2258ee826d13ff216ef230589937444c77cbafade742af8ad

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page