Skip to main content

SOTA Omni-Modal Personal AI Orchestrator & Engine

Project description

📦 Xorfice: The SOTA Omni-Modal Orchestration Engine

xorfice is the official, high-performance Python package for Xoron-Dev. It is more than just a model wrapper—it is a complete inference and agentic orchestration layer designed for the next era of multimodal AI.

🚀 Installation

Stable version from PyPI:

pip install xorfice

Development version from source:

git clone https://gitlab.com/joeycristini56/Xoron-Dev.git
pip install -e ./xorfice_pkg

🛠️ The SOTA Orchestrator: XoronEngine

The XoronEngine is a heavy-duty, production-ready inference manager that handles the entire lifecycle of the Xoron-Dev model.

🌟 Key Capabilities

  • Automatic Weights Management: Snapshot downloads from HuggingFace Hub with local caching.
  • Multimodal Routing: Single entry point for Text, Image, Video, and Audio.
  • Dynamic Optimization: Auto-tunes hardware affinity based on CUDA, VRAM, and NUMA node detection.

⚙️ Developer Usage

from xorfice import XoronEngine

# Initialize the engine
# Auto-detects hardware and optimizes for max performance
engine = XoronEngine(
    model_path="Backup-bdg/Xoron-Dev-MultiMoe",
    max_vram_experts=4, # Offload 4/8 experts to CPU
    device="cuda"
)

# Multimodal Generation with Streaming
# The engine natively handles URLs, local paths, and raw tensors
for token in engine.generate(
    prompt="Explain this video and analyze the speaker's tone.",
    videos="https://example.com/demo.mp4",
    audios="path/to/voice.wav",
    stream=True
):
    print(token, end="", flush=True)

🏗️ Native SOTA Optimizations

Xorfice implements elite-level performance features right out of the box:

1. MoE Expert Offloading

Our custom LRUExpertCache manages Mixture of Experts (MoE) layers dynamically. By keeping only the most frequently used experts in VRAM, we enable 5B+ parameter models to run smoothly on 8GB consumer GPUs.

2. Paged KV Cache

Inspired by vLLM, xorfice uses Paged Attention to manage Key-Value memory. This allows for massively increased throughput and support for thousands of tokens in long-chain reasoning.

3. Integrated Agentic Memory

Xorfice includes a FlatFileMemoryManager that persists user interactions across sessions, allowing Xoron-Dev to "learn" from conversations without full fine-tuning.

4. Zero-Shot Voice Cloning

Using our SOTA VoiceManager, you can clone voices instantly by uploading a short 5-second sample. No retraining required—pure latent-space adaptation.


🎨 Creative Capabilities

Xorfice exposes raw diffusion pipelines through the engine.generate_image() and engine.generate_video() methods, allowing for Text-to-Image (T2I), Image-to-Image (I2I), and Video-to-Video (V2V) workflows.


🤝 Open Source & Contributing

Interested in pushing the boundaries of SOTA AI? Check out our Architecture Deep Dive.

Xorfice: Powering the next generation of omni-modal agents.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

xorfice-0.1.38.tar.gz (261.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

xorfice-0.1.38-py3-none-any.whl (276.6 kB view details)

Uploaded Python 3

File details

Details for the file xorfice-0.1.38.tar.gz.

File metadata

  • Download URL: xorfice-0.1.38.tar.gz
  • Upload date:
  • Size: 261.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for xorfice-0.1.38.tar.gz
Algorithm Hash digest
SHA256 b5c00a5686f48c5ed68e2569968739193556ec4f92f2ad719e3bc23bf07642f0
MD5 b93b34914798f697e8fe18cc21762a02
BLAKE2b-256 036bbbf942e63fbc77bddf24995074b6b855cbc8ccca706f02ba09b70a9d5845

See more details on using hashes here.

File details

Details for the file xorfice-0.1.38-py3-none-any.whl.

File metadata

  • Download URL: xorfice-0.1.38-py3-none-any.whl
  • Upload date:
  • Size: 276.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for xorfice-0.1.38-py3-none-any.whl
Algorithm Hash digest
SHA256 40f677f136f68a76c48f878c838c466a10d2653ceb9adb821a9899f623b09f62
MD5 c3cffae8f2cd68be6bc7b6f12ea2c587
BLAKE2b-256 0501c60b4acb3cf7b7e74a1476930eb8f480a0886f1209d09820b0a87308db6c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page