Skip to main content

SOTA Omni-Modal Personal AI Orchestrator & Engine

Project description

📦 Xorfice: The SOTA Omni-Modal Orchestration Engine

xorfice is the official, high-performance Python package for Xoron-Dev. It is more than just a model wrapper—it is a complete inference and agentic orchestration layer designed for the next era of multimodal AI.

🚀 Installation

Stable version from PyPI:

pip install xorfice

Development version from source:

git clone https://gitlab.com/joeycristini56/Xoron-Dev.git
pip install -e ./xorfice_pkg

🛠️ The SOTA Orchestrator: XoronEngine

The XoronEngine is a heavy-duty, production-ready inference manager that handles the entire lifecycle of the Xoron-Dev model.

🌟 Key Capabilities

  • Automatic Weights Management: Snapshot downloads from HuggingFace Hub with local caching.
  • Multimodal Routing: Single entry point for Text, Image, Video, and Audio.
  • Dynamic Optimization: Auto-tunes hardware affinity based on CUDA, VRAM, and NUMA node detection.

⚙️ Developer Usage

from xorfice import XoronEngine

# Initialize the engine
# Auto-detects hardware and optimizes for max performance
engine = XoronEngine(
    model_path="xoron-dev/Xoron-7B",
    max_vram_experts=4, # Offload 4/8 experts to CPU
    device="cuda"
)

# Multimodal Generation with Streaming
# The engine natively handles URLs, local paths, and raw tensors
for token in engine.generate(
    prompt="Explain this video and analyze the speaker's tone.",
    videos="https://example.com/demo.mp4",
    audios="path/to/voice.wav",
    stream=True
):
    print(token, end="", flush=True)

🏗️ Native SOTA Optimizations

Xorfice implements elite-level performance features right out of the box:

1. MoE Expert Offloading

Our custom LRUExpertCache manages Mixture of Experts (MoE) layers dynamically. By keeping only the most frequently used experts in VRAM, we enable 7B+ parameter models to run smoothly on 8GB consumer GPUs.

2. Paged KV Cache

Inspired by vLLM, xorfice uses Paged Attention to manage Key-Value memory. This allows for massively increased throughput and support for thousands of tokens in long-chain reasoning.

3. Integrated Agentic Memory

Xorfice includes a FlatFileMemoryManager that persists user interactions across sessions, allowing Xoron-Dev to "learn" from conversations without full fine-tuning.

4. Zero-Shot Voice Cloning

Using our SOTA VoiceManager, you can clone voices instantly by uploading a short 5-second sample. No retraining required—pure latent-space adaptation.


🎨 Creative Capabilities

Xorfice exposes raw diffusion pipelines through the engine.generate_image() and engine.generate_video() methods, allowing for Text-to-Image (T2I), Image-to-Image (I2I), and Video-to-Video (V2V) workflows.


🤝 Open Source & Contributing

Interested in pushing the boundaries of SOTA AI? Check out our Architecture Deep Dive.

Xorfice: Powering the next generation of omni-modal agents.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

xorfice-0.1.37.tar.gz (261.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

xorfice-0.1.37-py3-none-any.whl (276.6 kB view details)

Uploaded Python 3

File details

Details for the file xorfice-0.1.37.tar.gz.

File metadata

  • Download URL: xorfice-0.1.37.tar.gz
  • Upload date:
  • Size: 261.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for xorfice-0.1.37.tar.gz
Algorithm Hash digest
SHA256 98c23bcddec6fc090149c793ed7d179512d478d823c34b638e7d28001845b21f
MD5 0882856fe714e3615ccd478fceb83745
BLAKE2b-256 6e49e60f9d5886958a8a4030ae1cc8a113c70e233693ac7aa13d4704d2e7bb35

See more details on using hashes here.

File details

Details for the file xorfice-0.1.37-py3-none-any.whl.

File metadata

  • Download URL: xorfice-0.1.37-py3-none-any.whl
  • Upload date:
  • Size: 276.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for xorfice-0.1.37-py3-none-any.whl
Algorithm Hash digest
SHA256 41627e5b4339b8a436cfcff9dc0a32146412c185ceafcb4264c005e671dee941
MD5 cbfd0a3c53f14ff0004b08fc2c59c5cf
BLAKE2b-256 2028ff608344bad1862e104782bbf4687080780e15a9790a7e26dd239c732b46

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page