SOTA Multimodal Inference Engine (S2S, I2I, V2V) for Xoron-Dev.

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

🚀 Xoron-Dev: Unified Multimodal AI Model

Xoron-Dev Logo Version License Python PyTorch

A state-of-the-art multimodal MoE model that unifies text, image, video, and audio understanding and generation.

🏗️ Architecture Overview

Xoron-Dev is built on a modular, mixture-of-experts architecture designed for maximum flexibility and performance.

🧠 LLM Backbone (Mixture of Experts)

12 Layers, 1024d, 16 Heads - Optimized for efficient inference and training.
Aux-Lossless MoE - 8 experts with top-2 routing and configurable shared expert isolation.
Ring Attention - Memory-efficient processing for up to 128K context.
Qwen2.5 Tokenizer - High-density 151K vocabulary for multilingual and code support.

👁️ Vision & Video

SigLIP-2 Encoder - 384px native resolution with multi-scale support (128-512px).
TiTok 1D Tokenization - Compressed visual representation (256 tokens) for faster processing.
VidTok 3D VAE - Efficient spatiotemporal video encoding with 4x8x8 compression.
3D-RoPE & Temporal MoE - Sophisticated motion pattern recognition and spatial awareness.

🎤 Audio System

Raw Waveform Processing - Direct 16kHz audio input/output (no Mel spectrograms required).
Conformer + RMLA - Advanced speech-to-text with KV compression.
BigVGAN Waveform Decoder - High-fidelity direct waveform generation with Snake activation.
Zero-Shot Voice Cloning - Clone voices from short reference clips using speaker embeddings.

🌟 Features

Multimodal Capabilities

Modality	Input	Output	Strategy
Text	128K Context	Reasoning, Code, Agentic	MoE LLM
Image	128-512px	Understanding & SFT	SigLIP + TiTok
Video	8-24 Frames	Understanding	VidTok + 3D-RoPE
Audio	16kHz Waveform	ASR & TTS	Conformer + BigVGAN

Agentic & Tool Calling

250+ Special Tokens for structured agent behaviors.
Native Tool Use: Execute shell commands, Python scripts, and Jupyter notebooks.
Reasoning: Advanced Chain-of-Thought (<|think|>, <|plan|>) for complex tasks.
Safety: Anti-hallucination tokens (<|uncertain|>, <|cite|>) and confidence scores.

Optimization

LoRA Variants: LoRA+, rsLoRA, and DoRA (r=32, α=64).
Lookahead Optimizer: Enhanced stability and faster convergence.
8-bit Optimization: Save up to 75% optimizer memory with bitsandbytes.
Continuous-Scale Training: Adaptive resolution sampling for optimal VRAM usage.

🚀 Installation

# Clone the repository
git clone https://github.com/nigfuapp-web/Xoron-Dev.git
cd Xoron-Dev

# Install dependencies
pip install -r requirements.txt

💻 Usage

Quick Start (Inference)

from load import load_xoron_model

# Load model and tokenizer
model, tokenizer, device, config = load_xoron_model("Backup-bdg/Xoron-Dev-MultiMoe")

# Generate response
output = model.generate_text("Explain quantum entanglement.", tokenizer)
print(output)

CLI Training

The build.py script provides a powerful interface for training and building models.

# Build a new model from scratch
python build.py --build

# Targeted Fine-tuning
python build.py --hf --text --math        # Fine-tune on Math
python build.py --hf --text --agent       # Fine-tune on Agentic tasks
python build.py --hf --video              # Fine-tune on Video datasets
python build.py --hf --voice              # Fine-tune on Audio/Voice

Granular Text Training Flags

Flag	Description
`--math`	Focus on mathematical reasoning and steps.
`--agent`	Tool use, code execution, and system operations.
`--software`	High-quality software engineering and coding.
`--cot`	Chain-of-Thought and logical reasoning.
`--medical`	Medical knowledge and clinical reasoning.
`--hallucination`	Anti-hallucination and truthfulness.

🏋️ Training

Weighted Loss Strategy

The trainer applies specialized weights to ensure high performance on critical tokens:

Reasoning (CoT): 1.5x
Tool Calling: 1.3x
Anti-Hallucination: 1.2x

Continuous-Scale Strategy

Xoron-Dev dynamically samples resolutions during training:

Image: 128px to 384px (step=32)
Video: 8 to 24 frames, 128px to 320px

📦 Export & Quantization

Export your models for efficient deployment:

# Export to GGUF (for llama.cpp)
python build.py --hf --gguf --gguf-quant q4_k_m

# Export to ONNX
python build.py --hf --onnx --quant-bits 4

🤝 Contributing

Contributions are welcome! If you have ideas for new modalities or optimizations, please open an issue or PR.

📄 License

This project is licensed under the MIT License.

Built with ❤️ by the Xoron-Dev Team

Project details

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

0.1.51

Mar 4, 2026

0.1.50

Mar 4, 2026

0.1.49

Mar 4, 2026

0.1.48

Mar 4, 2026

0.1.47

Mar 4, 2026

0.1.46

Mar 4, 2026

0.1.45

Mar 4, 2026

0.1.44

Mar 4, 2026

0.1.43

Mar 4, 2026

0.1.42

Mar 3, 2026

0.1.41

Mar 3, 2026

0.1.40

Mar 3, 2026

0.1.39

Mar 3, 2026

0.1.38

Mar 3, 2026

0.1.37

Mar 3, 2026

0.1.36

Mar 3, 2026

0.1.35

Mar 3, 2026

0.1.34

Mar 3, 2026

0.1.33

Mar 3, 2026

0.1.32

Mar 3, 2026

0.1.31

Mar 3, 2026

0.1.30

Mar 3, 2026

0.1.29

Mar 3, 2026

0.1.28

Mar 3, 2026

0.1.27

Mar 3, 2026

0.1.26

Mar 3, 2026

0.1.25

Mar 3, 2026

0.1.24

Mar 3, 2026

0.1.23

Mar 3, 2026

0.1.22

Mar 2, 2026

0.1.21

Feb 26, 2026

0.1.20

Feb 26, 2026

This version

0.1.19

Feb 26, 2026

0.1.18

Feb 26, 2026

0.1.17

Feb 26, 2026

0.1.16

Feb 26, 2026

0.1.15

Feb 21, 2026

0.1.14

Feb 21, 2026

0.1.13

Feb 21, 2026

0.1.12

Feb 21, 2026

0.1.11

Feb 21, 2026

0.1.10

Feb 21, 2026

0.1.9

Feb 21, 2026

0.1.8

Feb 21, 2026

0.1.7

Feb 21, 2026

0.1.6

Feb 21, 2026

0.1.5

Feb 21, 2026

0.1.4

Feb 21, 2026

0.1.3

Feb 21, 2026

0.1.2

Feb 21, 2026

0.1.1

Feb 21, 2026

0.1.0

Feb 21, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

xorfice-0.1.19.tar.gz (162.6 kB view details)

Uploaded Feb 26, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

xorfice-0.1.19-py3-none-any.whl (175.8 kB view details)

Uploaded Feb 26, 2026 Python 3

File details

Details for the file xorfice-0.1.19.tar.gz.

File metadata

Download URL: xorfice-0.1.19.tar.gz
Upload date: Feb 26, 2026
Size: 162.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for xorfice-0.1.19.tar.gz
Algorithm	Hash digest
SHA256	`16d52e1b96992c7ad9c34eaf10d66fb0dfcfd72791ee3882809332be03023614`
MD5	`82f95bea4afca800db9cc6ac5f355df3`
BLAKE2b-256	`7eba13e4afa7f1456ff623e8f663e0d6789bf2f5f4d4a4aff1c58c47ab2054b4`

See more details on using hashes here.

File details

Details for the file xorfice-0.1.19-py3-none-any.whl.

File metadata

Download URL: xorfice-0.1.19-py3-none-any.whl
Upload date: Feb 26, 2026
Size: 175.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for xorfice-0.1.19-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d1ae630b96744e05d51fc24802c8e494a9feceaa00b2c565453d30b779824625`
MD5	`db48a4cfcfb72b0d33b2d6a09fd9c01c`
BLAKE2b-256	`e383dc1f167e90ada3b1c43a592fad631d1a9488432e767d70563af39685a7ea`

See more details on using hashes here.

xorfice 0.1.19

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

🚀 Xoron-Dev: Unified Multimodal AI Model

🏗️ Architecture Overview

🧠 LLM Backbone (Mixture of Experts)

👁️ Vision & Video

🎤 Audio System

🌟 Features

Multimodal Capabilities

Agentic & Tool Calling

Optimization

🚀 Installation

💻 Usage

Quick Start (Inference)

CLI Training

Granular Text Training Flags

🏋️ Training

Weighted Loss Strategy

Continuous-Scale Strategy

📦 Export & Quantization

🤝 Contributing

📄 License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes