4 projects
rhoshift
Toolkit for managing and upgrading RHOAI (Red Hat OpenShift AI)
rlm-engine
Recursive Language Model - Process unlimited context with any LLM
attention-echo
AttentionEcho: Cross-request attention pattern reuse for LLM inference optimization
dfastllm
High-performance inference engine for Diffusion Language Models - 3x faster with advanced optimizations