6 projects
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
ray-cpp
A subpackage of Ray which provides the Ray C++ API.
lotgi
For the love of the game
event-metrics
An embedded, event-time metric collection library
modelzoo
Python package for querying ModelZoo.Live
clipper-admin
Admin commands for the Clipper prediction-serving system