5 projects
xfastertransformer-icx
Boost large language model inference performance on CPU platform.
xfastertransformer
Boost large language model inference performance on CPU platform.
vllm-xft
A high-throughput and memory-efficient inference and serving engine for LLMs
xfastertransformer-devel
Boost large language model inference performance on CPU platform.
xfastertransformer-devel-icx
Boost large language model inference performance on CPU platform.